ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-01-25 12:46:38 +08:00

Author	SHA1	Message	Date
so95	fefea3a2a5	Fixed OpenAI compatibility stream [DONE] (#5389 ) Fixed OpenAI compatibility stream [DONE] - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-26 17:55:12 +08:00
Yongteng Lei	0e920a91dd	FIX: correct typo (#5387 ) ### What problem does this PR solve? Correct typo in supported_models file ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 17:21:09 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Ready to add VLLM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
k	5cab6c4ccb	Fix: HTTP API -> Stop parsing documents (AttributeError: 'list' object … (#5375 ) …has no attribute 'id') ### What problem does this PR solve? No PR ![image](https://github.com/user-attachments/assets/988d31bc-6551-4bb8-846c-cbbc1883d804) ![image](https://github.com/user-attachments/assets/8b09681b-1239-4ed9-8bc3-11436c5e90bc) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-02-26 15:57:50 +08:00
Yongteng Lei	b3b341173f	DOCS: add OpenAI-compatible http and python api reference (#5374 ) ### What problem does this PR solve? Add OpenAI-compatible http and python api reference ### Type of change - [x] Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com> Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com>	2025-02-26 15:52:26 +08:00
liwenju0	a9e4695b74	Fix: validate knowledge base association before document upload (#5373 ) ### What problem does this PR solve? Fixes this bug: https://github.com/infiniflow/ragflow/issues/5368 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-26 15:47:34 +08:00
Kevin Hu	4f40f685d9	Code refactor (#5371 ) ### What problem does this PR solve? #5173 ### Type of change - [x] Refactoring	2025-02-26 15:40:52 +08:00
Yongteng Lei	5c6a7cb4b8	Added OpenAI-like completion api (#5351 ) ### What problem does this PR solve? Added OpenAI-like completion api, related to #4672, #4705 This function allows users to interact with a model to get responses based on a series of messages. If `stream` is set to True, the response is streamed in chunks, mimicking the OpenAI-style API. #### Example usage:
```bash
curl -X POST "https://ragflow_address.com/api/v1/chats_openai/<chat_id>/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $RAGFLOW_API_KEY" \
  -d '{
    "model": "model",
    "messages": [{"role": "user", "content": "Say this is a test!"}],
    "stream": true
  }'
```
Alternatively, you can use Python's `OpenAI` client:
```python
from openai import OpenAI

model = "model"
stream = True  # set to False to receive one complete response instead of chunks

client = OpenAI(
    api_key="ragflow-api-key",
    base_url="http://ragflow_address/api/v1/chats_openai/<chat_id>",
)

completion = client.chat.completions.create(
    model=model,
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Who are you?"},
        {"role": "assistant", "content": "I am an AI assistant named..."},
        {"role": "user", "content": "Can you tell me how to install neovim"},
    ],
    stream=stream,
)

if stream:
    # Streaming mode: the client yields chunks as they arrive.
    for chunk in completion:
        print(chunk)
else:
    print(completion.choices[0].message.content)
```
### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Related Issues Related to #4672, #4705	2025-02-26 11:37:29 +08:00
Kevin Hu	4e2afcd3b8	Fix FlagRerank max_length issue. (#5366 ) ### What problem does this PR solve? #5352 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 11:01:13 +08:00
Zhenglin Dong	11e6d84d46	Fix: 'Chunk not found!' error in team-sharing knowledge base. (#5361 ) ### What problem does this PR solve? As issue #3268 mentioned, a "Chunk not found!" exception can occur, especially when a team works on shared knowledge bases. ### The reason for this bug "tenants" are the people on current_user's team, including the team owner. The old code only checks the first tenant, tenants[0], which causes an error when anyone edits a chunk that is not in tenants[0]'s knowledge base. This modification iterates over all tenants and retrieves the knowledge bases of each, without introducing new errors. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 10:24:35 +08:00
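The lookup change described in 11e6d84d46 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's code: the function name `find_kb_owner` and the dict-shaped tenant records with `tenant_id` and `kb_ids` keys are hypothetical stand-ins for whatever the service layer actually returns.

```python
from typing import Optional


def find_kb_owner(tenants: list[dict], kb_id: str) -> Optional[str]:
    """Return the id of the tenant that owns kb_id, or None if nobody does.

    Hypothetical helper: checks every tenant on the team instead of
    looking only at tenants[0].
    """
    for tenant in tenants:
        if kb_id in tenant["kb_ids"]:
            return tenant["tenant_id"]
    return None


# Hypothetical team: the owner plus one teammate, each with one knowledge base.
tenants = [
    {"tenant_id": "owner", "kb_ids": ["kb-1"]},
    {"tenant_id": "teammate", "kb_ids": ["kb-2"]},
]

# A check limited to tenants[0] would miss "kb-2"; the loop finds it.
print(find_kb_owner(tenants, "kb-2"))
```

Returning `None` rather than raising leaves it to the caller to decide whether a missing knowledge base should surface as "Chunk not found!".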
Kevin Hu	53b9e7b52f	Add tavily as web search tool. (#5349 ) ### What problem does this PR solve? #5198 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 10:21:04 +08:00
Kevin Hu	b3d579e2c1	Refine prompt of agentic search. (#5312 ) ### What problem does this PR solve? #5173 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-25 09:21:52 +08:00
Kevin Hu	9aa222f738	Let list_chat go without kb checking. (#5280 ) ### What problem does this PR solve? #5278 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-24 13:21:05 +08:00
Kevin Hu	605cfdb8dc	Refine error message for re-rank model. (#5278 ) ### What problem does this PR solve? #5261 ### Type of change - [x] Refactoring	2025-02-24 13:01:34 +08:00
Omar Leonardo Sanchez Granados	a0b461a18e	Add configuration to choose default llm models (#5245 ) ### What problem does this PR solve? This pull request changes the `api/settings.py` and `docker/service_conf.yaml.template` files to add support for default models in the LLM configuration (especially for LIGHTEN builds). The most important changes are adding default model configurations and updating the initialization settings to use these defaults. For example, with this configuration Bedrock is enabled by default with Claude and Titan embeddings:
```
user_default_llm:
  factory: 'Bedrock'
  api_key: '{}'
  base_url: ''
  default_models:
    chat_model: 'anthropic.claude-3-5-sonnet-20240620-v1:0'
    embedding_model: 'amazon.titan-embed-text-v2:0'
    rerank_model: ''
    asr_model: ''
    image2text_model: ''
```
### Type of change - [X] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:13:39 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
Kevin Hu	ef8847eda7	Double check error of adding llm. (#5237 ) ### What problem does this PR solve? #5227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 19:09:49 +08:00
Kevin Hu	3444cb15e3	Refine search query. (#5235 ) ### What problem does this PR solve? #5173 #5214 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 18:32:32 +08:00
Kevin Hu	f5d63bb7df	Support chat solo. (#5218 ) ### What problem does this PR solve? #5216 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-21 12:24:02 +08:00
Kevin Hu	7b3d700d5f	Apply agentic searching. (#5196 ) ### What problem does this PR solve? #5173 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-20 17:41:01 +08:00
liwenju0	f298e55ded	Fix: Normalize embedding model ID comparison across datasets (#5169 ) Modify embedding model ID comparison to remove vendor suffixes, ensuring consistent model identification when working with multiple knowledge bases. This change affects dialog creation, chat operations, and document retrieval test functions. ### What problem does this PR solve? resolve this bug: https://github.com/infiniflow/ragflow/issues/5166 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-20 12:40:59 +08:00
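The comparison described in f298e55ded can be sketched roughly as follows. This is a hedged illustration, not the project's implementation: the helper names are hypothetical, and the `name@vendor` ID shape with `@` as the separator is an assumption made for the example.

```python
def base_model_id(model_id: str, sep: str = "@") -> str:
    """Strip a trailing vendor suffix from a model ID.

    Assumes (for illustration only) that IDs look like "name@vendor".
    IDs without the separator are returned unchanged.
    """
    head, found, _ = model_id.rpartition(sep)
    return head if found else model_id


def same_embedding_model(model_ids: list[str]) -> bool:
    """True when all IDs refer to the same model once suffixes are removed."""
    return len({base_model_id(m) for m in model_ids}) <= 1


# Two knowledge bases that use the same model, one recorded with a suffix.
print(same_embedding_model(["bge-m3@VendorA", "bge-m3"]))
```

Comparing the normalized IDs means two knowledge bases are treated as compatible when they differ only in how the vendor was recorded.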
liwenju0	3ced290eb5	Feat: Add support for document meta fields update through api (#5120 ) ### What problem does this PR solve? add support for update document meta data through api ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: wenju.li <wenju.li@deepctr.cn> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-19 13:39:31 +08:00
petertc	8525f55ad0	Fix: Option ineffective in Chat API (#5118 ) ### What problem does this PR solve? API options like `stream` were ignored when no session_id was provided. This PR fixes the issue. Test command and expected result:
```
curl --request POST \
  --url http://:9222/api/v1/chats/2f2e1d30ee6111efafe211749b004925/completions \
  --header 'Content-Type: application/json' \
  --header 'Authorization: Bearer ragflow-xxx' \
  --data '{ "question":"Who are you", "stream":false }'

{"code":0,"data":"data:{\"code\": 0, \"message\": \"\", \"data\": {\"answer\": \"Hi! I'm your assistant, what can I do for you?\", \"reference\": {}, \"audio_binary\": null, \"id\": null, \"session_id\": \"82ceb0fcee7111efafe211749b004925\"}}\n\n"}
```
### Type of change - [*] Bug Fix (non-breaking change which fixes an issue)	2025-02-19 13:18:51 +08:00
zhxlp	00c7ddbc9b	Fix: The max tokens defined by the tenant are not used (#4297 ) (#2817 ) (#5066 ) ### What problem does this PR solve? Fix: The max tokens defined by the tenant are not used (#4297) (#2817) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-18 13:42:22 +08:00
Kevin Hu	84b4b38cbb	Remove <think> for exeSql component. (#5069 ) ### What problem does this PR solve? #5061 #5067 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-18 13:39:37 +08:00
flygithub	409310aae9	Update agent session API to support uploading files while creating a new session (#5039 ) ### What problem does this PR solve? Update the agent session API "POST /api/v1/agents/{agent_id}/sessions" to support uploading files while creating a new session: - Currently, the API only accepts requests with a JSON body. If a user wants to upload a document or image when creating a session, as the web client already supports, the API needs to be updated. - If an image is uploaded, RAGFlow calls image2text, and the image2text model needs a user_id, so user_id must be sent in the API request. Because uploading files requires form-data rather than a JSON body, it seems user_id has to go in the URL as an optional parameter (currently user_id is optional in the JSON body). ### Type of change - [x] Documentation Update - [x] Other (please describe):	2025-02-18 09:45:40 +08:00
Kevin Hu	9ff825f39d	Ignore exceptions when no index ahead. (#5047 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-18 09:09:22 +08:00
hy89	7b5d831296	Fix: Starting the source code on Windows, the 'HTTP API' returns 404 (#5042 ) Fix: When starting the backend service from source code on Windows, the "HTTP API" no longer returns 404.	2025-02-17 19:33:49 +08:00
Kevin Hu	e4096fbc33	Add another decrypt function. (#5043 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-17 18:09:11 +08:00
Kevin Hu	3aa5c2a699	Ignore exception of empty index. (#5030 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-17 15:59:55 +08:00
kuschzzp	88daa349f9	Optimize conversation when uploading attachments (#4964 ) ### What problem does this PR solve? #4929 ### Type of change - [x] Performance Improvement	2025-02-17 12:03:04 +08:00
zhxlp	194e8ea696	Fix knowledge graph node not found (#4968 ) (#4970 ) ### What problem does this PR solve? Fix knowledge graph node not found (#4968) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-17 11:49:27 +08:00
Kevin Hu	810f997276	Fix <think> in keywords or question auto-generations. (#5021 ) ### What problem does this PR solve? #4983 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-17 11:20:57 +08:00
Kevin Hu	849d9eb463	Ignore tenant not found error while increasing token usage. (#4950 ) ### What problem does this PR solve? #4940 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-14 11:10:49 +08:00
Peterson Alves	042f4c90c6	Fixes KeyError: 'content' when using stream=False (#4944 ) ### 🛠 Fixes `KeyError: 'content'` when using `stream=False` #### 🔍 Problem When calling the chat API with `stream=False`, the code attempts to access `msg[-1]["content"]` without verifying if the key exists. This causes a `KeyError` when the message structure does not contain `"content"`. This issue was discussed in [#4885](https://github.com/infiniflow/ragflow/issues/4885), where we analyzed the root cause. The error does not occur with `stream=True`, as the response is processed differently. #### ✅ Solution - Logging Fix: - Before accessing `msg[-1]["content"]`, we check if the key exists. - If it does not exist, a default value (`"[content not available]"`) is used to prevent errors. - Structural Fix in `msg` Construction: - Ensured that every message in `msg` contains the `"content"` key, even if empty. - This fixes the issue at its root and ensures consistent behavior between `stream=True` and `stream=False`. #### 🔄 Impact - Prevents the `KeyError` without affecting normal application flow. - Ensures the integrity of the `msg` structure, avoiding future failures. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-14 10:27:01 +08:00
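The two fixes described in 042f4c90c6 can be sketched roughly as follows. This is a minimal sketch, not the code from the PR: the helper names `last_content` and `normalize_messages` are hypothetical, and only the `msg[-1]["content"]` access and the `"[content not available]"` default come from the commit message.

```python
PLACEHOLDER = "[content not available]"  # default value named in the commit message


def last_content(msg: list[dict]) -> str:
    """Read the last message's content without raising KeyError.

    Hypothetical helper for the logging fix: falls back to a placeholder
    when the list is empty or the key is missing.
    """
    if not msg:
        return PLACEHOLDER
    return msg[-1].get("content", PLACEHOLDER)


def normalize_messages(msg: list[dict]) -> list[dict]:
    """Hypothetical helper for the structural fix: every message gets a
    "content" key, using an empty string when none was provided."""
    return [{**m, "content": m.get("content", "")} for m in msg]


messages = [{"role": "user", "content": "Hello"}, {"role": "assistant"}]
print(last_content(messages))
print(normalize_messages(messages)[-1])
```

The first helper only guards the read; the second repairs the data itself, which is what keeps `stream=True` and `stream=False` consistent.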
Kevin Hu	78982d88e0	Reformat error message. (#4829 ) ### What problem does this PR solve? #4828 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-10 16:47:53 +08:00
Kevin Hu	6fa34d5532	Fix KG circle. (#4823 ) ### What problem does this PR solve? #4760 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-10 11:02:29 +08:00
davidche	588207d7c1	optimize TenantLLMService.increase_usage for "can't update token usag… (#4755 ) …e error " message ### What problem does this PR solve? optimize TenantLLMService.increase_usage Performance ### Type of change - [x] Performance Improvement Co-authored-by: che_shuai <che_shuai@massclouds.com>	2025-02-07 12:16:17 +08:00
Kevin Hu	4b9c4c0705	Update deepseek model provider info. (#4714 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-05 13:43:40 +08:00
Kevin Hu	c354239b79	Make infinity adapt to condition `exist`. (#4657 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-26 18:45:36 +08:00
Kevin Hu	898ae7fa80	Fix misplacement of vector sim weight and token sim weight. (#4627 ) ### What problem does this PR solve? #4610 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-24 12:41:06 +08:00
Kevin Hu	e9ccba0395	Add timestamp to messages (#4624 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-24 11:07:55 +08:00
Kevin Hu	e14d6ae441	Refactor. (#4612 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-01-23 18:56:02 +08:00
Kevin Hu	86892959a0	Rebuild graph when it's out of time. (#4607 ) ### What problem does this PR solve? #4543 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2025-01-23 17:26:20 +08:00
Zhichang Yu	240e7d7c22	Unified user_service.py (#4606 ) ### What problem does this PR solve? Unified user_service.py ### Type of change - [x] Refactoring	2025-01-23 15:49:21 +08:00
Kevin Hu	c4b9e903c8	Fix index not found for new user. (#4597 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-23 11:45:22 +08:00
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585 ) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-22 19:43:14 +08:00
Jin Hai	3894de895b	Update comments (#4569 ) ### What problem does this PR solve? Add license statement. ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-01-21 20:52:28 +08:00
Kevin Hu	f4d084bcf1	Fix doc progress issue. (#4520 ) ### What problem does this PR solve? #4516 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-17 18:28:15 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00

648 Commits