ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-08 20:42:30 +08:00

Author	SHA1	Message	Date
Kevin Hu	0a42e5777e	Refa: docker/.env comment refinement. (#6689 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-03-31 18:26:20 +08:00
RedBookOfMemory	e2b66628f4	Feat: extend S3 storage compatibility and add knowledge base ID prefix (#6355 ) ### What problem does this PR solve? - Added support for S3-compatible protocols. - Enabled the use of knowledge base ID as a file prefix when storing files in S3. - Updated docker/README.md to include detailed S3 and OSS configuration instructions. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-31 16:09:43 +08:00
Alex Chen	46b5e32cd7	Feat: support vision llm for gpustack (#6636 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/6138 This PR is going to support vision llm for gpustack, modify url path from `/v1-openai` to `/v1` ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-31 15:33:52 +08:00
Kevin Hu	7d9dd1e5d3	Refa: remove default build-in rerank model. (#6682 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-03-31 15:33:19 +08:00
so95	1985ff7918	add type canvas (#6680 ) add type canvas ### Type of change - [x] Refactoring	2025-03-31 14:46:29 +08:00
Kevin Hu	60b9c027c8	Refa: add meta data to retrieval. (#6676 ) ### What problem does this PR solve? #6619 ### Type of change - [x] Performance Improvement	2025-03-31 11:45:56 +08:00
writinwaters	2793c8e4fe	Added a guide on setting page rank. (#6645 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update --------- Co-authored-by: balibabu <cike8899@users.noreply.github.com>	2025-03-31 11:44:18 +08:00
Yingfeng	805a8f1f47	Update broken discord (#6678 ) ### Type of change - [x] Documentation Update	2025-03-31 11:29:34 +08:00
Song Fuchang	d4a3e9a7cc	Fix table migration on non-exist-yet indexed columns. (#6666 ) ### What problem does this PR solve? Fix #6334 Hello, I encountered the same problem as in #6334. `api/db/db_models.py` calls `obj.create_table()` unconditionally in `init_database_tables`, before `migrate_db()`. Specifically, the `permission` field of the `user_canvas` table has `index=True`, which causes `peewee` to issue SQL that tries to create the index while the field does not exist yet (the `user_canvas` table already exists), so `psycopg2.errors.UndefinedColumn: column "permission" does not exist` is raised. I've added a check in the code to call `create_table()` only when the table does not exist, and to delegate the migration process to `migrate_db()`. Then another problem occurs: `migrate_db()` actually does nothing, because it fails on the first migration! `playhouse` blindly issues DDL statements without guards like `IF NOT EXISTS`, so it fails; even though the exception is swallowed with `pass`, the transaction is still rolled back. So I removed the transaction in `migrate_db()` to make it work. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-31 11:27:20 +08:00
Song Fuchang	ad4e59edb2	Don't split and strip input in retrieval component. (#6662 ) ### What problem does this PR solve? Actually fix #6241 Hello, I ran into the same problem as #6241. When I tested my agent flow in the web UI using the `Run` button with a file input, the retrieval component always gave an empty output. In the code I found the following in `web/src/pages/flow/debug-content/index.tsx`: ```tsx const onOk = useCallback(async () => { const values = await form.validateFields(); const nextValues = Object.entries(values).map(([key, value]) => { const item = parameters[Number(key)]; let nextValue = value; if (Array.isArray(value)) { nextValue = ``; value.forEach((x) => { nextValue += x?.originFileObj instanceof File ? `${x.name}\n${x.response?.data}\n----\n` // Here, the file content always ends in '\n' : `${x.url}\n${x.result}\n----\n`; }); } return { ...item, value: nextValue }; }); ok(nextValues); }, [form, ok, parameters]); ``` while in `agent/component/retrieval.py`: ```python def _run(self, history, **kwargs): query = self.get_input() query = str(query["content"][0]) if "content" in query else "" lines = query.split('\n') # inputs are split to ['xxx','yyy','----',''] query = lines[-1] if lines else "" # Here we always get '', thus no result kbs = KnowledgebaseService.get_by_ids(self._param.kb_ids) if not kbs: return Retrieval.be_output("") ``` so the code never gets the correct result. I'm not sure why the input needs such a split here, so I just removed the splitting, and it works well on my side. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-31 11:26:49 +08:00
liu an	aca4cf4369	Test: Added test cases for Retrieval Chunks HTTP API (#6649 ) ### What problem does this PR solve? cover [retrieval chunk](https://ragflow.io/docs/v0.17.2/http_api_reference#retrieve-chunks) endpoints ### Type of change - [x] add test cases	2025-03-31 10:05:35 +08:00
Song Fuchang	9aa047257a	Fix agent completion requiring calling twice with parameters in begin component (#6659 ) ### What problem does this PR solve? Fix #5418 Actually, the fix #4329 also works for agent flows with parameters, so this PR just relaxes the `else` branch of that. With this PR, it works fine on my side, may need more testing to make sure this does not break something. I guess the real problem may be deeply hidden in the code which relates to conversation and canvas execution. After a few hours of debugging, I see the only difference between with and without parameters in `begin` component, is the `history` field of canvas data. When the `begin` component contains some parameters, the debug log shows: ``` 2025-03-29 19:50:38,521 DEBUG 356590 { "component_name": "Begin", "params": {"output_var_name": "output", "message_history_window_size": 22, "query": [{"type": "fileUrls", "key": "fileUrls", "name": "files", "optional": true, "value": "问题.txt\n今天天气怎么样"}], "inputs": [], "debug_inputs": [], "prologue": "你好！我是你的助理，有什么可以帮到你的吗？", "output": null}, "output": null, "inputs": [] }, history: [["user", "请回答我上传文件中的问题。"]], kwargs: {"stream": false} 2025-03-29 19:50:38,523 DEBUG 356590 { "component_name": "Answer", "params": {"output_var_name": "output", "message_history_window_size": 22, "query": [], "inputs": [], "debug_inputs": [], "post_answers": [], "output": null}, "output": null, "inputs": [] }, history: [["user", "请回答我上传文件中的问题。"]], kwargs: {"stream": false} ``` Then it does not go further along the flow.
When the `begin` component does not contain any parameter, the debug log shows: ``` 2025-03-29 19:41:13,518 DEBUG 353596 { "component_name": "Begin", "params": {"output_var_name": "output", "message_history_window_size": 22, "query": [], "inputs": [], "debug_inputs": [], "prologue": "你好！我是你的助理，有什么可以帮到你的吗？", "output": null}, "output": null, "inputs": [] }, history: [], kwargs: {"stream": false} 2025-03-29 19:41:13,520 DEBUG 353596 { "component_name": "Answer", "params": {"output_var_name": "output", "message_history_window_size": 22, "query": [], "inputs": [], "debug_inputs": [], "post_answers": [], "output": null}, "output": null, "inputs": [] }, history: [], kwargs: {"stream": false} 2025-03-29 19:41:13,556 INFO 353596 127.0.0.1 - - [29/Mar/2025 19:41:13] "POST /api/v1/agents/fee6886a0c6f11f09b48eb8798e9aa9b/sessions?user_id=123 HTTP/1.1" 200 - 2025-03-29 19:41:21,115 DEBUG 353596 Canvas.prepare2run: Retrieval:LateGuestsNotice 2025-03-29 19:41:21,116 DEBUG 353596 { "component_name": "Retrieval", "params": {"output_var_name": "output", "message_history_window_size": 22, "query": [], "inputs": [], "debug_inputs": [], "similarity_threshold": 0.2, "keywords_similarity_weight": 0.3, "top_n": 8, "top_k": 1024, "kb_ids": ["9aca3c700c5911f0811caf35658b9385"], "rerank_id": "", "empty_response": "", "tavily_api_key": "", "use_kg": false, "output": null}, "output": null, "inputs": [] }, history: [["user", "请回答我上传文件中的问题。"]], kwargs: {"stream": false} ``` It correctly goes along the flow and generates correct answer. You can see the difference: when the `begin` component has any parameter, the `history` field is filled from the beginning, while it is just `[]` if the `begin` component has no parameter. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-31 09:57:56 +08:00
Zhichang Yu	65a8cd1772	Fix knowledge_graph_kwd on infinity. Close #6476 and #6624 (#6651 ) ### What problem does this PR solve? Fix knowledge_graph_kwd on infinity. Close #6476 and #6624 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-28 22:05:40 +08:00
Kevin Hu	563a84beaf	Docs: fix retrieval docs. (#6633 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-28 16:03:37 +08:00
Zhichang Yu	d32a35d8fd	Fix entity_types. Close #6287 and #6608 (#6632 ) ### What problem does this PR solve? Fix entity_types. Close #6287 and #6608 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-28 15:00:24 +08:00
Wanderson Pinto dos Santos	2632493c8b	Consolidate entrypoint to support broader deployment scenarios (#6566 ) ### What problem does this PR solve? This PR gives better control over how we distribute which service will be loaded. With this approach, we can create containers to run only the web server and others to run the task executor. It also introduces the unique ID per task executor host, this will be important when scaling task executors horizontally, considering unique task executor ids will be required. This new `entrypoint.sh` maintains the default behavior of starting the web server and task executor in the same host. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [X] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-28 12:39:34 +08:00
Marcus Yuan	c61df5dd25	Dynamic Context Window Size for Ollama Chat (#6582 ) # Dynamic Context Window Size for Ollama Chat ## Problem Statement Previously, the Ollama chat implementation used a fixed context window size of 32768 tokens. This caused two main issues: 1. Performance degradation due to unnecessarily large context windows for small conversations 2. Potential business logic failures when using smaller fixed sizes (e.g., 2048 tokens) ## Solution Implemented a dynamic context window size calculation that: 1. Uses a base context size of 8192 tokens 2. Applies a 1.2x buffer ratio to the total token count 3. Adds multiples of 8192 tokens based on the buffered token count 4. Implements a smart context size update strategy ## Implementation Details ### Token Counting Logic ```python def count_tokens(text): """Calculate token count for text""" # Simple calculation: 1 token per ASCII character # 2 tokens for non-ASCII characters (Chinese, Japanese, Korean, etc.) total = 0 for char in text: if ord(char) < 128: # ASCII characters total += 1 else: # Non-ASCII characters total += 2 return total ``` ### Dynamic Context Calculation ```python def _calculate_dynamic_ctx(self, history): """Calculate dynamic context window size""" # Calculate total tokens for all messages total_tokens = 0 for message in history: content = message.get("content", "") content_tokens = count_tokens(content) role_tokens = 4 # Role marker token overhead total_tokens += content_tokens + role_tokens # Apply 1.2x buffer ratio total_tokens_with_buffer = int(total_tokens * 1.2) # Calculate context size in multiples of 8192 if total_tokens_with_buffer <= 8192: ctx_size = 8192 else: ctx_multiplier = (total_tokens_with_buffer // 8192) + 1 ctx_size = ctx_multiplier * 8192 return ctx_size ``` ### Integration in Chat Method ```python def chat(self, system, history, gen_conf): if system: history.insert(0, {"role": "system", "content": system}) if "max_tokens" in gen_conf: del gen_conf["max_tokens"] try: # Calculate new context size new_ctx_size = self._calculate_dynamic_ctx(history) # Prepare options with context size options = { "num_ctx": new_ctx_size } # Add other generation options if "temperature" in gen_conf: options["temperature"] = gen_conf["temperature"] if "max_tokens" in gen_conf: options["num_predict"] = gen_conf["max_tokens"] if "top_p" in gen_conf: options["top_p"] = gen_conf["top_p"] if "presence_penalty" in gen_conf: options["presence_penalty"] = gen_conf["presence_penalty"] if "frequency_penalty" in gen_conf: options["frequency_penalty"] = gen_conf["frequency_penalty"] # Make API call with dynamic context size response = self.client.chat( model=self.model_name, messages=history, options=options, keep_alive=60 ) return response["message"]["content"].strip(), response.get("eval_count", 0) + response.get("prompt_eval_count", 0) except Exception as e: return "ERROR: " + str(e), 0 ``` ## Benefits 1. Improved Performance: Uses appropriate context windows based on conversation length 2. Better Resource Utilization: Context window size scales with content 3. Maintained Compatibility: Works with existing business logic 4. Predictable Scaling: Context growth in 8192-token increments 5. Smart Updates: Context size updates are optimized to reduce unnecessary model reloads ## Future Considerations 1. Fine-tune buffer ratio based on usage patterns 2. Add monitoring for context window utilization 3. Consider language-specific token counting optimizations 4. Implement adaptive threshold based on conversation patterns 5. Add metrics for context size update frequency --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-03-28 12:38:27 +08:00
Kevin Hu	1fbc4870f0	Fix: HTTP API delete_chunks issue. (#6621 ) ### What problem does this PR solve? #6611 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-28 12:13:43 +08:00
AdySec	f304492716	Fix: binlog_expire_logs_seconds (#6626 ) This PR updates the MySQL container configuration by setting the parameter --binlog_expire_logs_seconds to 604800 seconds (7 days). This change ensures that MySQL automatically purges binary logs older than 7 days, helping to conserve disk space and maintain precise log management. ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-28 11:37:53 +08:00
balibabu	f35c226ce7	Feat: Add RadioGroup component #3221 (#6622 ) ### What problem does this PR solve? Feat: Add RadioGroup component #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-28 10:20:49 +08:00
donblack01	0b48a2e0d1	Fix: When Excel is a formula, the parsed result is a formula, but cannot be correctly parsed as a value type (#6613 ) ### What problem does this PR solve? Fix: When Excel is a formula, the parsed result is a formula, but cannot be correctly parsed as a value type ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: tangyu <1@1.com>	2025-03-28 09:33:49 +08:00
liu an	fd614a7aef	Test: Added test cases for Delete Chunks HTTP API (#6612 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] add test cases	2025-03-28 09:33:23 +08:00
Kevin Hu	0758c04941	Refa: token similarity calculations. (#6614 ) ### What problem does this PR solve? #6507 ### Type of change - [x] Performance Improvement	2025-03-28 09:33:08 +08:00
Zhichang Yu	fe0396bbb9	Introduced delete_knowledge_graph (#6605 ) ### What problem does this PR solve? Introduced delete_knowledge_graph ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] Documentation Update	2025-03-27 17:16:48 +08:00
Xc1995	974a467cf6	Fix: The rule of Categorize operator is adjusted. (#6599 ) ### What problem does this PR solve? When I use the Categorize operator and a keyword I want to categorize by appears repeatedly in the input, the operator cannot determine which word appears most frequently. Instead, it simply returns every word that matches. I have made the following changes to the categorize filter. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring - [x] Performance Improvement	2025-03-27 17:02:21 +08:00
Zhichang Yu	36b62e0fab	EntityResolution batch. Close #6570 (#6602 ) ### What problem does this PR solve? EntityResolution batch ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 16:40:36 +08:00
Kevin Hu	d2043ff9f2	Fix: LmStudioChat issue. (#6591 ) ### What problem does this PR solve? #6577 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 14:59:15 +08:00
Kevin Hu	ecc9605a32	Fix: team doc deletion issue. (#6589 ) ### What problem does this PR solve? #6557 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 13:26:38 +08:00
balibabu	70dc56d26b	Feat: Add logo-with-text-white.svg #3221 (#6588 ) ### What problem does this PR solve? Feat: Add logo-with-text-white.svg #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-27 12:28:17 +08:00
Zanyatta	82ccbd2cba	fix: Remove unnecessary minio initialization (#6544 ) ### What problem does this PR solve? Prevent applications from failing to start due to calling non-existent or incorrect Minio connection configurations when using file storage outside of Minio ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-03-27 09:54:25 +08:00
Zhichang Yu	c4998d0e09	Rename graphrag task lock (#6576 ) ### What problem does this PR solve? Rename graphrag task lock ### Type of change - [x] Refactoring	2025-03-26 23:48:47 +08:00
Fengbo Yuan	5eabfe3912	Update values.yaml image to infiniflow/infinity:v0.6.0-dev3 issue#5882 (#6568 ) related issue #5882 ### What problem does this PR solve? update helm infinity image version from v0.5.0 image to infiniflow/infinity:v0.6.0-dev3 to solve issue #5882 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-26 21:15:26 +08:00
Yongteng Lei	df3890827d	Refa: change LLM chat output from full to delta (incremental) (#6534 ) ### What problem does this PR solve? Change LLM chat output from full to delta (incremental) ### Type of change - [x] Refactoring	2025-03-26 19:33:14 +08:00
liu an	6599db1e99	Test: Update test cases for PR #6405 #6504 #6538 (#6565 ) ### What problem does this PR solve? PR #6405 #6504 #6538 ### Type of change - [x] update test cases	2025-03-26 19:23:13 +08:00
writinwaters	b7d7ad536a	AI search vs. chat (#6569 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-26 18:46:34 +08:00
andy	24d8ff7425	Fix:flow DB Assistant module translate to zh (#6562 ) ### What problem does this PR solve? Fix:flow DB Assistant module translate to zh ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-26 17:32:05 +08:00
Chenzy	735d9dd949	Feat: add "tools" to llm_factories.json (#6552 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Chenzy <chenzy901@gmail.com>	2025-03-26 17:31:18 +08:00
zstar	cc5f4a5efa	Fix: python_api_reference.md update dataset bug (#6527 ) ### What problem does this PR solve? There is a small bug in the update-dataset example of this document. `rag_object.list_datasets` returns a list, so the first item should be taken as the `ragflow_sdk.modules.dataset.DataSet` on which to call the update. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 17:30:09 +08:00
liu an	93c26ae1ef	Test: Added test cases for Update Chunk HTTP API (#6556 ) ### What problem does this PR solve? cover [update chunk](https://ragflow.io/docs/v0.17.2/http_api_reference#update-chunk) endpoints ### Type of change - [x] add test cases	2025-03-26 16:47:47 +08:00
Kevin Hu	cc8029a732	Fix: uploading in chat box issue. (#6547 ) ### What problem does this PR solve? #6228 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 15:37:48 +08:00
Zhichang Yu	6bf26e2a81	Optimize graphrag again (#6513 ) ### What problem does this PR solve? Removed set_entity and set_relation to avoid accessing doc engine during graph computation. Introduced GraphChange to avoid writing unchanged chunks. ### Type of change - [x] Performance Improvement	2025-03-26 15:34:42 +08:00
Kevin Hu	7a677cb095	Fix: image_id is None. (#6538 ) ### What problem does this PR solve? #6499 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 12:04:21 +08:00
Kevin Hu	12ad746ee6	Fix: Bedrock model invocation error. (#6533 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 11:27:12 +08:00
Kevin Hu	163e71d06f	Fix: Hunyuan model adding error. (#6531 ) ### What problem does this PR solve? #6523 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 10:33:33 +08:00
Kevin Hu	c8c91fd827	Fix: link to KB from filemanager. (#6530 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 09:41:14 +08:00
writinwaters	d17970ebd0	0321 chunkmethods (#6520 ) ### What problem does this PR solve? #6061 ### Type of change - [x] Documentation Update	2025-03-26 09:03:18 +08:00
Kevin Hu	bf483fdf02	Fix: describe parameter error. (#6519 ) ### What problem does this PR solve? #6228 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 09:02:48 +08:00
Kevin Hu	b2b7ed8927	Fix: abnormal chunk id (#6506 ) ### What problem does this PR solve? #6500 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-25 19:03:29 +08:00
liu an	0a79dfd5cf	Test: Added test cases for List Chunks HTTP API (#6514 ) ### What problem does this PR solve? cover [list chunks](https://ragflow.io/docs/v0.17.2/http_api_reference#list-chunks) endpoints ### Type of change - [x] update test cases	2025-03-25 17:28:58 +08:00
Stephen Hu	1d73baf3d8	Feat: improve '/mv' '/list' API performance (#6502 ) ### What problem does this PR solve? 1. for /mv API use get by ids to avoid O(n) DB IO 2. for /list remove one useless call ### Type of change - [x] Performance Improvement	2025-03-25 16:30:25 +08:00
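
The empty-query behaviour described in ad4e59edb2 (#6662) can be reproduced in isolation. This is a minimal sketch, not RAGFlow code: `build_input` and `last_line` are hypothetical helpers that only mimic the string the web UI builds and the pre-fix splitting logic quoted in that commit message.

```python
def build_input(files):
    """Mimic the web UI: each file adds its name, its content, and a '----'
    separator, and every part ends with a newline."""
    return "".join(f"{name}\n{content}\n----\n" for name, content in files)


def last_line(query):
    """Mimic the pre-fix logic: split on newlines and keep the final element."""
    lines = query.split("\n")
    return lines[-1] if lines else ""


query = build_input([("question.txt", "What is the weather today?")])

# The trailing newline makes the last element of the split an empty string.
print(query.split("\n"))
# The pre-fix logic therefore always selects an empty query.
print(repr(last_line(query)))
```

Because the input always ends in a newline, the final element of the split is always empty, which matches the commit's observation that removing the split restores retrieval results.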


2805 Commits
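
The context-size rule described in c61df5dd25 (#6582) is easier to check with concrete numbers. The sketch below restates the two functions quoted in that commit message as standalone code; the name `calculate_dynamic_ctx` and the `base` and `buffer_ratio` parameters are illustrative assumptions, and the character-based token count is the commit's own approximation, not a real tokenizer.

```python
def count_tokens(text):
    """Approximate tokens: 1 per ASCII character, 2 per non-ASCII character."""
    return sum(1 if ord(ch) < 128 else 2 for ch in text)


def calculate_dynamic_ctx(history, base=8192, buffer_ratio=1.2):
    """Round the buffered token estimate up to a multiple of `base`."""
    role_overhead = 4  # per-message overhead used in the commit message
    total = sum(count_tokens(m.get("content", "")) + role_overhead for m in history)
    buffered = int(total * buffer_ratio)
    if buffered <= base:
        return base
    return (buffered // base + 1) * base


short_chat = [{"role": "user", "content": "hello"}]
long_chat = [{"role": "user", "content": "a" * 7000}]

print(count_tokens("你好"))               # 4: two non-ASCII characters
print(calculate_dynamic_ctx(short_chat))  # 8192: (5 + 4) * 1.2 stays under the base
print(calculate_dynamic_ctx(long_chat))   # 16384: int(7004 * 1.2) = 8404 exceeds 8192
```

Note that a buffered estimate that is an exact multiple of 8192 above the base (for example 16384) is still bumped to the next multiple under this rule, so the window is never smaller than the estimate.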