### What problem does this PR solve?
The OpenAI-compatible chat endpoint
(`/chats_openai/<chat_id>/chat/completions`) was not returning accurate
token
usage in streaming responses. The token counts were either missing or
inaccurate because the underlying LLM API
responses weren't being properly parsed for usage data.
This PR adds proper token counting using tiktoken (cl100k_base encoding)
as a fallback when the LLM API doesn't provide usage data in streaming
chunks. This ensures clients always receive token usage information in
the
response, which is essential for billing and quota management.
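Below is a minimal sketch of the fallback, assuming Python with the tiktoken package; the helper names are illustrative rather than the actual endpoint code:

```python
import tiktoken

# Fallback token counting with the cl100k_base encoding (illustrative helper,
# not the real endpoint implementation).
_ENCODER = tiktoken.get_encoding("cl100k_base")

def count_tokens(text: str) -> int:
    """Number of cl100k_base tokens in `text`."""
    return len(_ENCODER.encode(text))

def build_usage(prompt: str, completion: str, api_usage: dict | None) -> dict:
    """Prefer the usage reported by the LLM API; otherwise estimate with tiktoken."""
    if api_usage:
        return api_usage
    prompt_tokens = count_tokens(prompt)
    completion_tokens = count_tokens(completion)
    return {
        "prompt_tokens": prompt_tokens,
        "completion_tokens": completion_tokens,
        "total_tokens": prompt_tokens + completion_tokens,
    }
```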
**Changes:**
- Add tiktoken-based token counting for streaming responses in
OpenAI-compatible endpoint
- Ensure `usage` field is always populated in the final streaming chunk
- Add unit tests for token usage calculation
Fixes #7850
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Feat: Add a web search button to the chat box on the chat page.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
**Backend**
\rag\nlp\search.py
*Before the fix*
The `top_k` parameter was not applied to limit the total number of chunks, and the rerank model operated on the entire `valid_idx` instead of first truncating it with `valid_idx = valid_idx[:top]`.
*After the fix*
The `top_k` limit is now applied to the total results before pagination, using a default of `top = 1024` when `top_k` is not modified.
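A rough sketch of the intended behavior (variable and parameter names are assumptions, not the actual search.py code):

```python
# Cap the candidate chunks at top_k before rerank and pagination (illustrative only).
def apply_top_k(valid_idx: list[int], top_k: int, page: int, page_size: int) -> list[int]:
    top = top_k if top_k else 1024          # default cap when top_k is unset
    capped = valid_idx[:top]                # rerank now sees at most `top` candidates
    start = (page - 1) * page_size
    return capped[start:start + page_size]  # paginate after the cap is applied
```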
session.py
*Before the fix:*
When the frontend calls the retrieval API with `search_id`, the backend
only reads `meta_data_filter` from the saved `search_config`. The
`rerank_id`, `top_k`, `similarity_threshold`, and
`vector_similarity_weight` parameters are only taken from the direct
request body. Since the frontend doesn't pass these parameters
explicitly (it only passes `search_id`), they always fall back to
default values:
- `similarity_threshold` = 0.0
- `vector_similarity_weight` = 0.3
- `top_k` = 1024
- `rerank_id` = "" (no rerank)
This means user settings saved in the Search Settings page have no
effect on actual search results.
*After the fix:*
When a `search_id` is provided, the backend now reads all relevant
configuration from the saved `search_config`, including `rerank_id`,
`top_k`, `similarity_threshold`, and `vector_similarity_weight`. Request
parameters can still override these values if explicitly provided,
allowing flexibility. The rerank model is now properly instantiated
using the configured `rerank_id`, making the rerank feature actually
work.
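A hedged sketch of the merge order described above (names are assumptions, not the actual session.py code):

```python
# Explicit request values win, then the saved search_config, then the defaults.
DEFAULTS = {
    "similarity_threshold": 0.0,
    "vector_similarity_weight": 0.3,
    "top_k": 1024,
    "rerank_id": "",
}

def resolve_search_params(req: dict, search_config: dict) -> dict:
    resolved = {}
    for key, default in DEFAULTS.items():
        # Request overrides saved config; saved config overrides the default.
        resolved[key] = req[key] if key in req else search_config.get(key, default)
    return resolved
```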
**Frontend**
\web\src\pages\next-search\search-setting.tsx
*Before the fix*
In search-setting.tsx, the top_k input box is only displayed when rerank is enabled (it is wrapped in the rerankModelDisabled condition). If the rerank switch is turned off, the top_k input field is hidden, but the form value remains unchanged. In other words:
- When rerank is enabled, users can modify top_k (default 1024).
- When rerank is disabled, top_k retains its previous value but is not visible in the interface.
The backend therefore always receives the top_k parameter; the frontend UI merely binds this configuration item to the rerank switch. When rerank is turned off, top_k is not automatically reset to 1024 but retains its original value.
*After the fix*
Now, switching off the rerank model resets top_k to 1024. In addition, handling top_k in its own method rather than inside the retrieval method allows it to be controlled independently.
All methods now work as expected.
Using rerank
<img width="2378" height="1565" alt="Screenshot 2026-01-21 190206"
src="https://github.com/user-attachments/assets/fa2b0df0-1334-4ca3-b169-da6c5fd59935"
/>
Not using rerank
<img width="2596" height="1559" alt="Screenshot 2026-01-21 190229"
src="https://github.com/user-attachments/assets/c5a80522-a0e1-40e7-b349-42fe86df3138"
/>
Before the fix, both cases produced identical results.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
The API adds audio-to-text and text-to-speech functions.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Change:
Enhance delta streaming in chat functions for improved reasoning and
content handling
### Type of change
- [x] Refactoring
## Summary
Fixes #12311
Changes the `reference` field initialization from `[{}]` to `[]` in
session creation.
### Problem
When creating a session via the SDK API, the `reference` field was
incorrectly initialized as `[{}]`. This caused:
- First dialogue round: Empty reference
- Second dialogue round: Reference pointing to first round's data
- Overall misalignment between dialogue rounds and their references
### Solution
Changed the initialization to `[]` (empty list), which:
- Matches the `Conversation` model's expected default
- Ensures references grow correctly one-to-one with assistant responses
- Aligns with the service layer's expectations
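A minimal sketch of the intent, assuming a simplified session payload (the field set is illustrative, not the full Conversation schema):

```python
import uuid

def new_session(dialog_id: str, name: str, prologue: str) -> dict:
    return {
        "id": uuid.uuid4().hex,
        "dialog_id": dialog_id,
        "name": name,
        "message": [{"role": "assistant", "content": prologue}],
        "reference": [],  # previously [{}], which shifted references by one round
    }
```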
### Testing
After applying this fix:
1. Create a session via `POST /api/v1/chats/{conversation_id}/sessions`
2. Send multiple questions via `POST
/api/v1/chats/{conversation_id}/completions`
3. View the conversation on web - references should now align correctly
with each dialogue round
### What problem does this PR solve?
Manage messages and use them in the agent.
Issue #4213
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Chats completions API supports metadata filtering.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Trace information can be returned by the agent completion API.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Refactor metadata filter.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] Refactoring
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
Add DeepseekV3.2 to the Tongyi-Qianwen provider and remove unused code
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Change:
Fix an async issue and sensitive logging.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Retrieval metadata filtering adds a semi-automatic mode: users can manually select which metadata keys participate in LLM-generated filter conditions.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
Clean up synchronous functions in chat_model and implement
synchronization for conversation and dialog chats.
### Type of change
- [x] Refactoring
- [x] Performance Improvement
### What problem does this PR solve?
Make RAGFlow more asynchronous, part 2. #11551, #11579, #11619.
### Type of change
- [x] Refactoring
- [x] Performance Improvement
### What problem does this PR solve?
Make RAGFlow more asynchronous, part 2. #11551, #11579, #11619.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
- [x] Refactoring
- [x] Performance Improvement
### What problem does this PR solve?
Try to make this more asynchronous. Verified in chat and agent
scenarios, reducing blocking behavior. #11551, #11579.
However, the impact of these changes still requires further
investigation to ensure everything works as expected.
### Type of change
- [x] Refactoring
### What problem does this PR solve?
1. Move RetCode into common.constants
2. Decouple the admin and API modules
### Type of change
- [x] Refactoring
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
### What problem does this PR solve?
1. Rename identifiers
2. Fix some return statements
3. Fix some typos
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
### What problem does this PR solve?
Add get_uuid, download_img, and hash_str2int to misc_utils.py
### Type of change
- [x] Refactoring
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
### What problem does this PR solve?
Revert to chat.completions.
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [x] Other (please describe):
Revert to chat.completions.
### What problem does this PR solve?
Fix broken imports
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Signed-off-by: jinhai <haijin.chn@gmail.com>
### What problem does this PR solve?
Fix: Resolve hash collisions by switching to UUID and correct the logic for always-true statements. Solves #10165.
Feat: Update the GPT API integration. Solves #10204.
Feat: Support qianwen-deepresearch. Solves #10163.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
### What problem does this PR solve?
An exception occurs when you pass session_id to the agent_open_ai completion endpoint: session_id is supplied explicitly as well as inside **req, so the same keyword is sent twice. The logic for picking one of session_id, id, and metadata.id also seemed odd, so it has been cleaned up a little.
See #10111
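An illustrative reproduction of the duplicate-keyword problem and the cleanup, with assumed names rather than the actual handler code:

```python
# A stand-in for the completion call that accepts session_id plus other kwargs.
def completion(session_id=None, **kwargs):
    return session_id, kwargs

req = {"session_id": "abc", "question": "hi"}

# Before: session_id passed explicitly *and* inside **req ->
#   TypeError: completion() got multiple values for keyword argument 'session_id'
# completion(session_id=req.get("session_id"), **req)

# After: pick the id once, remove it from the request dict, then forward the rest.
session_id = req.pop("session_id", None) or req.pop("id", None)
completion(session_id=session_id, **req)
```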
### Type of change
- [X] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Unify reference format of agent completion and OpenAI-compatible
completion API.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] Documentation Update
- [x] Refactoring
### What problem does this PR solve?
Revert the agent completion broken by #9631.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
This PR fixes a critical bug in the session listing endpoint where the
application crashes with an `AttributeError` when processing chunk data
that contains non-dictionary objects.
**Error before fix:**
```json
{
"code": 100,
"data": null,
"message": "AttributeError(\"'str' object has no attribute 'get'\")"
}
```
**Root cause:**
The code assumes all items in the `chunks` array are dictionary objects
and directly calls the `.get()` method on them. However, in some cases,
the chunks array contains string objects or other non-dictionary types,
causing the application to crash when attempting to call `.get()` on a
string.
**Solution:**
Added type validation to ensure each chunk is a dictionary before
processing. Non-dictionary chunks are safely skipped, preventing the
crash while maintaining functionality for valid chunk data.
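A minimal sketch of the guard, with illustrative names rather than the actual endpoint code:

```python
def collect_chunk_ids(chunks: list) -> list:
    """Skip non-dict entries instead of calling .get() on them."""
    ids = []
    for chunk in chunks:
        if not isinstance(chunk, dict):
            continue  # e.g. a bare string; skipping avoids the AttributeError
        ids.append(chunk.get("chunk_id"))
    return ids
```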
This fix improves the robustness of the session listing endpoint and
ensures users can retrieve their conversation sessions without
encountering server errors due to data format inconsistencies.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
When creating a conversation, the prologue was not saved in the conversation.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Fix search app AI summary ERROR: 'dict' object has no attribute 'split'.
#9649
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
### What problem does this PR solve?
Resolve #9549 and #9436. In v0.20.x, the Agent completions API changed a lot, e.g., references were no longer returned.
### Type of change
- [x] Refactoring