ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-25 16:26:51 +08:00

Author	SHA1	Message	Date
Yongteng Lei	56cd576876	Refa: revise the implementation of LightRAG and enable response caching (#9828 ) ### What problem does this PR solve? This revision performed a comprehensive check on LightRAG to ensure the correctness of its implementation. It did not involve Entity Resolution and Community Reports Generation. There is an example using default entity types and the General chunking method, which shows good results in both time and effectiveness. Moreover, response caching is enabled for resuming failed tasks. [The-Necklace.pdf](https://github.com/user-attachments/files/22042432/The-Necklace.pdf) After: ![img_v3_02pk_177dbc6a-e7cc-4732-b202-ad4682d171fg](https://github.com/user-attachments/assets/5ef1d93a-9109-4fe9-8a7b-a65add16f82b) ```bash Begin at: Fri, 29 Aug 2025 16:48:03 GMT Duration: 222.31 s Progress: 16:48:04 Task has been received. 16:48:06 Page(1~7): Start to parse. 16:48:06 Page(1~7): OCR started 16:48:08 Page(1~7): OCR finished (1.89s) 16:48:11 Page(1~7): Layout analysis (3.72s) 16:48:11 Page(1~7): Table analysis (0.00s) 16:48:11 Page(1~7): Text merged (0.00s) 16:48:11 Page(1~7): Finish parsing. 16:48:12 Page(1~7): Generate 7 chunks 16:48:12 Page(1~7): Embedding chunks (0.29s) 16:48:12 Page(1~7): Indexing done (0.04s). Task done (7.84s) 16:48:17 Start processing for f421fb06849e11f0bdd32724b93a52b2: She had no dresses, no je... 16:48:17 Start processing for f421fb06849e11f0bdd32724b93a52b2: Her husband, already half... 16:48:17 Start processing for f421fb06849e11f0bdd32724b93a52b2: And this life lasted ten ... 16:48:17 Start processing for f421fb06849e11f0bdd32724b93a52b2: Then she asked, hesitatin... 16:49:30 Completed processing for f421fb06849e11f0bdd32724b93a52b2: She had no dresses, no je... after 1 gleanings, 21985 tokens. 16:49:30 Entities extraction of chunk 3 1/7 done, 12 nodes, 13 edges, 21985 tokens. 16:49:40 Completed processing for f421fb06849e11f0bdd32724b93a52b2: Finally, she replied, hes... after 1 gleanings, 22584 tokens. 16:49:40 Entities extraction of chunk 5 2/7 done, 19 nodes, 19 edges, 22584 tokens. 16:50:02 Completed processing for f421fb06849e11f0bdd32724b93a52b2: Then she asked, hesitatin... after 1 gleanings, 24610 tokens. 16:50:02 Entities extraction of chunk 0 3/7 done, 16 nodes, 28 edges, 24610 tokens. 16:50:03 Completed processing for f421fb06849e11f0bdd32724b93a52b2: And this life lasted ten ... after 1 gleanings, 24031 tokens. 16:50:04 Entities extraction of chunk 1 4/7 done, 24 nodes, 22 edges, 24031 tokens. 16:50:14 Completed processing for f421fb06849e11f0bdd32724b93a52b2: So they begged the jewell... after 1 gleanings, 24635 tokens. 16:50:14 Entities extraction of chunk 6 5/7 done, 27 nodes, 26 edges, 24635 tokens. 16:50:29 Completed processing for f421fb06849e11f0bdd32724b93a52b2: Her husband, already half... after 1 gleanings, 25758 tokens. 16:50:29 Entities extraction of chunk 2 6/7 done, 25 nodes, 35 edges, 25758 tokens. 16:51:35 Completed processing for f421fb06849e11f0bdd32724b93a52b2: The Necklace By Guy de Ma... after 1 gleanings, 27491 tokens. 16:51:35 Entities extraction of chunk 4 7/7 done, 39 nodes, 37 edges, 27491 tokens. 16:51:35 Entities and relationships extraction done, 147 nodes, 177 edges, 171094 tokens, 198.58s. 16:51:35 Entities merging done, 0.01s. 16:51:35 Relationships merging done, 0.01s. 16:51:35 ignored 7 relations due to missing entities. 16:51:35 generated subgraph for doc f421fb06849e11f0bdd32724b93a52b2 in 198.68 seconds. 16:51:35 run_graphrag f421fb06849e11f0bdd32724b93a52b2 graphrag_task_lock acquired 16:51:35 set_graph removed 0 nodes and 0 edges from index in 0.00s. 16:51:35 Get embedding of nodes: 9/147 16:51:35 Get embedding of nodes: 109/147 16:51:37 Get embedding of edges: 9/170 16:51:37 Get embedding of edges: 109/170 16:51:40 set_graph converted graph change to 319 chunks in 4.21s. 16:51:40 Insert chunks: 4/319 16:51:40 Insert chunks: 104/319 16:51:40 Insert chunks: 204/319 16:51:40 Insert chunks: 304/319 16:51:40 set_graph added/updated 147 nodes and 170 edges from index in 0.53s. 16:51:40 merging subgraph for doc f421fb06849e11f0bdd32724b93a52b2 into the global graph done in 4.79 seconds. 16:51:40 Knowledge Graph done (204.29s) ``` Before: ![img_v3_02pk_63370edf-ecee-4ee8-8ac8-69c8d2c712fg](https://github.com/user-attachments/assets/1162eb0f-68c2-4de5-abe0-cdfa168f71de) ```bash Begin at: Fri, 29 Aug 2025 17:00:47 GMT processDuration: 173.38 s Progress: 17:00:49 Task has been received. 17:00:51 Page(1~7): Start to parse. 17:00:51 Page(1~7): OCR started 17:00:53 Page(1~7): OCR finished (1.82s) 17:00:57 Page(1~7): Layout analysis (3.64s) 17:00:57 Page(1~7): Table analysis (0.00s) 17:00:57 Page(1~7): Text merged (0.00s) 17:00:57 Page(1~7): Finish parsing. 17:00:57 Page(1~7): Generate 7 chunks 17:00:57 Page(1~7): Embedding chunks (0.31s) 17:00:57 Page(1~7): Indexing done (0.03s). Task done (7.88s) 17:00:57 created task graphrag 17:01:00 Task has been received. 17:02:17 Entities extraction of chunk 1 1/7 done, 9 nodes, 9 edges, 10654 tokens. 17:02:31 Entities extraction of chunk 2 2/7 done, 12 nodes, 13 edges, 11066 tokens. 17:02:33 Entities extraction of chunk 4 3/7 done, 9 nodes, 10 edges, 10433 tokens. 17:02:42 Entities extraction of chunk 5 4/7 done, 11 nodes, 14 edges, 11290 tokens. 17:02:52 Entities extraction of chunk 6 5/7 done, 13 nodes, 15 edges, 11039 tokens. 17:02:55 Entities extraction of chunk 3 6/7 done, 14 nodes, 13 edges, 11466 tokens. 17:03:32 Entities extraction of chunk 0 7/7 done, 19 nodes, 18 edges, 13107 tokens. 17:03:32 Entities and relationships extraction done, 71 nodes, 89 edges, 79055 tokens, 149.66s. 17:03:32 Entities merging done, 0.01s. 17:03:32 Relationships merging done, 0.01s. 17:03:32 ignored 1 relations due to missing entities. 17:03:32 generated subgraph for doc b1d9d3b6848711f0aacd7ddc0714c4d3 in 149.69 seconds. 17:03:32 run_graphrag b1d9d3b6848711f0aacd7ddc0714c4d3 graphrag_task_lock acquired 17:03:32 set_graph removed 0 nodes and 0 edges from index in 0.00s. 17:03:32 Get embedding of nodes: 9/71 17:03:33 Get embedding of edges: 9/88 17:03:34 set_graph converted graph change to 161 chunks in 2.27s. 17:03:34 Insert chunks: 4/161 17:03:34 Insert chunks: 104/161 17:03:34 set_graph added/updated 71 nodes and 88 edges from index in 0.28s. 17:03:34 merging subgraph for doc b1d9d3b6848711f0aacd7ddc0714c4d3 into the global graph done in 2.60 seconds. 17:03:34 Knowledge Graph done (153.18s) ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring - [x] Performance Improvement	2025-08-29 17:58:36 +08:00
Yongteng Lei	209ef09dc3	Feat: add Zhipu GLM-4.5 model series (#9715 ) ### What problem does this PR solve? Add Zhipu GLM-4.5 model series. #9708. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-08-26 13:48:00 +08:00
ycz	370c8bc25b	Update llm_factories.json (#9714 ) ### What problem does this PR solve? add ZhipuAI GLM-4.5 model series ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-08-26 11:49:01 +08:00
Yongteng Lei	787e0c6786	Refa: OpenAI whisper-1 (#9552 ) ### What problem does this PR solve? Refactor OpenAI to enable audio parsing. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2025-08-19 16:41:18 +08:00
Yongteng Lei	fe32952825	Fix: Gemini parameters error (#9520 ) ### What problem does this PR solve? Fix Gemini parameters error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-08-18 14:51:10 +08:00
Yongteng Lei	79481becea	Feat: supports GPT-5 (#9320 ) ### What problem does this PR solve? Supports GPT-5. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-08-08 11:54:40 +08:00
Yongteng Lei	46a35f44da	Feat: add Claude Opus 4.1 (#9268 ) ### What problem does this PR solve? Add Claude Opus 4. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring	2025-08-06 10:57:03 +08:00
TeslaZY	b26088ab70	Add a series of qwen3 latest SOTA models (#9140 ) ### What problem does this PR solve? Add a series of qwen3 latest SOTA models: qwen3-coder-480b-a35b-instruct, qwen3-30b-a3b-instruct-2507, qwen3-30b-a3b-thinking-2507, qwen3-235b-a22b-instruct-2507, qwen3-235b-a22b-thinking-2507 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-08-01 15:19:51 +08:00
Yongteng Lei	4b98119c52	Fix: kimi-latest is not authorized (#9151 ) ### What problem does this PR solve? Fix kimi-latest is not authorized. Add kimi-thinking-preview. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-08-01 12:40:58 +08:00
JI4JUN	aeaeb169e4	Feat/support 302ai provider (#8742 ) ### What problem does this PR solve? Support 302.AI provider. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-31 14:48:30 +08:00
TeslaZY	46ded9d329	add Kimi-K2-Instruct from Tongyi-Qianwen API (#9125 ) ### What problem does this PR solve? add Kimi-K2-Instruct from Tongyi-Qianwen API ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-31 14:42:32 +08:00
Yongteng Lei	7ebc1f0943	Feat: add model provider DeepInfra (#9003 ) ### What problem does this PR solve? Add model provider DeepInfra. This model list comes from our community. NOTE: most endpoints haven't been tested, but they should work as OpenAI does. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-23 18:10:35 +08:00
謝富祥	0d7244e4a4	Fix: Adds newest Gemini models to fit google's standard API rate limits (#8970 ) ### What problem does this PR solve? Adds configurations for gemini-2.5-flash and Gemini 2.5-pro models, including tags, maximum token limits, and model types. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-23 10:18:04 +08:00
Yongteng Lei	ed7bea060f	Feat: add Kimi model series support (#8866 ) ### What problem does this PR solve? Add Kimi model series support. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-16 15:31:57 +08:00
Tuan Le	a9abf9df48	Adds new Voyage embedding models (#8845 ) ### What problem does this PR solve? This PR enhances the application's capabilities by adding support for four new Voyage embedding models (voyage-3-large, voyage-3.5, voyage-3.5-lite, and voyage-code-3) to the `llm_factories.json` configuration file. These models expand the available options for text embedding tasks, enabling improved processing of text data with a maximum token limit of 32,000. This addition addresses the need for more diverse and specialized embedding models to support various use cases without altering existing functionality. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-16 11:41:06 +08:00
Yongteng Lei	1895667573	Feat: add xAI provider (#8781 ) ### What problem does this PR solve? Add xAI provider (experimental feature, requires user feedback). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-11 10:35:23 +08:00
Kevin Hu	fffb7c0bba	Fix: anthropic llm issue. (#8633 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-02 18:37:34 +08:00
Kevin Hu	aafeffa292	Feat: add gitee as LLM provider. (#8545 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-30 09:22:31 +08:00
Yongteng Lei	5e30426916	Feat: add Qwen3-Embedding text-embedding-v4 (#8184 ) ### What problem does this PR solve? Add Qwen3-Embedding text-embedding-v4. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-11 15:32:05 +08:00
Yongteng Lei	37075eab98	Feat: add voyage-multimodal-3 (#7987 ) ### What problem does this PR solve? Add voyage-multimodal-3. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-03 11:56:59 +08:00
Yongteng Lei	6c9b8ec860	Refa: update gemini2.5 (#7822 ) ### What problem does this PR solve? Update gemini2.5 ### Type of change - [x] Refactoring	2025-05-23 20:29:10 +08:00
Yongteng Lei	50ff16e7a4	Feat: add claude4 models (#7809 ) ### What problem does this PR solve? Add claude4 models. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:25:13 +08:00
liu an	e166f132b3	Feat: change default models (#7777 ) ### What problem does this PR solve? change default models to buildin models https://github.com/infiniflow/ragflow/issues/7774 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:21:25 +08:00
Debug Doctor	36e32dde1a	Feat: update llm factories for SILICONFLOW (#7620 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Other (please describe): llm factories update	2025-05-14 19:46:27 +08:00
Andrea	e39ceb2bd1	Feat: add support for OpenAi gpt 4.1 series (#7540 ) ### What problem does this PR solve? Adds support for the GPT-4.1 series from OpenAI. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-12 18:24:53 +08:00
QuintinTao	e9053b6ed4	fix bug #7309 deepseek-ai/deepseek-vl2 model can not be select as a VL model to parse pdf image (#7312 ) ### What problem does this PR solve? fix deepseek-ai/deepseek-vl2 model can not be select as a VL model to parse pdf image . And add other vl models config from siliconflow _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: unknown <taoshi.ln@chinatelecom.cn>	2025-05-08 11:24:39 +08:00
Yongteng Lei	093d280528	Feat: add Qwen3 and OpenAI o series (#7415 ) ### What problem does this PR solve? Qwen3 and more LLMs. Close #7296 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-04-29 18:26:29 +08:00
Neal Davis	23dcbc94ef	feat: replace models of novita (#7360 ) ### What problem does this PR solve? Replace models of novita ### Type of change - [x] Other (please describe): Replace models of novita	2025-04-28 13:35:09 +08:00
Jason Li	67b087019c	Update Groq AI Model Config (#7335 ) With current config will get error "Fail to access model(gemma-7b-it) using this api key" Since the model has been removed, according to Groq official document: https://console.groq.com/docs/models ### Type of change - [ x] Bug Fix (non-breaking change which fixes an issue)	2025-04-27 17:05:25 +08:00
Yongteng Lei	018ff4dd0a	Refa: update llms (#7007 ) ### What problem does this PR solve? Update LLM models ### Type of change - [x] Refactoring	2025-04-15 09:19:07 +08:00
Kevin Hu	5b5558300a	Feat: add gemini-2.5-pro-exp-03-25 (#6774 ) ### What problem does this PR solve? #6733 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-03 10:48:58 +08:00
Kevin Hu	fc21dd0a4a	Feat: add qwq-plus-latest (#6702 ) ### What problem does this PR solve? #6697 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-01 11:06:03 +08:00
Kevin Hu	7d9dd1e5d3	Refa: remove default build-in rerank model. (#6682 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-03-31 15:33:19 +08:00
Kevin Hu	d2043ff9f2	Fix: LmStudioChat issue. (#6591 ) ### What problem does this PR solve? #6577 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 14:59:15 +08:00
Chenzy	735d9dd949	Feat: add "tools" to llm_factories.json (#6552 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Chenzy <chenzy901@gmail.com>	2025-03-26 17:31:18 +08:00
Kevin Hu	85eb3775d6	Refa: update Anthropic models. (#6445 ) ### What problem does this PR solve? #6421 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-24 12:34:57 +08:00
crypticGøøse	f16418ccf7	Feat: Add deepseek to llm_factories (#6051 ) ### What problem does this PR solve? AWS Bedrock has made deepseek-r1 available on its serverless inference. This adds the R1 serverless model for use via the bedrock model abilities. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-14 10:35:44 +08:00
kuro5989	6e13922bdc	Feat: Add qwq model support to Tongyi-Qianwen factory (#5981 ) ### What problem does this PR solve? add qwq model support to Tongyi-Qianwen factory https://github.com/infiniflow/ragflow/issues/5869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) ![image](https://github.com/user-attachments/assets/49f5c6a0-ecaf-41dd-a23a-2009f854d62c) ![image](https://github.com/user-attachments/assets/93ffa303-920e-4942-8188-bcd6b7209204) ![1741774779438](https://github.com/user-attachments/assets/25f2fd1d-8640-4df0-9a08-78ee9daaa8fe) ![image](https://github.com/user-attachments/assets/4763cf6c-1f76-43c4-80ee-74dfd666a184) Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>	2025-03-12 18:54:15 +08:00
Kevin Hu	fa817a8ab3	Refa: SiliconFlow model list refresh. (#5825 ) ### What problem does this PR solve? #5806 ### Type of change - [x] Refactoring	2025-03-10 12:51:12 +08:00
Kevin Hu	e05658685c	Refa: update mistral model list. (#5818 ) ### What problem does this PR solve? #5782 ### Type of change - [x] Refactoring	2025-03-10 11:22:06 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
Debug Doctor	202acbd628	Perf: update novita.ai LLM library (#5574 ) ### What problem does this PR solve? LLM library update ### Type of change - [x] Other : config update	2025-03-04 11:35:25 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
petertc	4694604836	Specify img2text model by tag (#5063 ) ### What problem does this PR solve? The current design is not well-suited for multimodal models, as each model can only be configured for a single purpose—either chat or Img2txt. To work around this limitation, we use model aliases such as gpt-4o-mini and gpt-4o-mini-2024-07-18. To fix this, this PR allows specifying the Img2txt model by tag instead of model_type. ### Type of change - [x] Refactoring	2025-02-18 11:14:48 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
so95	754d5ea364	add gemini-2.0-flash-thinking-exp-01-21 (#4957 ) add gemini-2.0-flash-thinking-exp-01-21	2025-02-14 13:31:07 +08:00
DiamondPoirier	a03f5dd9f6	Add a list of large language models of deepseek and image2text models… (#4914 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:52:29 +08:00
DiamondPoirier	415c4b7ed5	Organized and add a list of large language models of Nvidia.v1.1 (#4910 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:10:19 +08:00
Kevin Hu	55823dbdf6	Refresh Gemini model list. (#4780 ) ### What problem does this PR solve? #4761 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-08 10:19:51 +08:00

1 2 3

112 Commits