ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-25 16:26:51 +08:00

Author	SHA1	Message	Date
Yongteng Lei	37075eab98	Feat: add voyage-multimodal-3 (#7987 ) ### What problem does this PR solve? Add voyage-multimodal-3. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-03 11:56:59 +08:00
Yongteng Lei	6c9b8ec860	Refa: update gemini2.5 (#7822 ) ### What problem does this PR solve? Update gemini2.5 ### Type of change - [x] Refactoring	2025-05-23 20:29:10 +08:00
Yongteng Lei	50ff16e7a4	Feat: add claude4 models (#7809 ) ### What problem does this PR solve? Add claude4 models. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:25:13 +08:00
liu an	e166f132b3	Feat: change default models (#7777 ) ### What problem does this PR solve? Change default models to built-in models. https://github.com/infiniflow/ragflow/issues/7774 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:21:25 +08:00
Debug Doctor	36e32dde1a	Feat: update llm factories for SILICONFLOW (#7620 ) ### What problem does this PR solve? ### Type of change - [x] Other (please describe): llm factories update	2025-05-14 19:46:27 +08:00
Andrea	e39ceb2bd1	Feat: add support for OpenAi gpt 4.1 series (#7540 ) ### What problem does this PR solve? Adds support for the GPT-4.1 series from OpenAI. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-12 18:24:53 +08:00
QuintinTao	e9053b6ed4	fix bug #7309 deepseek-ai/deepseek-vl2 model cannot be selected as a VL model to parse PDF images (#7312 ) ### What problem does this PR solve? Fixes the deepseek-ai/deepseek-vl2 model not being selectable as a VL model to parse PDF images. Also adds other VL model configs from SiliconFlow. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: unknown <taoshi.ln@chinatelecom.cn>	2025-05-08 11:24:39 +08:00
Yongteng Lei	093d280528	Feat: add Qwen3 and OpenAI o series (#7415 ) ### What problem does this PR solve? Qwen3 and more LLMs. Close #7296 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-04-29 18:26:29 +08:00
Neal Davis	23dcbc94ef	feat: replace models of novita (#7360 ) ### What problem does this PR solve? Replace models of novita ### Type of change - [x] Other (please describe): Replace models of novita	2025-04-28 13:35:09 +08:00
Jason Li	67b087019c	Update Groq AI Model Config (#7335 ) The current config produces the error "Fail to access model(gemma-7b-it) using this api key" because the model has been removed, according to the official Groq documentation: https://console.groq.com/docs/models ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-04-27 17:05:25 +08:00
Yongteng Lei	018ff4dd0a	Refa: update llms (#7007 ) ### What problem does this PR solve? Update LLM models ### Type of change - [x] Refactoring	2025-04-15 09:19:07 +08:00
Kevin Hu	5b5558300a	Feat: add gemini-2.5-pro-exp-03-25 (#6774 ) ### What problem does this PR solve? #6733 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-03 10:48:58 +08:00
Kevin Hu	fc21dd0a4a	Feat: add qwq-plus-latest (#6702 ) ### What problem does this PR solve? #6697 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-01 11:06:03 +08:00
Kevin Hu	7d9dd1e5d3	Refa: remove default build-in rerank model. (#6682 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-03-31 15:33:19 +08:00
Kevin Hu	d2043ff9f2	Fix: LmStudioChat issue. (#6591 ) ### What problem does this PR solve? #6577 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 14:59:15 +08:00
Chenzy	735d9dd949	Feat: add "tools" to llm_factories.json (#6552 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Chenzy <chenzy901@gmail.com>	2025-03-26 17:31:18 +08:00
Kevin Hu	85eb3775d6	Refa: update Anthropic models. (#6445 ) ### What problem does this PR solve? #6421 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-24 12:34:57 +08:00
crypticGøøse	f16418ccf7	Feat: Add deepseek to llm_factories (#6051 ) ### What problem does this PR solve? AWS Bedrock has made deepseek-r1 available on its serverless inference. This adds the R1 serverless model for use via the bedrock model abilities. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-14 10:35:44 +08:00
kuro5989	6e13922bdc	Feat: Add qwq model support to Tongyi-Qianwen factory (#5981 ) ### What problem does this PR solve? add qwq model support to Tongyi-Qianwen factory https://github.com/infiniflow/ragflow/issues/5869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>	2025-03-12 18:54:15 +08:00
Kevin Hu	fa817a8ab3	Refa: SiliconFlow model list refresh. (#5825 ) ### What problem does this PR solve? #5806 ### Type of change - [x] Refactoring	2025-03-10 12:51:12 +08:00
Kevin Hu	e05658685c	Refa: update mistral model list. (#5818 ) ### What problem does this PR solve? #5782 ### Type of change - [x] Refactoring	2025-03-10 11:22:06 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
Debug Doctor	202acbd628	Perf: update novita.ai LLM library (#5574 ) ### What problem does this PR solve? LLM library update ### Type of change - [x] Other : config update	2025-03-04 11:35:25 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Add VLLM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: - New Feature (non-breaking change which adds functionality) - Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
petertc	4694604836	Specify img2text model by tag (#5063 ) ### What problem does this PR solve? The current design is not well-suited for multimodal models, as each model can only be configured for a single purpose—either chat or Img2txt. To work around this limitation, we use model aliases such as gpt-4o-mini and gpt-4o-mini-2024-07-18. To fix this, this PR allows specifying the Img2txt model by tag instead of model_type. ### Type of change - [x] Refactoring	2025-02-18 11:14:48 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
so95	754d5ea364	add gemini-2.0-flash-thinking-exp-01-21 (#4957 ) add gemini-2.0-flash-thinking-exp-01-21	2025-02-14 13:31:07 +08:00
DiamondPoirier	a03f5dd9f6	Add a list of large language models of deepseek and image2text models… (#4914 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:52:29 +08:00
DiamondPoirier	415c4b7ed5	Organized and add a list of large language models of Nvidia.v1.1 (#4910 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:10:19 +08:00
Kevin Hu	55823dbdf6	Refresh Gemini model list. (#4780 ) ### What problem does this PR solve? #4761 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-08 10:19:51 +08:00
Kevin Hu	4150805073	More models for siliconflow. (#4756 ) ### What problem does this PR solve? #4751 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-07 10:32:52 +08:00
Kevin Hu	4b9c4c0705	Update deepseek model provider info. (#4714 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-05 13:43:40 +08:00
Kevin Hu	656a2fab41	Refresh deepseek models. (#4660 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-27 11:01:39 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	9c6cf12137	Refactor model list. (#4346 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-01-03 19:55:42 +08:00
petertc	accd3a6c7e	Support OpenAI gpt-4o and gpt-4o-mini for img2text (#4300 ) ### What problem does this PR solve? OpenAI has deprecated the gpt-4-vision-preview model. This PR adds support for the newer gpt-4o and gpt-4o-mini models in the img2text feature. This PR adds additional 4o/4o-mini entries for img2text alongside the original ones. It uses [alias](https://platform.openai.com/docs/models#gpt-4o) model names (e.g., gpt-4o-2024-08-06) because the database schema uses the model name as the primary key. - [x] Other (please describe): model update	2024-12-31 14:36:06 +08:00
Kevin Hu	2cbe064080	Add Llama3.3 (#4174 ) ### What problem does this PR solve? #4168 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-23 11:18:01 +08:00
so95	478da3118c	add gemini 2.0 (#4115 ) add gemini 2.0	2024-12-19 17:30:45 +08:00
Kevin Hu	934dbc2e2b	Add more mistral models. (#3826 ) ### What problem does this PR solve? #3647 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-03 15:18:38 +08:00
devMls	59a5813f1b	add jina new models in jina connector (#3770 ) ### What problem does this PR solve? Add new models to the Jina connector to allow the use of multilingual models. ### Type of change - [X] Other (please describe): new connectors no breaking change	2024-12-02 10:06:39 +08:00
Kevin Hu	81c7b6afc5	Make spark model more robust to model name (#3514 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-20 20:53:44 +08:00
Jarry	0657a09e2c	Update llm_factories.json (#3396 ) ### What problem does this PR solve? Added: Claude-3-5-sonnet-20241022 version. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-14 13:00:16 +08:00
shijiefengjun	632b23486f	Fix the value issue of anthropic (#3351 ) ### What problem does this PR solve? This pull request fixes the issue mentioned in https://github.com/infiniflow/ragflow/issues/3263. 1. The response should be parsed as a dict, to prevent the following code from failing to read values: ans = response["content"][0]["text"] 2. The API model ```claude-instant-1.2``` has been retired (per [model-deprecations](https://docs.anthropic.com/en/docs/resources/model-deprecations)) and triggers errors in the code, so it was deleted from the conf/llm_factories.json file and the latest API model ```claude-3-5-sonnet-20241022``` was added. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: chenhaodong <chenhaodong@ctrlvideo.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 16:13:52 +08:00
Kevin Hu	70ea6661ed	add new models for zhipu-ai (#3348 ) ### What problem does this PR solve? #3345 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-12 11:27:43 +08:00
Kevin Hu	4097912d59	add inputs to display to every components (#3242 ) ### What problem does this PR solve? #3240 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-06 18:47:53 +08:00
Yangong	7e89be5ed1	feat: add qwen 2.5 models for silicon flow (#3203 ) ### What problem does this PR solve? add qwen 2.5 models for silicon flow ### Type of change - [X] New Feature (non-breaking change which adds functionality)	2024-11-05 13:58:29 +08:00
Kevin Hu	6c6b658ffe	add yi-lightning (#3119 ) ### What problem does this PR solve? #3111 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-31 10:30:23 +08:00
Kevin Hu	8257eeb3f2	add model moonshot-v1-auto (#3051 ) ### What problem does this PR solve? #3048 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-28 10:37:22 +08:00
chongchuanbing	ac26d09a59	Feature/feat1017 (#2872 ) ### What problem does this PR solve? 1. fix: mind map display error in knowledge graph, caused by a change in the ```@antv/g6``` version 2. feat: concurrent threads configuration support in graph extractor 3. fix: used tokens update failed for tenant 4. feat: timeout configuration support for llm 5. fix: regex error in graph extractor 6. feat: qwen rerank(```gte-rerank```) support 7. fix: timeout handling in knowledge graph index process. Chat now uses stream output, which is also configurable. 8. feat: ```qwen-long``` model configuration ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: chongchuanbing <chongchuanbing@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-21 12:11:08 +08:00

93 Commits