ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-08 20:42:30 +08:00

Author	SHA1	Message	Date
Kevin Hu	fa817a8ab3	Refa: SiliconFlow model list refresh. (#5825 ) ### What problem does this PR solve? #5806 ### Type of change - [x] Refactoring	2025-03-10 12:51:12 +08:00
Kevin Hu	e05658685c	Refa: update mistral model list. (#5818 ) ### What problem does this PR solve? #5782 ### Type of change - [x] Refactoring	2025-03-10 11:22:06 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
Debug Doctor	202acbd628	Perf: update novita.ai LLM library (#5574 ) ### What problem does this PR solve? LLM library update ### Type of change - [x] Other : config update	2025-03-04 11:35:25 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
petertc	4694604836	Specify img2text model by tag (#5063 ) ### What problem does this PR solve? The current design is not well-suited for multimodal models, as each model can only be configured for a single purpose—either chat or Img2txt. To work around this limitation, we use model aliases such as gpt-4o-mini and gpt-4o-mini-2024-07-18. To fix this, this PR allows specifying the Img2txt model by tag instead of model_type. ### Type of change - [x] Refactoring	2025-02-18 11:14:48 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
so95	754d5ea364	add gemini-2.0-flash-thinking-exp-01-21 (#4957 ) add gemini-2.0-flash-thinking-exp-01-21	2025-02-14 13:31:07 +08:00
DiamondPoirier	a03f5dd9f6	Add a list of large language models of deepseek and image2text models… (#4914 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:52:29 +08:00
DiamondPoirier	415c4b7ed5	Organized and add a list of large language models of Nvidia.v1.1 (#4910 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:10:19 +08:00
Kevin Hu	55823dbdf6	Refresh Gemini model list. (#4780 ) ### What problem does this PR solve? #4761 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-08 10:19:51 +08:00
Kevin Hu	4150805073	More models for siliconflow. (#4756 ) ### What problem does this PR solve? #4751 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-07 10:32:52 +08:00
Kevin Hu	4b9c4c0705	Update deepseek model provider info. (#4714 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-05 13:43:40 +08:00
Kevin Hu	656a2fab41	Refresh deepseek models. (#4660 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-27 11:01:39 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	9c6cf12137	Refactor model list. (#4346 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-01-03 19:55:42 +08:00
petertc	accd3a6c7e	Support OpenAI gpt-4o and gpt-4o-mini for img2text (#4300 ) ### What problem does this PR solve? OpenAI has deprecated the gpt-4-vision-preview model. This PR adds support for the newer gpt-4o and gpt-4o-mini models in the img2text feature. ![image](https://github.com/user-attachments/assets/6dddf2dc-1b9e-4e94-bf07-6bf77d39122b) This PR add addtional 4o/4o-mini entry for img2text besides original ones. Utilized [alias](https://platform.openai.com/docs/models#gpt-4o) model names (e.g., gpt-4o-2024-08-06) because the database schema uses the model name as the primary key. - [x] Other (please describe): model update	2024-12-31 14:36:06 +08:00
Kevin Hu	2cbe064080	Add Llama3.3 (#4174 ) ### What problem does this PR solve? #4168 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-23 11:18:01 +08:00
so95	478da3118c	add gemini 2.0 (#4115 ) add gemini 2.0	2024-12-19 17:30:45 +08:00
Kevin Hu	934dbc2e2b	Add more mistral models. (#3826 ) ### What problem does this PR solve? #3647 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-03 15:18:38 +08:00
devMls	59a5813f1b	add jina new models in jina connector (#3770 ) ### What problem does this PR solve? add new models in jinna connector, to allow use models that support multilingual models ### Type of change - [X] Other (please describe): new connectors no breaking change	2024-12-02 10:06:39 +08:00
Kevin Hu	81c7b6afc5	Make spark model robuster to model name (#3514 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-20 20:53:44 +08:00
Jarry	0657a09e2c	Update llm_factories.json (#3396 ) ### What problem does this PR solve? Added: Claude-3-5-sonnet-20241022 version. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-14 13:00:16 +08:00
shijiefengjun	632b23486f	Fix the value issue of anthropic (#3351 ) ### What problem does this PR solve? This pull request fixes the issue mentioned in https://github.com/infiniflow/ragflow/issues/3263. 1. response should be parsed as dict, prevent the following code from failing to take values: ans = response["content"][0]["text"] 2. API Model ```claude-instant-1.2``` has retired (by [model-deprecations](https://docs.anthropic.com/en/docs/resources/model-deprecations)), it will trigger errors in the code, so I deleted it from the conf/llm_factories.json file and updated the latest API Model ```claude-3-5-sonnet-20241022``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: chenhaodong <chenhaodong@ctrlvideo.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 16:13:52 +08:00
Kevin Hu	70ea6661ed	add new models for zhipu-ai (#3348 ) ### What problem does this PR solve? #3345 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-12 11:27:43 +08:00
Kevin Hu	4097912d59	add inputs to display to every components (#3242 ) ### What problem does this PR solve? #3240 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-06 18:47:53 +08:00
Yangong	7e89be5ed1	feat: add qwen 2.5 models for silicon flow (#3203 ) ### What problem does this PR solve? add qwen 2.5 models for silicon flow ### Type of change - [X] New Feature (non-breaking change which adds functionality)	2024-11-05 13:58:29 +08:00
Kevin Hu	6c6b658ffe	add yi-lightning (#3119 ) ### What problem does this PR solve? #3111 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-31 10:30:23 +08:00
Kevin Hu	8257eeb3f2	add model moonshot-v1-auto (#3051 ) ### What problem does this PR solve? #3048 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-28 10:37:22 +08:00
chongchuanbing	ac26d09a59	Feature/feat1017 (#2872 ) ### What problem does this PR solve? 1. fix: mid map show error in knowledge graph, juse because ```@antv/g6```version changed 2. feat: concurrent threads configuration support in graph extractor 3. fix: used tokens update failed for tenant 4. feat: timeout configuration support for llm 5. fix: regex error in graph extractor 6. feat: qwen rerank(```gte-rerank```) support 7. fix: timeout deal in knowledge graph index process. Now chat by stream output, also, it is configuratable. 8. feat: ```qwen-long``` model configuration ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: chongchuanbing <chongchuanbing@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-21 12:11:08 +08:00
Kevin Hu	c1d0473f49	add zhipu glm-4-9b (#2912 ) ### What problem does this PR solve? #2910 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-21 10:30:35 +08:00
JobSmithManipulation	18f80743eb	support api-version and change default-model in adding azure-openai and openai (#2799 ) ### What problem does this PR solve? #2701 #2712 #2749 ### Type of change -[x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-11 11:26:42 +08:00
JobSmithManipulation	96f56a3c43	add huggingface model (#2624 ) ### What problem does this PR solve? #2469 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-09-27 19:15:38 +08:00
liuhua	d545633a6c	OpenAITTS (#2493 ) ### What problem does this PR solve? OpenAITTS ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-09-19 16:55:18 +08:00
黄腾	d6e6c530d7	fix OpenRouter add bug and the way to add OpenRouter model (#2364 ) ### What problem does this PR solve? #2359 fix OpenRouter add bug and the way to add OpenRouter model ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-11 15:10:25 +08:00
黄腾	80656309f7	fix azure-openai add bug (#2314 ) ### What problem does this PR solve? #2236 fix azure-openai add bug ### Type of change - [x] Bug Fix --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-09 12:10:45 +08:00
黄腾	63da2cb7d5	fix SILICONFLOW rerank error (#2313 ) ### What problem does this PR solve? #2231 fix SILICONFLOW rerank error ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-09 11:41:37 +08:00
黄腾	cb69c742b0	add support for TongyiQwen tts (#2311 ) ### What problem does this PR solve? add support for TongyiQwen tts #1853 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-09 11:01:43 +08:00
黄腾	5decdde182	add support for Google Cloud (#2175 ) ### What problem does this PR solve? #1853 add support for Google Cloud ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-02 12:06:41 +08:00
黄腾	99993e5026	add support for Voyage AI (#2159 ) ### What problem does this PR solve? #1853 #2138 add support for Voyage AI ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-29 16:14:49 +08:00
黄腾	06abef66ef	add support for Anthropic (#2148 ) ### What problem does this PR solve? #1853 add support for Anthropic ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-29 13:30:06 +08:00
zhuhao	e9f5468a49	fix the max token of Tongyi-Qianwen text-embedding-v3 model to 8k (#2118 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ fix the max token of Tongyi-Qianwen text-embedding-v3 model to 8k close #2117 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-08-28 10:14:19 +08:00
黄腾	2da4e7aa46	add support for Tencent Cloud ASR (#2102 ) ### What problem does this PR solve? add support for Tencent Cloud ASR ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-27 11:47:11 +08:00
黄腾	cf038e099f	update groq llm (#2103 ) ### What problem does this PR solve? #2076 update groq llm. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-27 11:42:00 +08:00
黄腾	6b7c028578	add support for TTS model (#2095 ) ### What problem does this PR solve? add support for TTS model #1853 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-26 15:19:43 +08:00
yungongzi	7539d142a9	VolcEngine SDK V3 adaptation (#2082 ) 1) Configuration interface update 2) Back-end adaptation API update Note: The official no longer supports the Skylark1/2 series, and all have been switched to the Doubao series ![image](https://github.com/user-attachments/assets/f6fd8782-0cdf-4c0b-ac8f-9eb130f667a5) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): Co-authored-by: 海贼宅 <stu_xyx@163.com>	2024-08-26 13:34:29 +08:00
黄腾	733219cc3f	add support for Baidu yiyan (#2049 ) ### What problem does this PR solve? add support for Baidu yiyan ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-22 16:45:15 +08:00
黄腾	be431449bd	add support for XunFei Spark (#2017 ) ### What problem does this PR solve? #1853 add support for XunFei Spark ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-20 16:56:42 +08:00
黄腾	6f438e0a49	add support for Tencent Hunyuan (#2015 ) ### What problem does this PR solve? #1853 ### Type of change - [X] New Feature (non-breaking change which adds functionality) Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-20 15:27:13 +08:00

1 2

74 Commits