ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-26 00:46:52 +08:00

Author	SHA1	Message	Date
TeslaZY	b26088ab70	Add a series of qwen3 latest SOTA models (#9140 ) ### What problem does this PR solve? Add a series of qwen3 latest SOTA models: qwen3-coder-480b-a35b-instruct, qwen3-30b-a3b-instruct-2507, qwen3-30b-a3b-thinking-2507, qwen3-235b-a22b-instruct-2507, qwen3-235b-a22b-thinking-2507 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-08-01 15:19:51 +08:00
Yongteng Lei	4b98119c52	Fix: kimi-latest is not authorized (#9151 ) ### What problem does this PR solve? Fix kimi-latest is not authorized. Add kimi-thinking-preview. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-08-01 12:40:58 +08:00
JI4JUN	aeaeb169e4	Feat/support 302ai provider (#8742 ) ### What problem does this PR solve? Support 302.AI provider. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-31 14:48:30 +08:00
TeslaZY	46ded9d329	add Kimi-K2-Instruct from Tongyi-Qianwen API (#9125 ) ### What problem does this PR solve? add Kimi-K2-Instruct from Tongyi-Qianwen API ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-31 14:42:32 +08:00
Yongteng Lei	7ebc1f0943	Feat: add model provider DeepInfra (#9003 ) ### What problem does this PR solve? Add model provider DeepInfra. This model list comes from our community. NOTE: most endpoints haven't been tested, but they should work as OpenAI does. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-23 18:10:35 +08:00
謝富祥	0d7244e4a4	Fix: Adds newest Gemini models to fit google's standard API rate limits (#8970 ) ### What problem does this PR solve? Adds configurations for gemini-2.5-flash and Gemini 2.5-pro models, including tags, maximum token limits, and model types. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-23 10:18:04 +08:00
Yongteng Lei	ed7bea060f	Feat: add Kimi model series support (#8866 ) ### What problem does this PR solve? Add Kimi model series support. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-16 15:31:57 +08:00
Tuan Le	a9abf9df48	Adds new Voyage embedding models (#8845 ) ### What problem does this PR solve? This PR enhances the application's capabilities by adding support for four new Voyage embedding models (voyage-3-large, voyage-3.5, voyage-3.5-lite, and voyage-code-3) to the `llm_factories.json` configuration file. These models expand the available options for text embedding tasks, enabling improved processing of text data with a maximum token limit of 32,000. This addition addresses the need for more diverse and specialized embedding models to support various use cases without altering existing functionality. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-16 11:41:06 +08:00
Yongteng Lei	1895667573	Feat: add xAI provider (#8781 ) ### What problem does this PR solve? Add xAI provider (experimental feature, requires user feedback). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-11 10:35:23 +08:00
Kevin Hu	fffb7c0bba	Fix: anthropic llm issue. (#8633 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-02 18:37:34 +08:00
Kevin Hu	aafeffa292	Feat: add gitee as LLM provider. (#8545 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-30 09:22:31 +08:00
Yongteng Lei	5e30426916	Feat: add Qwen3-Embedding text-embedding-v4 (#8184 ) ### What problem does this PR solve? Add Qwen3-Embedding text-embedding-v4. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-11 15:32:05 +08:00
Yongteng Lei	37075eab98	Feat: add voyage-multimodal-3 (#7987 ) ### What problem does this PR solve? Add voyage-multimodal-3. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-06-03 11:56:59 +08:00
Yongteng Lei	6c9b8ec860	Refa: update gemini2.5 (#7822 ) ### What problem does this PR solve? Update gemini2.5 ### Type of change - [x] Refactoring	2025-05-23 20:29:10 +08:00
Yongteng Lei	50ff16e7a4	Feat: add claude4 models (#7809 ) ### What problem does this PR solve? Add claude4 models. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:25:13 +08:00
liu an	e166f132b3	Feat: change default models (#7777 ) ### What problem does this PR solve? change default models to buildin models https://github.com/infiniflow/ragflow/issues/7774 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:21:25 +08:00
Debug Doctor	36e32dde1a	Feat: update llm factories for SILICONFLOW (#7620 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Other (please describe): llm factories update	2025-05-14 19:46:27 +08:00
Andrea	e39ceb2bd1	Feat: add support for OpenAi gpt 4.1 series (#7540 ) ### What problem does this PR solve? Adds support for the GPT-4.1 series from OpenAI. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-12 18:24:53 +08:00
QuintinTao	e9053b6ed4	fix bug #7309 deepseek-ai/deepseek-vl2 model can not be select as a VL model to parse pdf image (#7312 ) ### What problem does this PR solve? fix deepseek-ai/deepseek-vl2 model can not be select as a VL model to parse pdf image . And add other vl models config from siliconflow _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: unknown <taoshi.ln@chinatelecom.cn>	2025-05-08 11:24:39 +08:00
Yongteng Lei	093d280528	Feat: add Qwen3 and OpenAI o series (#7415 ) ### What problem does this PR solve? Qwen3 and more LLMs. Close #7296 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-04-29 18:26:29 +08:00
Neal Davis	23dcbc94ef	feat: replace models of novita (#7360 ) ### What problem does this PR solve? Replace models of novita ### Type of change - [x] Other (please describe): Replace models of novita	2025-04-28 13:35:09 +08:00
Jason Li	67b087019c	Update Groq AI Model Config (#7335 ) With current config will get error "Fail to access model(gemma-7b-it) using this api key" Since the model has been removed, according to Groq official document: https://console.groq.com/docs/models ### Type of change - [ x] Bug Fix (non-breaking change which fixes an issue)	2025-04-27 17:05:25 +08:00
Yongteng Lei	018ff4dd0a	Refa: update llms (#7007 ) ### What problem does this PR solve? Update LLM models ### Type of change - [x] Refactoring	2025-04-15 09:19:07 +08:00
Kevin Hu	5b5558300a	Feat: add gemini-2.5-pro-exp-03-25 (#6774 ) ### What problem does this PR solve? #6733 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-03 10:48:58 +08:00
Kevin Hu	fc21dd0a4a	Feat: add qwq-plus-latest (#6702 ) ### What problem does this PR solve? #6697 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-01 11:06:03 +08:00
Kevin Hu	7d9dd1e5d3	Refa: remove default build-in rerank model. (#6682 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-03-31 15:33:19 +08:00
Kevin Hu	d2043ff9f2	Fix: LmStudioChat issue. (#6591 ) ### What problem does this PR solve? #6577 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 14:59:15 +08:00
Chenzy	735d9dd949	Feat: add "tools" to llm_factories.json (#6552 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Chenzy <chenzy901@gmail.com>	2025-03-26 17:31:18 +08:00
Kevin Hu	85eb3775d6	Refa: update Anthropic models. (#6445 ) ### What problem does this PR solve? #6421 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-24 12:34:57 +08:00
crypticGøøse	f16418ccf7	Feat: Add deepseek to llm_factories (#6051 ) ### What problem does this PR solve? AWS Bedrock has made deepseek-r1 available on its serverless inference. This adds the R1 serverless model for use via the bedrock model abilities. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-14 10:35:44 +08:00
kuro5989	6e13922bdc	Feat: Add qwq model support to Tongyi-Qianwen factory (#5981 ) ### What problem does this PR solve? add qwq model support to Tongyi-Qianwen factory https://github.com/infiniflow/ragflow/issues/5869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) ![image](https://github.com/user-attachments/assets/49f5c6a0-ecaf-41dd-a23a-2009f854d62c) ![image](https://github.com/user-attachments/assets/93ffa303-920e-4942-8188-bcd6b7209204) ![1741774779438](https://github.com/user-attachments/assets/25f2fd1d-8640-4df0-9a08-78ee9daaa8fe) ![image](https://github.com/user-attachments/assets/4763cf6c-1f76-43c4-80ee-74dfd666a184) Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>	2025-03-12 18:54:15 +08:00
Kevin Hu	fa817a8ab3	Refa: SiliconFlow model list refresh. (#5825 ) ### What problem does this PR solve? #5806 ### Type of change - [x] Refactoring	2025-03-10 12:51:12 +08:00
Kevin Hu	e05658685c	Refa: update mistral model list. (#5818 ) ### What problem does this PR solve? #5782 ### Type of change - [x] Refactoring	2025-03-10 11:22:06 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
Debug Doctor	202acbd628	Perf: update novita.ai LLM library (#5574 ) ### What problem does this PR solve? LLM library update ### Type of change - [x] Other : config update	2025-03-04 11:35:25 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
petertc	4694604836	Specify img2text model by tag (#5063 ) ### What problem does this PR solve? The current design is not well-suited for multimodal models, as each model can only be configured for a single purpose—either chat or Img2txt. To work around this limitation, we use model aliases such as gpt-4o-mini and gpt-4o-mini-2024-07-18. To fix this, this PR allows specifying the Img2txt model by tag instead of model_type. ### Type of change - [x] Refactoring	2025-02-18 11:14:48 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
so95	754d5ea364	add gemini-2.0-flash-thinking-exp-01-21 (#4957 ) add gemini-2.0-flash-thinking-exp-01-21	2025-02-14 13:31:07 +08:00
DiamondPoirier	a03f5dd9f6	Add a list of large language models of deepseek and image2text models… (#4914 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:52:29 +08:00
DiamondPoirier	415c4b7ed5	Organized and add a list of large language models of Nvidia.v1.1 (#4910 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:10:19 +08:00
Kevin Hu	55823dbdf6	Refresh Gemini model list. (#4780 ) ### What problem does this PR solve? #4761 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-08 10:19:51 +08:00
Kevin Hu	4150805073	More models for siliconflow. (#4756 ) ### What problem does this PR solve? #4751 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-07 10:32:52 +08:00
Kevin Hu	4b9c4c0705	Update deepseek model provider info. (#4714 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-05 13:43:40 +08:00
Kevin Hu	656a2fab41	Refresh deepseek models. (#4660 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-27 11:01:39 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	9c6cf12137	Refactor model list. (#4346 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-01-03 19:55:42 +08:00
petertc	accd3a6c7e	Support OpenAI gpt-4o and gpt-4o-mini for img2text (#4300 ) ### What problem does this PR solve? OpenAI has deprecated the gpt-4-vision-preview model. This PR adds support for the newer gpt-4o and gpt-4o-mini models in the img2text feature. ![image](https://github.com/user-attachments/assets/6dddf2dc-1b9e-4e94-bf07-6bf77d39122b) This PR add addtional 4o/4o-mini entry for img2text besides original ones. Utilized [alias](https://platform.openai.com/docs/models#gpt-4o) model names (e.g., gpt-4o-2024-08-06) because the database schema uses the model name as the primary key. - [x] Other (please describe): model update	2024-12-31 14:36:06 +08:00
Kevin Hu	2cbe064080	Add Llama3.3 (#4174 ) ### What problem does this PR solve? #4168 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-23 11:18:01 +08:00

1 2 3

105 Commits