ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-01-30 15:16:45 +08:00

Author	SHA1	Message	Date
pyyuhao	c8c3b756b0	Feat: Adds OpenSearch2.19.1 as the vector_database support (#7140 ) ### What problem does this PR solve? This PR adds the support for latest OpenSearch2.19.1 as the store engine & search engine option for RAGFlow. ### Main Benefit 1. OpenSearch2.19.1 is licensed under the [Apache v2.0 License] which is much better than Elasticsearch 2. For search, OpenSearch2.19.1 supports full-text search、vector_search、hybrid_search those are similar with Elasticsearch on schema 3. For store, OpenSearch2.19.1 stores text、vector those are quite simliar with Elasticsearch on schema ### Changes - Support opensearch_python_connetor. I make a lot of adaptions since the schema and api/method between ES and Opensearch differs in many ways(especially the knn_search has a significant gap) : rag/utils/opensearch_coon.py - Support static config adaptions by changing: conf/service_conf.yaml、api/settings.py、rag/settings.py - Supprt some store&search schema changes between OpenSearch and ES: conf/os_mapping.json - Support OpenSearch python sdk : pyproject.toml - Support docker config for OpenSearch2.19.1 : docker/.env、docker/docker-compose-base.yml、docker/service_conf.yaml.template ### How to use - I didn't change the priority that ES as the default doc/search engine. Only if in docker/.env , we set DOC_ENGINE=${DOC_ENGINE:-opensearch}, it will work. ### Others Our team tested a lot of docs in our environment by using OpenSearch as the vector database ,it works very well. All the conifg for OpenSearch is necessary. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Yongteng Lei <yongtengrey@outlook.com> Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com> Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2025-04-24 16:03:31 +08:00
Yongteng Lei	018ff4dd0a	Refa: update llms (#7007 ) ### What problem does this PR solve? Update LLM models ### Type of change - [x] Refactoring	2025-04-15 09:19:07 +08:00
Kevin Hu	5b5558300a	Feat: add gemini-2.5-pro-exp-03-25 (#6774 ) ### What problem does this PR solve? #6733 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-03 10:48:58 +08:00
Kevin Hu	fc21dd0a4a	Feat: add qwq-plus-latest (#6702 ) ### What problem does this PR solve? #6697 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-04-01 11:06:03 +08:00
Kevin Hu	7d9dd1e5d3	Refa: remove default build-in rerank model. (#6682 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-03-31 15:33:19 +08:00
Zhichang Yu	65a8cd1772	Fix knowledge_graph_kwd on infinity. Close #6476 and #6624 (#6651 ) ### What problem does this PR solve? Fix knowledge_graph_kwd on infinity. Close #6476 and #6624 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-28 22:05:40 +08:00
Kevin Hu	d2043ff9f2	Fix: LmStudioChat issue. (#6591 ) ### What problem does this PR solve? #6577 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-27 14:59:15 +08:00
Chenzy	735d9dd949	Feat: add "tools" to llm_factories.json (#6552 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Chenzy <chenzy901@gmail.com>	2025-03-26 17:31:18 +08:00
Zhichang Yu	6bf26e2a81	Optimize graphrag again (#6513 ) ### What problem does this PR solve? Removed set_entity and set_relation to avoid accessing doc engine during graph computation. Introduced GraphChange to avoid writing unchanged chunks. ### Type of change - [x] Performance Improvement	2025-03-26 15:34:42 +08:00
Kevin Hu	85eb3775d6	Refa: update Anthropic models. (#6445 ) ### What problem does this PR solve? #6421 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-24 12:34:57 +08:00
crypticGøøse	f16418ccf7	Feat: Add deepseek to llm_factories (#6051 ) ### What problem does this PR solve? AWS Bedrock has made deepseek-r1 available on its serverless inference. This adds the R1 serverless model for use via the bedrock model abilities. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-14 10:35:44 +08:00
kuro5989	6e13922bdc	Feat: Add qwq model support to Tongyi-Qianwen factory (#5981 ) ### What problem does this PR solve? add qwq model support to Tongyi-Qianwen factory https://github.com/infiniflow/ragflow/issues/5869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) ![image](https://github.com/user-attachments/assets/49f5c6a0-ecaf-41dd-a23a-2009f854d62c) ![image](https://github.com/user-attachments/assets/93ffa303-920e-4942-8188-bcd6b7209204) ![1741774779438](https://github.com/user-attachments/assets/25f2fd1d-8640-4df0-9a08-78ee9daaa8fe) ![image](https://github.com/user-attachments/assets/4763cf6c-1f76-43c4-80ee-74dfd666a184) Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>	2025-03-12 18:54:15 +08:00
Kevin Hu	fa817a8ab3	Refa: SiliconFlow model list refresh. (#5825 ) ### What problem does this PR solve? #5806 ### Type of change - [x] Refactoring	2025-03-10 12:51:12 +08:00
Kevin Hu	e05658685c	Refa: update mistral model list. (#5818 ) ### What problem does this PR solve? #5782 ### Type of change - [x] Refactoring	2025-03-10 11:22:06 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
汪威	76e8285904	use to_df replace to_pl when get infinity Result (#5604 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Performance Improvement --------- Co-authored-by: wangwei <dwxiayi@163.com>	2025-03-05 09:35:40 +08:00
Debug Doctor	202acbd628	Perf: update novita.ai LLM library (#5574 ) ### What problem does this PR solve? LLM library update ### Type of change - [x] Other : config update	2025-03-04 11:35:25 +08:00
yihong	79bc9d97c9	Refa: better service conf (#5471 ) ### What problem does this PR solve? This patch fix most of the issues like #4853 #5038 and so on the root reason is that we need to add the hostname to the `/etc/hosts` which is not wrote in main README and the code side read `conf/service_conf.yaml` as settings and its hard for developers to debug, this patch fix it, or maybe can discuss better solution here ### Type of change - [x] Refactoring Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-02-28 14:28:00 +08:00
hy89	651422127c	Feat: Accessing Alibaba Cloud OSS with Amazon S3 SDK (#5438 ) Accessing Alibaba Cloud OSS with Amazon S3 SDK	2025-02-27 17:02:42 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
petertc	4694604836	Specify img2text model by tag (#5063 ) ### What problem does this PR solve? The current design is not well-suited for multimodal models, as each model can only be configured for a single purpose—either chat or Img2txt. To work around this limitation, we use model aliases such as gpt-4o-mini and gpt-4o-mini-2024-07-18. To fix this, this PR allows specifying the Img2txt model by tag instead of model_type. ### Type of change - [x] Refactoring	2025-02-18 11:14:48 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
so95	754d5ea364	add gemini-2.0-flash-thinking-exp-01-21 (#4957 ) add gemini-2.0-flash-thinking-exp-01-21	2025-02-14 13:31:07 +08:00
DiamondPoirier	a03f5dd9f6	Add a list of large language models of deepseek and image2text models… (#4914 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:52:29 +08:00
DiamondPoirier	415c4b7ed5	Organized and add a list of large language models of Nvidia.v1.1 (#4910 ) ### What problem does this PR solve? #4870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 17:10:19 +08:00
Kevin Hu	0d3ed37b48	Make the update script shorter. (#4854 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-10 18:18:49 +08:00
Kevin Hu	b48c85dcf9	Increase ES update script length. (#4785 ) ### What problem does this PR solve? #4749 ### Type of change - [x] Performance Improvement	2025-02-08 11:03:31 +08:00
Kevin Hu	55823dbdf6	Refresh Gemini model list. (#4780 ) ### What problem does this PR solve? #4761 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-08 10:19:51 +08:00
Kevin Hu	4150805073	More models for siliconflow. (#4756 ) ### What problem does this PR solve? #4751 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-07 10:32:52 +08:00
Kevin Hu	4b9c4c0705	Update deepseek model provider info. (#4714 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-05 13:43:40 +08:00
Kevin Hu	50055c47ec	Infinity mapping refine. (#4665 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-01-27 18:53:49 +08:00
Kevin Hu	6f30397bb5	Infinity adapt to graphrag. (#4663 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-27 18:35:18 +08:00
Kevin Hu	656a2fab41	Refresh deepseek models. (#4660 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-27 11:01:39 +08:00
Kevin Hu	71c132f76d	Make infinity adapt (#4635 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-24 17:45:04 +08:00
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585 ) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-22 19:43:14 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	c5da3cdd97	Tagging (#4426 ) ### What problem does this PR solve? #4367 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-09 17:07:21 +08:00
Kenny Dizi	bad764bcda	Improve storage engine (#4341 ) ### What problem does this PR solve? - Bring `STORAGE_IMPL` back in `rag/svr/cache_file_svr.py` - Simplify storage connection when working with AWS S3 ### Type of change - [x] Refactoring	2025-01-06 12:06:24 +08:00
Kevin Hu	9c6cf12137	Refactor model list. (#4346 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-01-03 19:55:42 +08:00
petertc	accd3a6c7e	Support OpenAI gpt-4o and gpt-4o-mini for img2text (#4300 ) ### What problem does this PR solve? OpenAI has deprecated the gpt-4-vision-preview model. This PR adds support for the newer gpt-4o and gpt-4o-mini models in the img2text feature. ![image](https://github.com/user-attachments/assets/6dddf2dc-1b9e-4e94-bf07-6bf77d39122b) This PR add addtional 4o/4o-mini entry for img2text besides original ones. Utilized [alias](https://platform.openai.com/docs/models#gpt-4o) model names (e.g., gpt-4o-2024-08-06) because the database schema uses the model name as the primary key. - [x] Other (please describe): model update	2024-12-31 14:36:06 +08:00
Kevin Hu	2cbe064080	Add Llama3.3 (#4174 ) ### What problem does this PR solve? #4168 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-23 11:18:01 +08:00
so95	478da3118c	add gemini 2.0 (#4115 ) add gemini 2.0	2024-12-19 17:30:45 +08:00
Kevin Hu	ce1e855328	Upgrades Document Layout Analysis model. (#4054 ) ### What problem does this PR solve? #4052 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-17 11:27:19 +08:00
Zhichang Yu	0bca46ac3a	Migrate infinity at startup (#3858 ) ### What problem does this PR solve? Migrate infinity at startup #3809 https://github.com/infiniflow/infinity/issues/2321 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-13 13:43:56 +08:00
Zhichang Yu	03f00c9e6f	Rename page_num_list, top_list, position_list (#3940 ) ### What problem does this PR solve? Rename page_num_list, top_list, position_list to page_num_int, top_int, position_int ### Type of change - [x] Refactoring	2024-12-10 16:32:58 +08:00
Kevin Hu	56f473b680	Feat: Add question parameter to edit chunk modal (#3875 ) ### What problem does this PR solve? Close #3873 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-05 14:51:19 +08:00
Kevin Hu	1b817a5b4c	Refine synonym query. (#3855 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-12-04 17:20:12 +08:00
Kevin Hu	934dbc2e2b	Add more mistral models. (#3826 ) ### What problem does this PR solve? #3647 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-03 15:18:38 +08:00
Kevin Hu	74b28ef1b0	Add pagerank to KB. (#3809 ) ### What problem does this PR solve? #3794 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-12-03 14:30:35 +08:00

1 2 3

124 Commits