ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-08 20:42:30 +08:00

Author	SHA1	Message	Date
Edouard Hur	b29539b442	Fix: CoHereRerank not respecting base_url when provided (#5784 ) ### What problem does this PR solve? The vLLM provider does not work with a reranking model: vLLM uses the [CoHereRerank provider](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/__init__.py#L250) under the hood with a `base_url`, and if this URL [is not passed to the Cohere client](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/rerank_model.py#L379-L382), every request ends up at the Cohere SaaS (sending your private API key in the process) instead of your vLLM instance. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-10 11:22:06 +08:00
Kevin Hu	df9b7b2fe9	Fix: rerank issue. (#5696 ) ### What problem does this PR solve? #5673 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-06 15:05:19 +08:00
Kevin Hu	251ba7f058	Refa: remove max tokens since no one needs it. (#5690 ) ### What problem does this PR solve? #5646 #5640 ### Type of change - [x] Refactoring	2025-03-06 11:29:40 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
Kevin Hu	4c9a3e918f	Fix: add image2text issue. (#5431 ) ### What problem does this PR solve? #5356 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-27 14:06:49 +08:00
Yongteng Lei	0284248c93	Fix: correct wrong vLLM rerank model (#5399 ) ### What problem does this PR solve? Correct wrong vLLM rerank model #4316 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 18:59:36 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Ready to add VLLM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
Kevin Hu	4e2afcd3b8	Fix FlagRerank max_length issue. (#5366 ) ### What problem does this PR solve? #5352 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 11:01:13 +08:00
Kevin Hu	955801db2e	Resolve super class invocation error. (#5337 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-25 17:42:29 +08:00
Kevin Hu	daddfc9e1b	Remove dup gb2312, solve corrupt error. (#5326 ) ### What problem does this PR solve? #5252 #5325 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-25 12:22:37 +08:00
Kevin Hu	df3d0f61bd	Fix base url missing for deepseek from Tongyi. (#5294 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-24 15:43:32 +08:00
Kevin Hu	ec96426c00	Tongyi adapts deepseek. (#5285 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-24 14:04:25 +08:00
liwenju0	569e40544d	Refactor rerank model with dynamic batch processing and memory management (#5273 ) ### What problem does this PR solve? Issue: https://github.com/infiniflow/ragflow/issues/5262 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-24 11:32:08 +08:00
Omar Leonardo Sanchez Granados	4f2816c01c	Add support to boto3 default connection (#5246 ) ### What problem does this PR solve? This pull request includes changes to the initialization logic of the `ChatModel` and `EmbeddingModel` classes to enhance the handling of AWS credentials. Use cases: - Use env variables for credentials instead of managing them on the DB - Easy connection when deploying on an AWS machine ### Type of change - [X] New Feature (non-breaking change which adds functionality)	2025-02-24 11:01:14 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
Kevin Hu	1a755e75c5	Remove v1 (#5220 ) ### What problem does this PR solve? #5201 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 15:15:38 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
Kevin Hu	b08bb56f6c	Display thinking for deepseek r1 (#4904 ) ### What problem does this PR solve? #4903 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 15:43:13 +08:00
Kevin Hu	2aa0cdde8f	Fix Gemini chat issue. (#4757 ) ### What problem does this PR solve? #4753 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-07 12:00:19 +08:00
Kyle	036f37a627	fix: err object has no attribute 'iter_lines' (#4686 ) ### What problem does this PR solve? ERROR: 'Stream' object has no attribute 'iter_lines' with reference to Claude/Anthropic chat streams ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Kyle Olmstead <k.olmstead@offensive-security.com>	2025-02-01 22:39:30 +08:00
Kevin Hu	4776fa5e4e	Refactor for total_tokens. (#4652 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-26 13:54:26 +08:00
writinwaters	2cb8edc42c	Added GPUStack (#4649 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-01-26 12:25:02 +08:00
Kevin Hu	f1d9f4290e	Fix TogetherAIEmbed. (#4623 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-24 10:29:30 +08:00
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585 ) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-22 19:43:14 +08:00
Kevin Hu	3805621564	Fix xinference rerank issue. (#4499 ) ### What problem does this PR solve? #4495 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-16 11:35:51 +08:00
Kevin Hu	be5f830878	Truncate text for zhipu embedding. (#4490 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-15 14:36:27 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	b93c136797	Fix gemini embedding error. (#4356 ) ### What problem does this PR solve? #4314 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-06 14:41:29 +08:00
Yingfeng	50f209204e	Synchronize with enterprise version (#4325 ) ### Type of change - [x] Refactoring	2025-01-02 13:44:44 +08:00
Jin Hai	4abc144d3d	Fix error of changing embedding model (#4184 ) ### What problem does this PR solve? 1. Change embedding model of knowledge base won't change the default embedding model. 2. Retrieval test bug ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-12-23 16:23:54 +08:00
Kevin Hu	cb45431412	Fix Voyage re-rank model. Limit file name length. (#4171 ) ### What problem does this PR solve? #4152 #4154 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-23 10:03:50 +08:00
Kevin Hu	d8fca43017	Make fast embed and default embed mutually exclusive. (#4121 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-12-19 17:27:09 +08:00
Kevin Hu	7474348394	Fix fastembed reloading issue. (#4117 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-19 16:18:18 +08:00
Kevin Hu	044afa83d1	Fix transformers dependencies for slim. (#3934 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-09 14:21:37 +08:00
Zhichang Yu	0d68a6cd1b	Fix errors detected by Ruff (#3918 ) ### What problem does this PR solve? Fix errors detected by Ruff ### Type of change - [x] Refactoring	2024-12-08 14:21:12 +08:00
Kevin Hu	593ffc4067	Fix HuggingFace model error. (#3870 ) ### What problem does this PR solve? #3865 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 13:28:42 +08:00
Kevin Hu	78601ee1bd	Fix open AI compatible rerank issue. (#3866 ) ### What problem does this PR solve? #3700 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 10:26:21 +08:00
Kevin Hu	3f3469130b	Fix preview issue in file manager. (#3846 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-04 11:53:23 +08:00
Jin Hai	6657ca7cde	Change default error message to English (#3838 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2024-12-04 09:34:49 +08:00
Zhichang Yu	92ab7ef659	Refactor embedding batch_size (#3825 ) ### What problem does this PR solve? Refactor embedding batch_size. Close #3657 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2024-12-03 16:22:39 +08:00
Kevin Hu	6a0583f5ad	Fix voyage embedding. (#3818 ) ### What problem does this PR solve? #3816 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-03 09:33:54 +08:00
Zhichang Yu	d19f059f34	Detect invalid response from api.siliconflow.cn (#3792 ) ### What problem does this PR solve? Detect invalid response from api.siliconflow.cn. Close #2643 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-02 12:55:05 +08:00
devMls	59a5813f1b	add jina new models in jina connector (#3770 ) ### What problem does this PR solve? Add new models to the jina connector to allow the use of multilingual models. ### Type of change - [X] Other (please describe): new connectors no breaking change	2024-12-02 10:06:39 +08:00
Zhichang Yu	d94386e00a	Pass top_p to ollama (#3744 ) ### What problem does this PR solve? Pass top_p to ollama. Close #1769 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-29 14:52:27 +08:00
Kevin Hu	91f1814a87	Fix error response (#3719 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2024-11-28 18:56:10 +08:00
Kevin Hu	57208d8e53	Fix batch size issue. (#3675 ) ### What problem does this PR solve? #3657 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-27 18:06:43 +08:00
liuhua	8b35776916	Fix a bug in VolcEngine (#3658 ) ### What problem does this PR solve? Fix a bug in VolcEngine #3553 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-11-27 09:30:49 +08:00
Kevin Hu	0891a393d7	Let ThreadPool exit gracefully. (#3653 ) ### What problem does this PR solve? #3646 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-26 16:31:07 +08:00
Kevin Hu	e5af18d5ea	Update docs for v0.14.0 (#3625 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2024-11-25 11:37:56 +08:00
liwenju0	875096384b	when qwen rerank model not return ok, raise exception to notice user (#3593 ) ### What problem does this PR solve? When calling the Qwen rerank model, if the model does not return correctly, an exception should be raised to notify the user, rather than simply returning a value of 0, as this would be confusing to the user. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-22 22:34:34 +08:00
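
The behaviour described in the last entry above (875096384b), raising an exception when the rerank call fails instead of quietly returning 0, could look roughly like the sketch below. It is an illustration only, not the code from the PR: `RerankResponse` and `extract_scores` are hypothetical names, and the response shape (a status code, a message, and a list of scores) is an assumption.

```python
from dataclasses import dataclass, field
from http import HTTPStatus


@dataclass
class RerankResponse:
    """Hypothetical stand-in for the object a rerank API call returns."""
    status_code: int
    message: str = ""
    scores: list[float] = field(default_factory=list)


def extract_scores(resp: RerankResponse, n_texts: int) -> list[float]:
    """Return one score per input text, or raise if the call did not succeed."""
    if resp.status_code != HTTPStatus.OK:
        # Surface the failure instead of returning 0, which would look
        # like a valid (but very low) relevance score to the caller.
        raise ValueError(
            f"rerank request failed: {resp.status_code} {resp.message}"
        )
    if len(resp.scores) != n_texts:
        raise ValueError(
            f"expected {n_texts} scores, got {len(resp.scores)}"
        )
    return resp.scores
```

A caller would then wrap `extract_scores` in its own error handling and show the message to the user, rather than ranking documents with placeholder zeros.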

248 Commits