ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-01-30 07:06:39 +08:00

Author	SHA1	Message	Date
Kevin Hu	c6e1a2ca8a	Feat: add TTS support for SILICONFLOW. (#6264 ) ### What problem does this PR solve? #6244 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-19 12:52:12 +08:00
Yongteng Lei	5cf610af40	Feat: add vision LLM PDF parser (#6173 ) ### What problem does this PR solve? Add vision LLM PDF parser ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-03-18 14:52:20 +08:00
Kevin Hu	e9a6675c40	Fix: enable ollama api-key. (#6205 ) ### What problem does this PR solve? #6189 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-18 13:37:34 +08:00
Kevin Hu	7e4d693054	Fix: in case response.choices[0].message.content is None. (#6190 ) ### What problem does this PR solve? #6164 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-18 10:00:27 +08:00
Kevin Hu	56b228f187	Refa: remove max toekns for image2txt models. (#6078 ) ### What problem does this PR solve? #6063 ### Type of change - [x] Refactoring	2025-03-14 13:51:45 +08:00
writinwaters	9c8060f619	0.17.1 release notes (#6021 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-13 14:43:24 +08:00
Kevin Hu	3571270191	Refa: refine the context window size warning. (#5993 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-03-12 19:40:54 +08:00
kuro5989	6e13922bdc	Feat: Add qwq model support to Tongyi-Qianwen factory (#5981 ) ### What problem does this PR solve? add qwq model support to Tongyi-Qianwen factory https://github.com/infiniflow/ragflow/issues/5869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) ![image](https://github.com/user-attachments/assets/49f5c6a0-ecaf-41dd-a23a-2009f854d62c) ![image](https://github.com/user-attachments/assets/93ffa303-920e-4942-8188-bcd6b7209204) ![1741774779438](https://github.com/user-attachments/assets/25f2fd1d-8640-4df0-9a08-78ee9daaa8fe) ![image](https://github.com/user-attachments/assets/4763cf6c-1f76-43c4-80ee-74dfd666a184) Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>	2025-03-12 18:54:15 +08:00
Edouard Hur	b29539b442	Fix: CoHereRerank not respecting base_url when provided (#5784 ) ### What problem does this PR solve? vLLM provider with a reranking model does not work : as vLLM uses under the hood the [CoHereRerank provider](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/__init__.py#L250) with a `base_url`, if this URL [is not passed to the Cohere client](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/rerank_model.py#L379-L382) any attempt will endup on the Cohere SaaS (sending your private api key in the process) instead of your vLLM instance. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-03-10 11:22:06 +08:00
Kevin Hu	df9b7b2fe9	Fix: rerank issue. (#5696 ) ### What problem does this PR solve? #5673 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-06 15:05:19 +08:00
Kevin Hu	251ba7f058	Refa: remove max tokens since no one needs it. (#5690 ) ### What problem does this PR solve? #5646 #5640 ### Type of change - [x] Refactoring	2025-03-06 11:29:40 +08:00
Kevin Hu	b8da2eeb69	Feat: support huggingface re-rank model. (#5684 ) ### What problem does this PR solve? #5658 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-06 10:44:04 +08:00
Kevin Hu	4c9a3e918f	Fix: add image2text issue. (#5431 ) ### What problem does this PR solve? #5356 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-27 14:06:49 +08:00
Yongteng Lei	0284248c93	Fix: correct wrong vLLM rerank model (#5399 ) ### What problem does this PR solve? Correct wrong vLLM rerank model #4316 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 18:59:36 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
Kevin Hu	4e2afcd3b8	Fix FlagRerank max_length issue. (#5366 ) ### What problem does this PR solve? #5352 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 11:01:13 +08:00
Kevin Hu	955801db2e	Resolve super class invokation error. (#5337 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-25 17:42:29 +08:00
Kevin Hu	daddfc9e1b	Remove dup gb2312, solve currupt error. (#5326 ) ### What problem does this PR solve? #5252 #5325 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-25 12:22:37 +08:00
Kevin Hu	df3d0f61bd	Fix base url missing for deepseek from Tongyi. (#5294 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-24 15:43:32 +08:00
Kevin Hu	ec96426c00	Tongyi adapts deepseek. (#5285 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-24 14:04:25 +08:00
liwenju0	569e40544d	Refactor rerank model with dynamic batch processing and memory manage… (#5273 ) …ment ### What problem does this PR solve? Issue：https://github.com/infiniflow/ragflow/issues/5262 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-24 11:32:08 +08:00
Omar Leonardo Sanchez Granados	4f2816c01c	Add support to boto3 default connection (#5246 ) ### What problem does this PR solve? This pull request includes changes to the initialization logic of the `ChatModel` and `EmbeddingModel` classes to enhance the handling of AWS credentials. Use cases: - Use env variables for credentials instead of managing them on the DB - Easy connection when deploying on an AWS machine ### Type of change - [X] New Feature (non-breaking change which adds functionality)	2025-02-24 11:01:14 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
Kevin Hu	1a755e75c5	Remove v1 (#5220 ) ### What problem does this PR solve? #5201 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 15:15:38 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add a LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
Kevin Hu	b08bb56f6c	Display thinking for deepseek r1 (#4904 ) ### What problem does this PR solve? #4903 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 15:43:13 +08:00
Kevin Hu	2aa0cdde8f	Fix Gemini chat issue. (#4757 ) ### What problem does this PR solve? #4753 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-07 12:00:19 +08:00
Kyle	036f37a627	fix: err object has no attribute 'iter_lines' (#4686 ) ### What problem does this PR solve? ERROR: 'Stream' object has no attribute 'iter_lines' with reference to Claude/Anthropic chat streams ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Kyle Olmstead <k.olmstead@offensive-security.com>	2025-02-01 22:39:30 +08:00
Kevin Hu	4776fa5e4e	Refactor for total_tokens. (#4652 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-26 13:54:26 +08:00
writinwaters	2cb8edc42c	Added GPUStack (#4649 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-01-26 12:25:02 +08:00
Kevin Hu	f1d9f4290e	Fix TogetherAIEmbed. (#4623 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-24 10:29:30 +08:00
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585 ) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-22 19:43:14 +08:00
Kevin Hu	3805621564	Fix xinference rerank issue. (#4499 ) ### What problem does this PR solve? #4495 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-16 11:35:51 +08:00
Kevin Hu	be5f830878	Truncate text for zhipu embedding. (#4490 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-15 14:36:27 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	b93c136797	Fix gemini embedding error. (#4356 ) ### What problem does this PR solve? #4314 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-06 14:41:29 +08:00
Yingfeng	50f209204e	Synchronize with enterprise version (#4325 ) ### Type of change - [x] Refactoring	2025-01-02 13:44:44 +08:00
Jin Hai	4abc144d3d	Fix error of changing embedding model (#4184 ) ### What problem does this PR solve? 1. Change embedding model of knowledge base won't change the default embedding model. 2. Retrieval test bug ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-12-23 16:23:54 +08:00
Kevin Hu	cb45431412	Fix Voyage re-rank model. Limit file name length. (#4171 ) ### What problem does this PR solve? #4152 #4154 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-23 10:03:50 +08:00
Kevin Hu	d8fca43017	Make fast embed and default embed mutually exclusive. (#4121 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-12-19 17:27:09 +08:00
Kevin Hu	7474348394	Fix fastembed reloading issue. (#4117 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-19 16:18:18 +08:00
Kevin Hu	044afa83d1	Fix transformers dependencies for slim. (#3934 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-09 14:21:37 +08:00
Zhichang Yu	0d68a6cd1b	Fix errors detected by Ruff (#3918 ) ### What problem does this PR solve? Fix errors detected by Ruff ### Type of change - [x] Refactoring	2024-12-08 14:21:12 +08:00
Kevin Hu	593ffc4067	Fix HuggingFace model error. (#3870 ) ### What problem does this PR solve? #3865 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 13:28:42 +08:00
Kevin Hu	78601ee1bd	Fix open AI compatible rerank issue. (#3866 ) ### What problem does this PR solve? #3700 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 10:26:21 +08:00
Kevin Hu	3f3469130b	Fix preview issue in file manager. (#3846 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-04 11:53:23 +08:00
Jin Hai	6657ca7cde	Change default error message to English (#3838 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2024-12-04 09:34:49 +08:00
Zhichang Yu	92ab7ef659	Refactor embedding batch_size (#3825 ) ### What problem does this PR solve? Refactor embedding batch_size. Close #3657 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2024-12-03 16:22:39 +08:00
Kevin Hu	6a0583f5ad	Fix voyage embedding. (#3818 ) ### What problem does this PR solve? #3816 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-03 09:33:54 +08:00
Zhichang Yu	d19f059f34	Detect invalid response from api.siliconflow.cn (#3792 ) ### What problem does this PR solve? Detect invalid response from api.siliconflow.cn. Close #2643 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-02 12:55:05 +08:00

1 2 3 4 5 ...

256 Commits