Commit Graph

712 Commits

dba0caa00b Fix update_progress (#6340)
### What problem does this PR solve?

Fix update_progress

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-20 17:01:28 +08:00
95497b4aab Fix: adapt to old configurations. (#6321)
### What problem does this PR solve?

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-20 14:50:59 +08:00
5b04b7d972 Fix: rerank with vllm issue. (#6306)
### What problem does this PR solve?

#6301

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-20 11:52:42 +08:00
9611185eb4 Feat: add VLM-boosted DocX parser (#6307)
### What problem does this PR solve?

Add VLM-boosted DocX parser

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-20 11:24:44 +08:00
e4380843c4 Feat: add fallback for PDF figure parser (#6305)
### What problem does this PR solve?

Add fallback for PDF figure parser

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-20 10:48:38 +08:00
046f0bba74 Fix: optimize setting config initialization to resolve Minio initialization error (#6282)
### What problem does this PR solve?

Optimize settings initialization to resolve a Minio initialization error
caused by using a different storage backend.

Reproduction Scenario:
Using Aliyun OSS as the backend storage with the STORAGE_IMPL
environment variable set to OSS.
The service_conf.yaml.template configuration file contains OSS-related
configurations, while other storage configurations are commented out.
When the service starts, it still attempts to initialize Minio storage;
since there is no Minio configuration in service_conf.yaml.template,
startup fails with a missing-configuration error.

Optimization Measures:
Determine which storage configuration to initialize from the environment
variable, and do not initialize configurations for unused resources.
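
A minimal sketch of that selection logic, assuming a YAML config file and backend section names such as `minio`/`oss` (the file name, keys, and helper below are illustrative, not RAGFlow's actual layout):

```python
import os
import yaml

def load_selected_storage_config(path="service_conf.yaml"):
    """Load only the storage section selected by STORAGE_IMPL."""
    storage_impl = os.environ.get("STORAGE_IMPL", "MINIO").upper()
    section_key = {"MINIO": "minio", "OSS": "oss", "S3": "s3"}.get(storage_impl)
    if section_key is None:
        raise ValueError(f"Unsupported STORAGE_IMPL: {storage_impl}")

    with open(path, "r", encoding="utf-8") as f:
        conf = yaml.safe_load(f) or {}

    section = conf.get(section_key)
    if section is None:
        raise KeyError(f"Missing '{section_key}' section in {path}")
    # Other backends are never touched, so their config may stay commented out.
    return section_key, section
```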

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-20 10:45:40 +08:00
1d6760dd84 Feat: add VLM-boosted PDF parser (#6278)
### What problem does this PR solve?

Add VLM-boosted PDF parser if VLM is set.

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-20 09:39:32 +08:00
bb869aca33 Fix get_unacked_iterator (#6280)
### What problem does this PR solve?

Fix get_unacked_iterator. Close #6132 

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-19 17:46:58 +08:00
9cad60fa6d Fix: Add a basic example when the example of content_tagging is empty (#6276)
### What problem does this PR solve?

When using an LLM for auto-tagging, if no examples are provided, the tag
format generated by the LLM may be wrong, which causes Elasticsearch
insert errors. Adding a basic example avoids this problem.
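
A hedged sketch of that fallback, assuming tag examples are passed to the prompt as a list of `{text, tags}` objects (the default example content below is invented for illustration):

```python
import json

# Hypothetical built-in example used when no content_tagging examples exist;
# its only job is to pin down the expected JSON shape for the LLM.
DEFAULT_TAG_EXAMPLE = [{
    "text": "RAGFlow supports parsing PDF and DOCX files.",
    "tags": {"document parsing": 8, "file formats": 5},
}]

def build_tag_examples(examples):
    """Return the user-provided examples, or the basic fallback if empty."""
    if not examples:
        examples = DEFAULT_TAG_EXAMPLE
    return json.dumps(examples, ensure_ascii=False, indent=2)
```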

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-19 17:30:47 +08:00
c6e1a2ca8a Feat: add TTS support for SILICONFLOW. (#6264)
### What problem does this PR solve?

#6244

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-19 12:52:12 +08:00
49086964b8 Fix: type violations. (#6262)
### What problem does this PR solve?

#6238
### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-19 12:12:34 +08:00
dd81c30976 Fix: tag_feas deletion error. (#6257)
### What problem does this PR solve?

#6218

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-19 11:25:11 +08:00
a087d13ccb Feat: text file support position retaining. (#6231)
### What problem does this PR solve?

#5832

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-18 16:55:11 +08:00
6e8d0e3177 Fix: rank feat issue. (#6225)
### What problem does this PR solve?


### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-18 16:07:29 +08:00
5cf610af40 Feat: add vision LLM PDF parser (#6173)
### What problem does this PR solve?

Add vision LLM PDF parser

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2025-03-18 14:52:20 +08:00
e9a6675c40 Fix: enable ollama api-key. (#6205)
### What problem does this PR solve?

#6189

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-18 13:37:34 +08:00
1333d3c02a Fix: float transfer exception. (#6197)
### What problem does this PR solve?

#6177

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-18 11:13:44 +08:00
7e4d693054 Fix: in case response.choices[0].message.content is None. (#6190)
### What problem does this PR solve?

#6164
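
A minimal sketch of the guard, assuming an OpenAI-style chat-completion response object as referenced in the title:

```python
def safe_content(response):
    """Return the assistant text, or an empty string if the model returned None."""
    content = response.choices[0].message.content
    return content if content is not None else ""
```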

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-18 10:00:27 +08:00
3a99c2b5f4 Refa: PARALLEL_DEVICES is a static parameter. (#6168)
### What problem does this PR solve?


### Type of change

- [x] Refactoring
2025-03-17 16:49:54 +08:00
fabc5e9259 Refa: fix re-rank scope. (#6152)
### What problem does this PR solve?

#6140

### Type of change


- [x] Refactoring
2025-03-17 13:26:29 +08:00
bfa8d342b3 Fix: retrieval debug mode issue. (#6150)
### What problem does this PR solve?

#6139

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-17 13:07:13 +08:00
3e19044dee Feat: add multi-GPU and parallel processing support to OCR (#5972)
### What problem does this PR solve?

Add multi-GPU and parallel processing support to OCR

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

@yuzhichang I've tried to resolve the comments in #5697. OCR jobs can
now run on both CPU and GPU. (By the way, I've encountered a
“Generate embedding error” issue #5954 that might be due to my outdated
GPUs; I'm not sure.) Please review it and give me suggestions.

GPU:

![gpu_ocr](https://github.com/user-attachments/assets/0ee2ecfb-a665-4e50-8bc7-15941b9cd80e)

![smi](https://github.com/user-attachments/assets/a2312f8c-cf24-443d-bf89-bec50503546d)

CPU:

![cpu_ocr](https://github.com/user-attachments/assets/1ba6bb0b-94df-41ea-be79-790096da4bf1)
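
A rough sketch of how OCR jobs can be fanned out across the available devices; device detection and the `run_ocr` worker below are placeholders rather than the PR's actual code:

```python
from concurrent.futures import ProcessPoolExecutor

def detect_devices():
    """Return one slot per visible GPU, or a single CPU slot as fallback."""
    try:
        import torch
        if torch.cuda.is_available():
            return list(range(torch.cuda.device_count()))
    except ImportError:
        pass
    return ["cpu"]

def run_ocr(job):
    page_image, device = job
    # Placeholder: a real worker would load the OCR model on `device`
    # and return the recognized text boxes for the page.
    return f"ocr({page_image}) on {device}"

def parallel_ocr(pages):
    devices = detect_devices()
    jobs = [(page, devices[i % len(devices)]) for i, page in enumerate(pages)]
    with ProcessPoolExecutor(max_workers=len(devices)) as pool:
        return list(pool.map(run_ocr, jobs))
```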
2025-03-17 11:58:40 +08:00
89a69eed72 Introduced task priority (#6118)
### What problem does this PR solve?

Introduced task priority

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-14 23:43:46 +08:00
e5a8b23684 Fix: empty tag field issue. (#6103)
### What problem does this PR solve?

#6102

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-14 17:35:57 +08:00
4fffee6695 Respect kb_id in Elasticsearch insert, update, and delete (#6105)
### What problem does this PR solve?

Respect kb_id in Elasticsearch insert, update, and delete. Close #6066

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-14 17:34:02 +08:00
485bc7d7d6 Fix: limit the depth of DFS (#6101)
### What problem does this PR solve?

#6085

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-14 17:10:38 +08:00
5d75b6be62 Fix executor name (#6080)
### What problem does this PR solve?

Fix executor name

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-14 14:13:47 +08:00
56b228f187 Refa: remove max tokens for image2txt models. (#6078)
### What problem does this PR solve?

#6063

### Type of change


- [x] Refactoring
2025-03-14 13:51:45 +08:00
2d4a60cae6 Fix: Reduce excessive IO operations by loading LLM factory configurations (#6047)

### What problem does this PR solve?

This PR fixes an issue where the application was repeatedly reading the
llm_factories.json file from disk in multiple places, which could lead
to "Too many open files" errors under high load conditions. The fix
centralizes the file reading operation in the settings.py module and
stores the data in a global variable that can be accessed by other
modules.
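
A minimal sketch of that pattern (the variable and function names are illustrative; the real change lives in `settings.py`):

```python
# settings.py (sketch): read llm_factories.json once and reuse the result.
import json

_LLM_FACTORIES = None  # module-level cache shared by every importer

def get_llm_factories(path="conf/llm_factories.json"):
    """Load the LLM factory configuration once; later calls hit the cache."""
    global _LLM_FACTORIES
    if _LLM_FACTORIES is None:
        with open(path, "r", encoding="utf-8") as f:
            _LLM_FACTORIES = json.load(f)
    return _LLM_FACTORIES
```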

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [x] Performance Improvement
- [ ] Other (please describe):
2025-03-14 09:54:38 +08:00
4ff609b6a8 Fix: optimize OCR garbage identification to reduce unnecessary filtering (#6027)
### What problem does this PR solve?

Optimize OCR garbage identification to reduce unnecessary filtering.
#5713

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-13 18:48:32 +08:00
9c8060f619 0.17.1 release notes (#6021)
### What problem does this PR solve?



### Type of change

- [x] Documentation Update
2025-03-13 14:43:24 +08:00
e213873852 Optimize graphrag cache get entity (#6018)
### What problem does this PR solve?

Optimize graphrag cache get entity

### Type of change

- [x] Performance Improvement
2025-03-13 14:37:59 +08:00
e05cdc2f9c Fix: encoding detection error. (#6006)
### What problem does this PR solve?

#5967

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-13 10:47:58 +08:00
3571270191 Refa: refine the context window size warning. (#5993)
### What problem does this PR solve?


### Type of change
- [x] Refactoring
2025-03-12 19:40:54 +08:00
7cd37c37cd Feat: add CSV file parsing support (#5989)
### What problem does this PR solve?

Add CSV file parsing support #4552, #5849, #5870

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-12 19:20:50 +08:00
6e13922bdc Feat: Add qwq model support to Tongyi-Qianwen factory (#5981)
### What problem does this PR solve?

Add QwQ model support to the Tongyi-Qianwen factory.
https://github.com/infiniflow/ragflow/issues/5869

### Type of change

- [x] New Feature (non-breaking change which adds functionality)


![image](https://github.com/user-attachments/assets/49f5c6a0-ecaf-41dd-a23a-2009f854d62c)


![image](https://github.com/user-attachments/assets/93ffa303-920e-4942-8188-bcd6b7209204)


![1741774779438](https://github.com/user-attachments/assets/25f2fd1d-8640-4df0-9a08-78ee9daaa8fe)


![image](https://github.com/user-attachments/assets/4763cf6c-1f76-43c4-80ee-74dfd666a184)

Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>
2025-03-12 18:54:15 +08:00
1c663b32b9 Fix: signal.SIGUSR1 and signal.SIGUSR2 can't be used on Windows, so don't bind them in the Windows environment (#5941)
### What problem does this PR solve?
Fix: signal.SIGUSR1 and signal.SIGUSR2 can't be used on Windows, so don't
bind signal.SIGUSR1 and signal.SIGUSR2 in the Windows environment.
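
A small sketch of the guard, assuming the handlers are only wanted for the existing Unix-only debugging hooks:

```python
import signal
import sys

def _debug_handler(signum, frame):
    print(f"received signal {signum}")

# signal.SIGUSR1 / signal.SIGUSR2 do not exist on Windows, so only bind
# them on other platforms.
if sys.platform != "win32":
    signal.signal(signal.SIGUSR1, _debug_handler)
    signal.signal(signal.SIGUSR2, _debug_handler)
```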

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):

Co-authored-by: tangyu <1@1.com>
2025-03-12 09:43:18 +08:00
caecaa7562 Feat: apply LLM to optimize citations. (#5935)
### What problem does this PR solve?

#5905

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-11 19:56:21 +08:00
6ec6ca6971 Refactor graphrag to remove redis lock (#5828)
### What problem does this PR solve?

Refactor graphrag to remove redis lock

### Type of change

- [x] Refactoring
2025-03-10 15:15:06 +08:00
15736c57c3 Fix: empty query issue. (#5830)
### What problem does this PR solve?

#5214

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-10 13:56:56 +08:00
b29539b442 Fix: CoHereRerank not respecting base_url when provided (#5784)
### What problem does this PR solve?

The vLLM provider with a reranking model does not work: under the hood
vLLM uses the [CoHereRerank
provider](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/__init__.py#L250)
with a `base_url`, and if this URL [is not passed to the Cohere
client](https://github.com/infiniflow/ragflow/blob/v0.17.0/rag/llm/rerank_model.py#L379-L382)
every request ends up on the Cohere SaaS (sending your private API key
in the process) instead of your vLLM instance.
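
A hedged sketch of the fix, assuming the installed `cohere` SDK's `Client` accepts a `base_url` keyword (older SDK versions expose a similar `api_url` parameter instead):

```python
import cohere

class CoHereRerank:
    def __init__(self, key, model_name, base_url=None):
        # Forward base_url so requests reach the vLLM endpoint; without it,
        # the client silently defaults to the Cohere SaaS.
        if base_url:
            self.client = cohere.Client(api_key=key, base_url=base_url)
        else:
            self.client = cohere.Client(api_key=key)
        self.model_name = model_name
```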

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
2025-03-10 11:22:06 +08:00
2ad852d8df Fix: truncate message issue. (#5776)
### What problem does this PR solve?

Close #5761
### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-07 17:41:56 +08:00
64c6cc4cf3 Fix: truncate message issue. (#5765)
### What problem does this PR solve?

Close #5761

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-07 16:33:25 +08:00
b1bbb9e210 Refa: make Rewrite component effective to relative data expression. (#5752)
### What problem does this PR solve?

#5716

### Type of change

- [x] Refactoring
2025-03-07 13:48:13 +08:00
df9b7b2fe9 Fix: rerank issue. (#5696)
### What problem does this PR solve?

#5673

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-06 15:05:19 +08:00
251ba7f058 Refa: remove max tokens since no one needs it. (#5690)
### What problem does this PR solve?

#5646 #5640

### Type of change

- [x] Refactoring
2025-03-06 11:29:40 +08:00
b8da2eeb69 Feat: support huggingface re-rank model. (#5684)
### What problem does this PR solve?

#5658

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-03-06 10:44:04 +08:00
4326873af6 refactor: no need to explicitly inherit in Python 3; clean up the code (#5659)
### What problem does this PR solve?

As title

### Type of change


- [x] Refactoring

Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2025-03-05 18:03:53 +08:00
e5041749a2 Fix: tavily search error. (#5653)
### What problem does this PR solve?

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-05 17:03:05 +08:00
f65c3ae62b Refactored DocumentService.update_progress (#5642)
### What problem does this PR solve?

Refactored DocumentService.update_progress

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-05 14:48:03 +08:00