Commit Graph

213 Commits

593ffc4067 Fix HuggingFace model error. (#3870)
### What problem does this PR solve?

#3865

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-05 13:28:42 +08:00
78601ee1bd Fix OpenAI-compatible rerank issue. (#3866)
### What problem does this PR solve?
#3700
### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-05 10:26:21 +08:00
3f3469130b Fix preview issue in file manager. (#3846)
### What problem does this PR solve?

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-04 11:53:23 +08:00
6657ca7cde Change default error message to English (#3838)
### What problem does this PR solve?

As title

### Type of change

- [x] Refactoring

---------

Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2024-12-04 09:34:49 +08:00
92ab7ef659 Refactor embedding batch_size (#3825)
### What problem does this PR solve?

Refactor embedding batch_size. Close #3657
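
For illustration, the batching pattern looks roughly like this (a sketch; `encode_fn` and the default batch size are assumptions, not RAGFlow's actual code):

```python
# Minimal sketch of batched embedding calls; encode_fn and batch_size are illustrative.
from typing import Callable, List

def embed_in_batches(texts: List[str],
                     encode_fn: Callable[[List[str]], List[List[float]]],
                     batch_size: int = 16) -> List[List[float]]:
    """Split texts into fixed-size batches so one oversized request never hits the API."""
    vectors: List[List[float]] = []
    for i in range(0, len(texts), batch_size):
        vectors.extend(encode_fn(texts[i:i + batch_size]))
    return vectors
```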

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] Refactoring
2024-12-03 16:22:39 +08:00
6a0583f5ad Fix voyage embedding. (#3818)
### What problem does this PR solve?

#3816 

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-03 09:33:54 +08:00
d19f059f34 Detect invalid response from api.siliconflow.cn (#3792)
### What problem does this PR solve?

Detect invalid response from api.siliconflow.cn. Close #2643
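
A minimal sketch of the kind of validation this implies (the success field and error handling are assumptions, not the actual SiliconFlow schema):

```python
import requests

def call_siliconflow(url: str, payload: dict, headers: dict) -> dict:
    """Raise a descriptive error instead of silently using a malformed response."""
    resp = requests.post(url, json=payload, headers=headers, timeout=60)
    if resp.status_code != 200:
        raise RuntimeError(f"SiliconFlow HTTP {resp.status_code}: {resp.text[:200]}")
    data = resp.json()
    if "data" not in data:  # assumed success field; adjust to the real schema
        raise RuntimeError(f"Unexpected SiliconFlow response: {data}")
    return data
```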

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-02 12:55:05 +08:00
59a5813f1b Add new Jina models to the Jina connector (#3770)
### What problem does this PR solve?

Add new models to the Jina connector, to allow using models that support
multiple languages.

### Type of change

- [X] Other (please describe): new connectors, no breaking change
2024-12-02 10:06:39 +08:00
d94386e00a Pass top_p to ollama (#3744)
### What problem does this PR solve?

Pass top_p to ollama. Close #1769
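
For context, forwarding `top_p` through the Ollama Python client looks roughly like this (a sketch, not the PR's exact code):

```python
from ollama import Client

client = Client(host="http://localhost:11434")
response = client.chat(
    model="llama3",
    messages=[{"role": "user", "content": "Hello"}],
    # generation options are forwarded in the `options` dict, including top_p
    options={"temperature": 0.7, "top_p": 0.9},
)
print(response["message"]["content"])
```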

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-29 14:52:27 +08:00
91f1814a87 Fix error response (#3719)
### What problem does this PR solve?



### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)

---------

Co-authored-by: Jin Hai <haijin.chn@gmail.com>
2024-11-28 18:56:10 +08:00
57208d8e53 Fix batch size issue. (#3675)
### What problem does this PR solve?

#3657

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-27 18:06:43 +08:00
8b35776916 Fix a bug in VolcEngine (#3658)
### What problem does this PR solve?

Fix a bug in VolcEngine. #3553

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)

Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>
2024-11-27 09:30:49 +08:00
0891a393d7 Let ThreadPool exit gracefully. (#3653)
### What problem does this PR solve?

#3646

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-26 16:31:07 +08:00
e5af18d5ea Update docs for v0.14.0 (#3625)
### What problem does this PR solve?


### Type of change

- [x] Documentation Update
2024-11-25 11:37:56 +08:00
875096384b When the Qwen rerank model does not return OK, raise an exception to notify the user (#3593)
### What problem does this PR solve?

When calling the Qwen rerank model, if the model does not return
correctly, an exception should be raised to notify the user, rather than
simply returning a value of 0, as this would be confusing to the user.
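
A minimal sketch of the intended behavior, assuming the DashScope rerank API (the exact call and response fields are assumptions):

```python
from http import HTTPStatus
import dashscope

def qwen_rerank(query: str, documents: list[str], top_n: int = 5):
    resp = dashscope.TextReRank.call(model="gte-rerank", query=query,
                                     documents=documents, top_n=top_n)
    if resp.status_code != HTTPStatus.OK:
        # Surface the failure instead of silently returning zero scores.
        raise ValueError(f"Qwen rerank failed: {resp.code} {resp.message}")
    return resp.output.results
```
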
### Type of change          

- [x] New Feature (non-breaking change which adds functionality)
2024-11-22 22:34:34 +08:00
81c7b6afc5 Make Spark model more robust to model name (#3514)
### What problem does this PR solve?


### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-20 20:53:44 +08:00
d42362deb6 Add api for sessions and add max_tokens for tenant_llm (#3472)
### What problem does this PR solve?

Add api for sessions and add max_tokens for tenant_llm

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>
2024-11-19 14:51:33 +08:00
4413683898 Introduced beartype (#3460)
### What problem does this PR solve?

Introduced [beartype](https://github.com/beartype/beartype) for runtime
type-checking.
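
For reference, a tiny example of how beartype enforces annotations at call time:

```python
from beartype import beartype

@beartype
def chunk_text(text: str, size: int) -> list[str]:
    return [text[i:i + size] for i in range(0, len(text), size)]

chunk_text("hello world", 4)    # OK
# chunk_text("hello world", "4")  # would raise BeartypeCallHintParamViolation
```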

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-11-18 17:38:17 +08:00
4b3eeaa6ef Added LocalAI support for rerank models (#3446)
### What problem does this PR solve?

Hi there!
LocalAI added support for rerank models:
https://localai.io/features/reranker/

I've implemented a LocalAIRerank class (largely copied from the
OpenAI_APIRerank class).
Also, LocalAI responds with a 500 error code if the length of "documents"
is less than 2 in the similarity check, so I've added a second "document"
to the RERANK model connection check in `api/apps/llm_app.py` (see the
sketch below).
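
A sketch of that padding idea (the payload shape follows the Jina-style rerank API that LocalAI exposes; field names are assumptions):

```python
def build_rerank_check_payload(model: str, probe_text: str = "ping") -> dict:
    # LocalAI returns HTTP 500 when fewer than two documents are sent,
    # so the connection check always includes a second dummy document.
    return {
        "model": model,
        "query": probe_text,
        "documents": [probe_text, "placeholder document for the health check"],
    }
```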

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-11-18 12:05:52 +08:00
1e90a1bf36 Move settings initialization after module init phase (#3438)
### What problem does this PR solve?

1. Module init won't connect to the database any more.
2. Config values in settings now need to be accessed as settings.CONFIG_NAME (see the sketch below).
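
For illustration, the new access pattern looks roughly like this (the module path and config names here are hypothetical; only the `settings.<NAME>` access pattern is the point):

```python
# Hypothetical example of the new access pattern; names are not the project's actual ones.
from api import settings

def build_db_url() -> str:
    # Read config through the settings namespace at call time,
    # rather than copying values at module import (which forced an early DB connection).
    return f"mysql://{settings.DB_HOST}:{settings.DB_PORT}/{settings.DB_NAME}"
```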

### Type of change

- [x] Refactoring

Signed-off-by: jinhai <haijin.chn@gmail.com>
2024-11-15 17:30:56 +08:00
30f6421760 Use consistent log file names, introduced initLogger (#3403)
### What problem does this PR solve?

Use consistent log file names, introduced initLogger
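
For context, a minimal sketch of what an `initLogger`-style helper might look like (the signature, file layout, and rotation settings are assumptions, not the project's exact implementation):

```python
import logging
import os
from logging.handlers import RotatingFileHandler

def init_logger(name: str, log_dir: str = "logs") -> logging.Logger:
    """Give every service the same <name>.log file name and log format."""
    os.makedirs(log_dir, exist_ok=True)
    handler = RotatingFileHandler(os.path.join(log_dir, f"{name}.log"),
                                  maxBytes=10 * 1024 * 1024, backupCount=5)
    handler.setFormatter(logging.Formatter(
        "%(asctime)s %(levelname)s %(name)s: %(message)s"))
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.addHandler(handler)
    return logger
```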

### Type of change

- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [x] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
2024-11-14 17:13:48 +08:00
632b23486f Fix the value issue of anthropic (#3351)
### What problem does this PR solve?

This pull request fixes the issue mentioned in
https://github.com/infiniflow/ragflow/issues/3263.

1. The response should be parsed as a dict, to prevent the following code from
failing to take values: `ans = response["content"][0]["text"]` (see the sketch
below).
2. The API model ```claude-instant-1.2``` has been retired (per
[model-deprecations](https://docs.anthropic.com/en/docs/resources/model-deprecations))
and triggers errors in the code, so I deleted it from the
conf/llm_factories.json file and added the latest API model
```claude-3-5-sonnet-20241022```.
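
A minimal sketch of the dict-parsing idea in point 1 (assuming the SDK response is a pydantic model exposing `model_dump()`; otherwise it is already a dict):

```python
def message_text(response) -> str:
    # Normalize to a plain dict first so the indexing never hits a typed object.
    data = response if isinstance(response, dict) else response.model_dump()
    return data["content"][0]["text"]
```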



### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)

---------

Co-authored-by: chenhaodong <chenhaodong@ctrlvideo.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-11-13 16:13:52 +08:00
fa54cd5f5c Extract model dir from model's full name (#3368)
### What problem does this PR solve?

When a model's group name contains the digits 0-9, we can't find the downloaded
model, because we do not correctly extract the model dir's name from the model's
full name.

### Type of change

- [ ] Bug Fix (non-breaking change which fixes an issue)

Co-authored-by: 王志鹏 <zhipeng3.wang@midea.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-11-13 14:10:16 +08:00
a2a5631da4 Rework logging (#3358)
### What problem does this PR solve?

Unified all log files into one.

### Type of change

- [x] Refactoring
2024-11-12 17:35:13 +08:00
34d1daac67 fix: Anthropic param error (#3327)
### What problem does this PR solve?

#3263

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-11 11:54:14 +08:00
4097912d59 Add inputs to display for every component (#3242)
### What problem does this PR solve?

#3240

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-11-06 18:47:53 +08:00
0dff64f6ad fix: TypeError: only length-1 arrays can be converted to Python scalars (#3211)
### What problem does this PR solve?
fix "TypeError: only length-1 arrays can be converted to Python scalars"
while using cohere embedding model.
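
For context, this TypeError typically appears when a multi-element NumPy array is coerced to a Python scalar; a generic illustration of the failure mode and fix (an assumption about the root cause, not the PR's exact change):

```python
import numpy as np

token_counts = np.array([12, 7, 9])   # e.g. per-text token counts from an embed call
# int(token_counts) -> TypeError: only length-1 arrays can be converted to Python scalars
total = int(token_counts.sum())       # reduce to a single element before converting
```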

### Type of change

- [ ] Bug Fix (non-breaking change which fixes an issue)


![image](https://github.com/user-attachments/assets/2c21a69f-cd76-4d25-b320-058964812db8)
2024-11-06 11:15:00 +08:00
55953819c1 accelerate term weight calculation (#3206)
### What problem does this PR solve?



### Type of change

- [x] Performance Improvement
2024-11-05 13:11:26 +08:00
677f02c2a7 rm unused file (#3205)
### What problem does this PR solve?


### Type of change

- [x] Refactoring
2024-11-05 11:56:09 +08:00
7e0148c058 fix local variable ans (#3077)
### What problem does this PR solve?
#3064

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-29 10:42:45 +08:00
f86826b7a0 refactor error message of qwen (#3074)
### What problem does this PR solve?
#3055

### Type of change
- [x] Refactoring
2024-10-29 10:08:08 +08:00
9457d20ef1 make gemini robust (#3012)
### What problem does this PR solve?

#3003

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-25 10:50:44 +08:00
89d5b2414e fix SILICONFLOW rerank error (#2980)
### What problem does this PR solve?

#2977

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-23 10:12:39 +08:00
445dce4363 [Bug]: unnecessary auto-increment calculations in the tokens statistics of the chat model (#2969)
### What problem does this PR solve?

The details are shown in
https://github.com/infiniflow/ragflow/issues/2968

### Type of change

- [X] Bug Fix (non-breaking change which fixes an issue)

---------

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-10-22 16:26:04 +08:00
5aa9d7787e [Bug]: When using the OpenAI chat model, raise ERROR: 'CompletionUsage' object has no attribute 'get' #2948 (#2949)
### What problem does this PR solve?

The details of this PR are shown at
https://github.com/infiniflow/ragflow/issues/2948
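
A minimal sketch of a defensive fix for this failure mode (illustrative only; with openai>=1.x, `response.usage` is a `CompletionUsage` pydantic model rather than a dict):

```python
def total_tokens(usage) -> int:
    if usage is None:
        return 0
    if isinstance(usage, dict):                  # some proxies still return plain dicts
        return usage.get("total_tokens", 0)
    return getattr(usage, "total_tokens", 0)     # pydantic model: attribute access
```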

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-10-22 11:40:05 +08:00
b2524eec49 fix sequence2txt error and usage total token issue (#2961)
### What problem does this PR solve?

#1363

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-22 11:38:37 +08:00
ac26d09a59 Feature/feat1017 (#2872)
### What problem does this PR solve?

1. fix: mind map display error in knowledge graph, caused by an
```@antv/g6``` version change
2. feat: concurrent threads configuration support in graph extractor (see the sketch after this list)
3. fix: used-tokens update failure for tenant
4. feat: timeout configuration support for LLM
5. fix: regex error in graph extractor
6. feat: qwen rerank (```gte-rerank```) support
7. fix: timeout handling in the knowledge graph index process; chat now uses
stream output, and it is configurable
8. feat: ```qwen-long``` model configuration
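
A rough sketch of the configurable-concurrency idea in item 2 (function and parameter names are assumptions, not the PR's actual code):

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def extract_graph(chunks, extract_fn, max_workers: int = 4):
    """Run entity/relation extraction over chunks with a configurable thread count."""
    results = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(extract_fn, chunk) for chunk in chunks]
        for future in as_completed(futures):
            results.append(future.result())
    return results
```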

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: chongchuanbing <chongchuanbing@gmail.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-10-21 12:11:08 +08:00
e5f7733b31 Resolves #2905: openai-compatible model provider adds llama.cpp rerank support (#2906)
### What problem does this PR solve?
Resolve #2905 



Due to inconsistent token sizes, I made it safe by limiting to 500 in
code, since there is no config param to control it.

My llama.cpp run sets `-ub` to 1024:

`${llama_path}/bin/llama-server --host 0.0.0.0 --port 9901 -ub 1024 -ngl 99 -m $gguf_file --reranking "$@"`
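
For reference, a hypothetical client-side call against that server's `/rerank` endpoint (field names follow the Jina-style rerank API; treat them as assumptions):

```python
import requests

resp = requests.post(
    "http://localhost:9901/rerank",
    json={
        "model": "reranker",
        "query": "What is RAGFlow?",
        "documents": ["RAGFlow is a RAG engine.", "Unrelated text."],
    },
    timeout=60,
)
for item in resp.json().get("results", []):
    print(item["index"], item["relevance_score"])
```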





### Type of change

- [x] New Feature (non-breaking change which adds functionality)

Here is my test of RAGFlow using llama.cpp:

```
lot update_slots: id  0 | task 458 | prompt done, n_past = 416, n_tokens = 416
slot      release: id  0 | task 458 | stop processing: n_past = 416, truncated = 0
slot launch_slot_: id  0 | task 459 | processing task
slot update_slots: id  0 | task 459 | tokenizing prompt, len = 2
slot update_slots: id  0 | task 459 | prompt tokenized, n_ctx_slot = 8192, n_keep = 0, n_prompt_tokens = 111
slot update_slots: id  0 | task 459 | kv cache rm [0, end)
slot update_slots: id  0 | task 459 | prompt processing progress, n_past = 111, n_tokens = 111, progress = 1.000000
slot update_slots: id  0 | task 459 | prompt done, n_past = 111, n_tokens = 111
slot      release: id  0 | task 459 | stop processing: n_past = 111, truncated = 0
srv  update_slots: all slots are idle
request: POST /rerank 172.23.0.4 200

```
2024-10-21 10:06:29 +08:00
4991107822 Fix keys of Xinference-deployed models, especially those with the same model name as publicly hosted models. (#2832)
### What problem does this PR solve?

Fix keys of Xinference-deployed models, especially those with the same model
name as publicly hosted models.

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)

---------

Co-authored-by: 0000sir <0000sir@gmail.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-10-16 10:21:08 +08:00
3f065c75da support chat model in huggingface (#2802)
### What problem does this PR solve?

#2794

### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-10-11 14:45:48 +08:00
5e7c1fb23a reduce rerank batch size (#2801)
### What problem does this PR solve?

### Type of change


- [x] Performance Improvement
2024-10-11 11:29:19 +08:00
18f80743eb support api-version and change default-model in adding azure-openai and openai (#2799)
### What problem does this PR solve?
#2701 #2712 #2749

### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-10-11 11:26:42 +08:00
29f022c91c fix bedrock issue (#2776)
### What problem does this PR solve?

#2722

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-10 09:13:35 +08:00
2df15742fc fix xinference add rerank model bug (#2758)
### What problem does this PR solve?

Fix the Xinference add-rerank-model bug:
https://github.com/infiniflow/ragflow/issues/2294#issue-2510788135

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-09 19:37:11 +08:00
7f44cf543a move import positions (#2753)
### What problem does this PR solve?

### Type of change

- [x] Refactoring
2024-10-09 10:34:58 +08:00
16472eb3ea Solve knowledge graph issue when calling the Gemini model (#2738)
### What problem does this PR solve?
#2720

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-08 18:27:04 +08:00
a3ab5ba9ac support sequence2txt and tts model in Xinference (#2696)
### Type of change

- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-10-08 10:43:18 +08:00
34761fa4ca Fix/bedrock issues (#2718)
### What problem does this PR solve?

Adding a Bedrock API key for Claude Sonnet was broken. I found the issue
came up when trying to test the LLM configuration: `system` is a
required parameter in boto3.

There were also problems in the Bedrock implementation for embeddings
when trying to encode queries.
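
A minimal sketch of passing `system` through the Bedrock Converse API (model ID, region, and prompt are placeholders, not the PR's exact code):

```python
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")
response = client.converse(
    modelId="anthropic.claude-3-5-sonnet-20240620-v1:0",
    # `system` is passed alongside the messages rather than inside them
    system=[{"text": "You are a helpful assistant."}],
    messages=[{"role": "user", "content": [{"text": "Hello"}]}],
    inferenceConfig={"temperature": 0.2, "maxTokens": 512},
)
print(response["output"]["message"]["content"][0]["text"])
```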

### Type of change

- [X] Bug Fix (non-breaking change which fixes an issue)
2024-10-05 16:44:50 +08:00
0a7654c747 fix error in exception (#2694)
### What problem does this PR solve?
#2670

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-09-30 17:54:27 +08:00
96f56a3c43 add huggingface model (#2624)
### What problem does this PR solve?

#2469

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-09-27 19:15:38 +08:00