296476ab89
Refactor function name ( #11210 )
...
### What problem does this PR solve?
As title
### Type of change
- [x] Refactoring
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2025-11-12 19:00:15 +08:00
9e323a9351
Feat(nlp): add "怎么办" pattern to question word removal ( #10284 )
...
### What problem does this PR solve?
Added "怎么办" ("what should I do") to the regex pattern in the rmWWW
method, so that query cleaning strips this common question phrase along
with the other question words.
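A minimal sketch of this kind of question-word removal, for illustration only. The pattern below is a small hypothetical subset, not the actual list used by rmWWW, and the function name `strip_question_words` and the fall-back-to-original behavior are assumptions:

```python
import re

# Hypothetical pattern: a small illustrative subset of question phrases,
# not the real list inside rmWWW.
QUESTION_WORDS = re.compile(r"(怎么办|为什么|什么|如何)")


def strip_question_words(query: str) -> str:
    """Remove question phrases from a query.

    If stripping would leave nothing, return the original query so the
    caller never searches with an empty string (an assumption of this sketch).
    """
    cleaned = QUESTION_WORDS.sub("", query).strip()
    return cleaned or query


print(strip_question_words("电脑蓝屏怎么办"))  # prints 电脑蓝屏
```

Returning the original text when everything is stripped keeps a query that consists only of a question phrase from collapsing to an empty string.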
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2025-09-25 16:47:56 +08:00
342a04ec8a
Added infinity rank_feature support ( #9044 )
...
### What problem does this PR solve?
Added infinity rank_feature support
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-07-29 09:14:23 +08:00
0d7cfce6e1
Update rag/nlp/query.py ( #7816 )
...
### What problem does this PR solve?
Fix tokenizer behavior that resulted in low recall
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] Refactoring
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2025-05-23 17:13:37 +08:00
a14865e6bb
Fix: empty query issue. ( #7551 )
...
### What problem does this PR solve?
#5214
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-05-09 12:20:19 +08:00
c7310f7fb2
Refa: similarity calculations. ( #7381 )
...
### What problem does this PR solve?
### Type of change
- [x] Refactoring
2025-04-28 19:17:11 +08:00
0758c04941
Refa: token similarity calculations. ( #6614 )
...
### What problem does this PR solve?
#6507
### Type of change
- [x] Performance Improvement
2025-03-28 09:33:08 +08:00
15736c57c3
Fix: empty query issue. ( #5830 )
...
### What problem does this PR solve?
#5214
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-03-10 13:56:56 +08:00
4f40f685d9
Code refactor ( #5371 )
...
### What problem does this PR solve?
#5173
### Type of change
- [x] Refactoring
2025-02-26 15:40:52 +08:00
53b9e7b52f
Add tavily as web search tool. ( #5349 )
...
### What problem does this PR solve?
#5198
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2025-02-26 10:21:04 +08:00
cdb3e6434a
Fix empty question issue. ( #5225 )
...
### What problem does this PR solve?
#5241
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-02-21 15:47:39 +08:00
6f2c3a3c3c
Fix too long query exception. ( #4729 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-02-06 10:11:52 +08:00
c5da3cdd97
Tagging ( #4426 )
...
### What problem does this PR solve?
#4367
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2025-01-09 17:07:21 +08:00
f948c0d9f1
Clean query. ( #4259 )
...
### What problem does this PR solve?
#4239
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-27 14:25:03 +08:00
927873bfa6
Fix syn error. ( #3953 )
...
### What problem does this PR solve?
Close #3696
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-10 10:54:54 +08:00
0d68a6cd1b
Fix errors detected by Ruff ( #3918 )
...
### What problem does this PR solve?
Fix errors detected by Ruff
### Type of change
- [x] Refactoring
2024-12-08 14:21:12 +08:00
56f473b680
Feat: Add question parameter to edit chunk modal ( #3875 )
...
### What problem does this PR solve?
Close #3873
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-12-05 14:51:19 +08:00
1b817a5b4c
Refine synonym query. ( #3855 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-12-04 17:20:12 +08:00
bc701d7b4c
Edit chunk shall update instead of insert it ( #3709 )
...
### What problem does this PR solve?
Edit chunk shall update instead of insert it. Close #3679
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-28 13:00:38 +08:00
57208d8e53
Fix batch size issue. ( #3675 )
...
### What problem does this PR solve?
#3657
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-27 18:06:43 +08:00
ca9e97d2f2
Enlarge the term weight difference ( #3435 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-11-15 15:41:50 +08:00
48e060aa53
rm es query escape chars ( #3428 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-15 13:19:07 +08:00
a1ba228bc2
fix: empty token bug ( #3424 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-15 10:33:03 +08:00
220aaddc62
fix: synonym bug ( #3423 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-15 10:14:51 +08:00
30f6421760
Use consistent log file names, introduced initLogger ( #3403 )
...
### What problem does this PR solve?
Use consistent log file names, introduced initLogger
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [x] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
2024-11-14 17:13:48 +08:00
91332fa0f8
Refine english synonym ( #3371 )
...
### What problem does this PR solve?
#3361
### Type of change
- [x] Performance Improvement
2024-11-13 12:58:37 +08:00
f4c52371ab
Integration with Infinity ( #2894 )
...
### What problem does this PR solve?
Integration with Infinity
- Replaced ELASTICSEARCH with dataStoreConn
- Renamed deleteByQuery to delete
- Renamed bulk to upsertBulk
- getHighlight, getAggregation
- Fix KGSearch.search
- Moved Dealer.sql_retrieval to es_conn.py
### Type of change
- [x] Refactoring
2024-11-12 14:59:41 +08:00
d88f0d43ea
make language judgement more robust ( #3287 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-11-08 12:48:11 +08:00
55953819c1
accelerate term weight calculation ( #3206 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-11-05 13:11:26 +08:00
b164116277
refine token similarity ( #2824 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-10-14 13:33:18 +08:00
54342ae0a2
boost highlight performance ( #2419 )
...
### What problem does this PR solve?
#2415
### Type of change
- [x] Performance Improvement
2024-09-13 18:10:32 +08:00
5a2c542ce2
make term similarity robust ( #2212 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-09-03 14:30:07 +08:00
6d232f1bdb
enable fine-grained tokenization of 3-char words ( #2210 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-09-03 13:37:32 +08:00
642006c8e2
filter out + in es query ( #2046 )
...
### What problem does this PR solve?
#2028
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-08-22 10:02:04 +08:00
e35f7610e7
fix too long query exception ( #1195 )
...
### What problem does this PR solve?
#1161
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-06-18 09:50:59 +08:00
4454ba7a1e
add self-rag ( #1070 )
...
### What problem does this PR solve?
#1069
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-06-06 11:13:39 +08:00
9ed0e50f6b
Update info ( #1005 )
...
### What problem does this PR solve?
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2024-05-31 09:53:04 +08:00
758eb03ccb
fix jina adding issue and term weight refinement ( #974 )
...
### What problem does this PR solve?
#724 #162
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
2024-05-29 19:38:57 +08:00
614defec21
add rerank model ( #969 )
...
### What problem does this PR solve?
feat: add rerank models to the project #724 #162
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-05-29 16:50:02 +08:00
2b36283712
fix english query bug ( #840 )
...
### What problem does this PR solve?
#834
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-05-20 12:23:51 +08:00
de839fc3f0
optimize srv broker and executor logic ( #630 )
...
### What problem does this PR solve?
Optimize the task broker and executor to reduce memory usage and
deployment complexity.
### Type of change
- [x] Performance Improvement
- [x] Refactoring
### Change Log
- Enhance Redis utils for the message queue (use streams)
- Modify task broker logic to go through the message queue (1. get parse
events from the message queue; 2. run them asynchronously with a
ThreadPoolExecutor)
- Rename the document and task table column process_duation ->
process_duration (probably just a spelling mistake)
- Reformat some code style (only where noticed)
- Add requirement_dev.txt for developer
- Add redis container on docker compose
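The broker flow in the change log (pull parse events from a queue, hand them to a ThreadPoolExecutor) can be sketched roughly as follows. This is not the PR's code: it swaps the Redis stream for an in-process `queue.Queue` so it runs without a server, and `parse_document`, `drain`, and the event shape are hypothetical names chosen for illustration.

```python
import queue
from concurrent.futures import ThreadPoolExecutor


def parse_document(event: dict) -> str:
    # Placeholder for the real parsing work (hypothetical handler).
    return f"parsed:{event['doc_id']}"


def drain(events: queue.Queue, max_workers: int = 4) -> list:
    """Pull every pending event and run its handler on a thread pool."""
    futures = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        while True:
            try:
                event = events.get_nowait()
            except queue.Empty:
                break  # nothing left to consume
            futures.append(pool.submit(parse_document, event))
    # Leaving the with-block waits for all submitted tasks to finish.
    return [f.result() for f in futures]


pending = queue.Queue()
for doc_id in range(3):
    pending.put({"doc_id": doc_id})
print(drain(pending))  # prints ['parsed:0', 'parsed:1', 'parsed:2']
```

Results come back in submission order because the futures are collected in a list, even though the handlers may finish in any order.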
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-05-07 11:43:33 +08:00
8c07992b6c
refine code ( #595 )
...
### What problem does this PR solve?
### Type of change
- [x] Refactoring
2024-04-28 19:13:33 +08:00
923cbe488e
fix #258: task_executor uses too much CPU ( #288 )
...
### What problem does this PR solve?
Issue link:#285
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-04-10 10:11:22 +08:00
653c759d8d
resolve issue when calling ZHIPUAI ( #277 )
...
### What problem does this PR solve?
Issue link:#265
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-04-09 16:16:10 +08:00
392e515c3f
fix bug in knowledge base configuration reloading ( #210 )
...
### What problem does this PR solve?
Issue link: [#209](https://github.com/infiniflow/ragflow/issues/209)
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-04-03 11:00:50 +08:00
05298d5626
refine README ( #169 )
2024-03-29 10:48:29 +08:00
fd7fcb5baf
apply pep8 formalize ( #155 )
2024-03-27 11:33:46 +08:00
da21320b88
fix plainPdf bugs ( #152 )
2024-03-26 15:11:07 +08:00
6999598101
refine for English corpus ( #135 )
2024-03-20 16:56:16 +08:00
9a843667b3
fix github account login issue ( #132 )
2024-03-19 15:31:47 +08:00