20b577a72c
Fix: Merge main branch ( #10377 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: jinhai <haijin.chn@gmail.com >
Signed-off-by: Jin Hai <haijin.chn@gmail.com >
Co-authored-by: Lynn <lynn_inf@hotmail.com >
Co-authored-by: chanx <1243304602@qq.com >
Co-authored-by: balibabu <cike8899@users.noreply.github.com >
Co-authored-by: 纷繁下的无奈 <zhileihuang@126.com >
Co-authored-by: huangzl <huangzl@shinemo.com >
Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com >
Co-authored-by: Wilmer <33392318@qq.com >
Co-authored-by: Adrian Weidig <adrianweidig@gmx.net >
Co-authored-by: Zhichang Yu <yuzhichang@gmail.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Yongteng Lei <yongtengrey@outlook.com >
Co-authored-by: Liu An <asiro@qq.com >
Co-authored-by: buua436 <66937541+buua436@users.noreply.github.com >
Co-authored-by: BadwomanCraZY <511528396@qq.com >
Co-authored-by: cucusenok <31804608+cucusenok@users.noreply.github.com >
Co-authored-by: Russell Valentine <russ@coldstonelabs.org >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Billy Bao <newyorkupperbay@gmail.com >
Co-authored-by: Zhedong Cen <cenzhedong2@126.com >
Co-authored-by: TensorNull <129579691+TensorNull@users.noreply.github.com >
Co-authored-by: TensorNull <tensor.null@gmail.com >
Co-authored-by: Ajay <160579663+aybanda@users.noreply.github.com >
Co-authored-by: AB <aj@Ajays-MacBook-Air.local >
Co-authored-by: 天海蒼灆 <huangaoqin@tecpie.com >
Co-authored-by: He Wang <wanghechn@qq.com >
Co-authored-by: Atsushi Hatakeyama <atu729@icloud.com >
Co-authored-by: Jin Hai <haijin.chn@gmail.com >
Co-authored-by: Mohamed Mathari <155896313+melmathari@users.noreply.github.com >
Co-authored-by: Mohamed Mathari <nocodeventure@Mac-mini-van-Mohamed.fritz.box >
Co-authored-by: Stephen Hu <stephenhu@seismic.com >
Co-authored-by: Shaun Zhang <zhangwfjh@users.noreply.github.com >
Co-authored-by: zhimeng123 <60221886+zhimeng123@users.noreply.github.com >
Co-authored-by: mxc <mxc@example.com >
Co-authored-by: Dominik Novotný <50611433+SgtMarmite@users.noreply.github.com >
Co-authored-by: EVGENY M <168018528+rjohny55@users.noreply.github.com >
Co-authored-by: mcoder6425 <mcoder64@gmail.com >
Co-authored-by: TeslaZY <TeslaZY@outlook.com >
Co-authored-by: lemsn <lemsn@msn.com >
Co-authored-by: lemsn <lemsn@126.com >
Co-authored-by: Adrian Gora <47756404+adagora@users.noreply.github.com >
Co-authored-by: Womsxd <45663319+Womsxd@users.noreply.github.com >
Co-authored-by: FatMii <39074672+FatMii@users.noreply.github.com >
2025-09-30 13:13:15 +08:00
ecdb1701df
Perf: test llm before RAPTOR. ( #8897 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2025-07-17 16:48:50 +08:00
4a2ff633e0
Fix typo in code ( #8327 )
...
### What problem does this PR solve?
Fix typo in code
### Type of change
- [x] Refactoring
---------
Signed-off-by: Jin Hai <haijin.chn@gmail.com >
2025-06-18 09:41:09 +08:00
b908c33464
Fix: uncaptured image data with position information ( #7683 )
...
### What problem does this PR solve?
Fixed uncaptured figure data with position information. #7466 , #7681
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com >
2025-05-19 19:33:28 +08:00
953b3e1b3f
Fix: Sometimes VisionFigureParser.figures may is tuple ( #7477 )
...
### What problem does this PR solve?
https://github.com/infiniflow/ragflow/issues/7466
I think due to some times we can not get position
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-05-06 17:38:22 +08:00
2f768b96e8
perf: optimze figure parser ( #7392 )
...
### What problem does this PR solve?
When parsing documents containing images, the current code uses a
single-threaded approach to call the VL model, resulting in extremely
slow parsing speed (e.g., parsing a Word document with dozens of images
takes over 20 minutes).
By switching to a multithreaded approach to call the VL model, the
parsing speed can be improved to an acceptable level.
### Type of change
- [x] Performance Improvement
---------
Co-authored-by: liuzhenghua-jk <liuzhenghua-jk@360shuke.com >
2025-05-06 14:39:45 +08:00
9611185eb4
Feat: add VLM-boosted DocX parser ( #6307 )
...
### What problem does this PR solve?
Add VLM-boosted DocX parser
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2025-03-20 11:24:44 +08:00
1d6760dd84
Feat: add VLM-boosted PDF parser ( #6278 )
...
### What problem does this PR solve?
Add VLM-boosted PDF parser if VLM is set.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2025-03-20 09:39:32 +08:00