20b577a72c
Fix: Merge main branch ( #10377 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: jinhai <haijin.chn@gmail.com >
Signed-off-by: Jin Hai <haijin.chn@gmail.com >
Co-authored-by: Lynn <lynn_inf@hotmail.com >
Co-authored-by: chanx <1243304602@qq.com >
Co-authored-by: balibabu <cike8899@users.noreply.github.com >
Co-authored-by: 纷繁下的无奈 <zhileihuang@126.com >
Co-authored-by: huangzl <huangzl@shinemo.com >
Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com >
Co-authored-by: Wilmer <33392318@qq.com >
Co-authored-by: Adrian Weidig <adrianweidig@gmx.net >
Co-authored-by: Zhichang Yu <yuzhichang@gmail.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Yongteng Lei <yongtengrey@outlook.com >
Co-authored-by: Liu An <asiro@qq.com >
Co-authored-by: buua436 <66937541+buua436@users.noreply.github.com >
Co-authored-by: BadwomanCraZY <511528396@qq.com >
Co-authored-by: cucusenok <31804608+cucusenok@users.noreply.github.com >
Co-authored-by: Russell Valentine <russ@coldstonelabs.org >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Billy Bao <newyorkupperbay@gmail.com >
Co-authored-by: Zhedong Cen <cenzhedong2@126.com >
Co-authored-by: TensorNull <129579691+TensorNull@users.noreply.github.com >
Co-authored-by: TensorNull <tensor.null@gmail.com >
Co-authored-by: Ajay <160579663+aybanda@users.noreply.github.com >
Co-authored-by: AB <aj@Ajays-MacBook-Air.local >
Co-authored-by: 天海蒼灆 <huangaoqin@tecpie.com >
Co-authored-by: He Wang <wanghechn@qq.com >
Co-authored-by: Atsushi Hatakeyama <atu729@icloud.com >
Co-authored-by: Jin Hai <haijin.chn@gmail.com >
Co-authored-by: Mohamed Mathari <155896313+melmathari@users.noreply.github.com >
Co-authored-by: Mohamed Mathari <nocodeventure@Mac-mini-van-Mohamed.fritz.box >
Co-authored-by: Stephen Hu <stephenhu@seismic.com >
Co-authored-by: Shaun Zhang <zhangwfjh@users.noreply.github.com >
Co-authored-by: zhimeng123 <60221886+zhimeng123@users.noreply.github.com >
Co-authored-by: mxc <mxc@example.com >
Co-authored-by: Dominik Novotný <50611433+SgtMarmite@users.noreply.github.com >
Co-authored-by: EVGENY M <168018528+rjohny55@users.noreply.github.com >
Co-authored-by: mcoder6425 <mcoder64@gmail.com >
Co-authored-by: TeslaZY <TeslaZY@outlook.com >
Co-authored-by: lemsn <lemsn@msn.com >
Co-authored-by: lemsn <lemsn@126.com >
Co-authored-by: Adrian Gora <47756404+adagora@users.noreply.github.com >
Co-authored-by: Womsxd <45663319+Womsxd@users.noreply.github.com >
Co-authored-by: FatMii <39074672+FatMii@users.noreply.github.com >
2025-09-30 13:13:15 +08:00
a1b947ffd6
Feat: add splitter ( #10161 )
...
### What problem does this PR solve?
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
---------
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: Lynn <lynn_inf@hotmail.com >
Co-authored-by: chanx <1243304602@qq.com >
Co-authored-by: balibabu <cike8899@users.noreply.github.com >
Co-authored-by: 纷繁下的无奈 <zhileihuang@126.com >
Co-authored-by: huangzl <huangzl@shinemo.com >
Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com >
Co-authored-by: Wilmer <33392318@qq.com >
Co-authored-by: Adrian Weidig <adrianweidig@gmx.net >
Co-authored-by: Zhichang Yu <yuzhichang@gmail.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Yongteng Lei <yongtengrey@outlook.com >
Co-authored-by: Liu An <asiro@qq.com >
Co-authored-by: buua436 <66937541+buua436@users.noreply.github.com >
Co-authored-by: BadwomanCraZY <511528396@qq.com >
Co-authored-by: cucusenok <31804608+cucusenok@users.noreply.github.com >
Co-authored-by: Russell Valentine <russ@coldstonelabs.org >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Billy Bao <newyorkupperbay@gmail.com >
Co-authored-by: Zhedong Cen <cenzhedong2@126.com >
Co-authored-by: TensorNull <129579691+TensorNull@users.noreply.github.com >
Co-authored-by: TensorNull <tensor.null@gmail.com >
2025-09-19 10:15:19 +08:00
e6d36f3a3a
Improve image rotation logic for text recognition ( #8167 )
...
### What problem does this PR solve?
Enhanced the image rotation handling by evaluating the original
orientation, clockwise 90°, and counter-clockwise 90° rotations. The
image with the highest text recognition score is now selected, improving
accuracy for text detection in images with aspect ratios >= 1.5.
#8166
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
Co-authored-by: wenrui.cao <wenrui.cao@univers.com >
2025-06-11 09:20:30 +08:00
6ba5a4348a
set PARALLEL_DEVICES default value= 0 ( #7935 )
...
### What problem does this PR solve?
it would be fail if PARALLEL_DEVICES = None in OCR class , because it
pass 0 to TextDetector and TextRecognizer init method.
and It would be simpler to set 0 as the default value for
PARALLEL_DEVICES.
### Type of change
- [x] Refactoring
2025-05-29 13:32:16 +08:00
3a99c2b5f4
Refa: PARALLEL_DEVICES is a static parameter. ( #6168 )
...
### What problem does this PR solve?
### Type of change
- [x] Refactoring
2025-03-17 16:49:54 +08:00
3e19044dee
Feat: add OCR's muti-gpus and parallel processing support ( #5972 )
...
### What problem does this PR solve?
Add OCR's muti-gpus and parallel processing support
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
@yuzhichang I've tried to resolve the comments in #5697 . OCR jobs can
now be done on both CPU and GPU. ( By the way, I've encountered a
“Generate embedding error” issue #5954 that might be due to my outdated
GPUs? idk. ) Please review it and give me suggestions.
GPU:


CPU:

2025-03-17 11:58:40 +08:00
4326873af6
refactor: no need to inherit in python3 clean the code ( #5659 )
...
### What problem does this PR solve?
As title
### Type of change
- [x] Refactoring
Signed-off-by: yihong0618 <zouzou0208@gmail.com >
2025-03-05 18:03:53 +08:00
db42d0e0ae
Optimize ocr ( #5297 )
...
### What problem does this PR solve?
Introduced OCR.recognize_batch
### Type of change
- [x] Performance Improvement
2025-02-24 16:21:55 +08:00
0151d42156
Reuse loaded modules if possible ( #5231 )
...
### What problem does this PR solve?
Reuse loaded modules if possible
### Type of change
- [x] Refactoring
2025-02-21 17:21:01 +08:00
3411d0a2ce
Added cuda_is_available ( #4725 )
...
### What problem does this PR solve?
Added cuda_is_available
### Type of change
- [x] Refactoring
2025-02-05 18:01:23 +08:00
e1526846da
Fixed GPU detection on CPU only environment ( #4711 )
...
### What problem does this PR solve?
Fixed GPU detection on CPU only environment. Close #4692
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-02-05 12:02:43 +08:00
4230402fbb
deepdoc use GPU if possible ( #4618 )
...
### What problem does this PR solve?
deepdoc use GPU if possible
### Type of change
- [x] Refactoring
2025-01-24 09:48:02 +08:00
3894de895b
Update comments ( #4569 )
...
### What problem does this PR solve?
Add license statement.
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com >
2025-01-21 20:52:28 +08:00
4f9f9405b8
Remove use of eval() from ocr.py ( #4481 )
...
`eval(op_name)` -> `getattr(operators, op_name)`
### What problem does this PR solve?
Using `eval()` can lead to code injections and is entirely unnecessary
here.
### Type of change
- [x] Other (please describe):
Best practice code improvement, preventing the possibility of code
injection.
2025-01-20 09:52:30 +08:00
1254ecf445
Added static check at PR CI ( #3921 )
...
### What problem does this PR solve?
Added static check at PR CI
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] Refactoring
2024-12-08 21:23:51 +08:00
0d68a6cd1b
Fix errors detected by Ruff ( #3918 )
...
### What problem does this PR solve?
Fix errors detected by Ruff
### Type of change
- [x] Refactoring
2024-12-08 14:21:12 +08:00
99adeabc85
remove dependency ( #1536 )
...
### What problem does this PR solve?
#702
### Type of change
- [x] Refactoring
2024-07-16 16:30:17 +08:00
453c29170f
make sure the models will not be load twice ( #422 )
...
### What problem does this PR solve?
#381
### Type of change
- [x] Refactoring
2024-04-18 09:37:23 +08:00
a5384446e3
let's load model from local ( #163 )
2024-03-28 16:10:47 +08:00
979b3a5b4b
support snapshot download from local ( #153 )
...
* support snapshot download from local
* let snapshot download from local
2024-03-27 09:53:42 +08:00
da21320b88
fix plainPdf bugs ( #152 )
2024-03-26 15:11:07 +08:00
9da671b951
refine manul parser ( #131 )
2024-03-19 12:26:04 +08:00
675a9f8d9a
add dockerfile for cuda envirement. Refine table search strategy, ( #123 )
2024-03-14 19:45:29 +08:00
8f86ab9f7f
refine pdf parser, add time zone to userinfo ( #112 )
2024-03-08 11:24:24 +08:00
7fd1eca582
init README of deepdoc, add picture processer. ( #71 )
...
* init README of deepdoc, add picture processer.
* add resume parsing
2024-02-23 18:28:12 +08:00
d32322c081
rename vision, add layour and tsr recognizer ( #70 )
...
* rename vision, add layour and tsr recognizer
* trivial fixing
2024-02-22 19:11:37 +08:00