Files
ragflow/rag/app
yH 5333e764fc fix: optimize Excel row counting for files with abnormal max_row (#13018)
### What problem does this PR solve?

Some Excel files have abnormal `max_row` metadata (e.g.,
`max_row=1,048,534` with only 300 actual data rows). This causes:
- `row_number()` returns incorrect count, creating 350+ tasks instead of
1
- `list(ws.rows)` iterates through millions of empty rows, causing
system hang

This PR uses binary search to find the actual last row with data.

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] Performance Improvement

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-06 14:43:52 +08:00
..
2025-01-21 20:52:28 +08:00
2025-12-30 11:09:18 +08:00
2026-01-09 17:48:45 +08:00
2025-12-29 12:01:18 +08:00
2026-01-09 17:48:45 +08:00
2026-01-09 17:48:45 +08:00
2025-12-29 12:54:31 +08:00
2025-12-29 12:01:18 +08:00
2026-01-29 14:23:26 +08:00