ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-01-28 22:26:36 +08:00

Author	SHA1	Message	Date
qinling0210	9a5208976c	Put document metadata in ES/Infinity (#12826 ) ### What problem does this PR solve? Put document metadata in ES/Infinity. Index name of meta data: ragflow_doc_meta_{tenant_id} ### Type of change - [x] Refactoring	2026-01-28 13:29:34 +08:00
Zhichang Yu	fd11aca8e5	feat: Implement pluggable multi-provider sandbox architecture (#12820 ) ## Summary Implement a flexible sandbox provider system supporting both self-managed (Docker) and SaaS (Aliyun Code Interpreter) backends for secure code execution in agent workflows. Key Changes: - ✅ Aliyun Code Interpreter provider using official `agentrun-sdk>=0.0.16` - ✅ Self-managed provider with gVisor (runsc) security - ✅ Arguments parameter support for dynamic code execution - ✅ Database-only configuration (removed fallback logic) - ✅ Configuration scripts for quick setup Issue #12479 ## Features ### 🔌 Provider Abstraction Layer 1. Self-Managed Provider (`agent/sandbox/providers/self_managed.py`) - Wraps existing executor_manager HTTP API - gVisor (runsc) for secure container isolation - Configurable pool size, timeout, retry logic - Languages: Python, Node.js, JavaScript - ⚠️ Requires: gVisor installation, Docker, base images 2. Aliyun Code Interpreter (`agent/sandbox/providers/aliyun_codeinterpreter.py`) - SaaS integration using official agentrun-sdk - Serverless microVM execution with auto-authentication - Hard timeout: 30 seconds max - Credentials: `AGENTRUN_ACCESS_KEY_ID`, `AGENTRUN_ACCESS_KEY_SECRET`, `AGENTRUN_ACCOUNT_ID`, `AGENTRUN_REGION` - Automatically wraps code to call `main()` function 3. E2B Provider (`agent/sandbox/providers/e2b.py`) - Placeholder for future integration ### ⚙️ Configuration System - `conf/system_settings.json`: Default provider = `aliyun_codeinterpreter` - `agent/sandbox/client.py`: Enforces database-only configuration - Admin UI: `/admin/sandbox-settings` - Configuration validation via `validate_config()` method - Health checks for all providers ### 🎯 Key Capabilities Arguments Parameter Support: All providers support passing arguments to `main()` function: ```python # User code def main(name: str, count: int) -> dict: return {"message": f"Hello {name}!" * count} # Executed with: arguments={"name": "World", "count": 3} # Result: {"message": "Hello World!Hello World!Hello World!"} ``` Self-Describing Providers: Each provider implements `get_config_schema()` returning form configuration for Admin UI Error Handling: Structured `ExecutionResult` with stdout, stderr, exit_code, execution_time ## Configuration Scripts Two scripts for quick Aliyun sandbox setup: Shell Script (requires jq): ```bash source scripts/configure_aliyun_sandbox.sh ``` Python Script (interactive): ```bash python3 scripts/configure_aliyun_sandbox.py ``` ## Testing ```bash # Unit tests uv run pytest agent/sandbox/tests/test_providers.py -v # Aliyun provider tests uv run pytest agent/sandbox/tests/test_aliyun_codeinterpreter.py -v # Integration tests (requires credentials) uv run pytest agent/sandbox/tests/test_aliyun_codeinterpreter_integration.py -v # Quick SDK validation python3 agent/sandbox/tests/verify_sdk.py ``` Test Coverage: - 30 unit tests for provider abstraction - Provider-specific tests for Aliyun - Integration tests with real API - Security tests for executor_manager ## Documentation - `docs/develop/sandbox_spec.md` - Complete architecture specification - `agent/sandbox/tests/MIGRATION_GUIDE.md` - Migration from legacy sandbox - `agent/sandbox/tests/QUICKSTART.md` - Quick start guide - `agent/sandbox/tests/README.md` - Testing documentation ## Breaking Changes ⚠️ Migration Required: 1. Directory Move: `sandbox/` → `agent/sandbox/` - Update imports: `from sandbox.` → `from agent.sandbox.` 2. Mandatory Configuration: - SystemSettings must have `sandbox.provider_type` configured - Removed fallback default values - Configuration must exist in database (from `conf/system_settings.json`) 3. Aliyun Credentials: - Requires `AGENTRUN_` environment variables (not `ALIYUN_`) - `AGENTRUN_ACCOUNT_ID` is now required (Aliyun primary account ID) 4. Self-Managed Provider: - gVisor (runsc) must be installed for security - Install: `go install gvisor.dev/gvisor/runsc@latest` ## Database Schema Changes ```python # SystemSettings.value: CharField → TextField api/db/db_models.py: Changed for unlimited config length # SystemSettingsService.get_by_name(): Fixed query precision api/db/services/system_settings_service.py: startswith → exact match ``` ## Files Changed ### Backend (Python) - `agent/sandbox/providers/base.py` - SandboxProvider ABC interface - `agent/sandbox/providers/manager.py` - ProviderManager - `agent/sandbox/providers/self_managed.py` - Self-managed provider - `agent/sandbox/providers/aliyun_codeinterpreter.py` - Aliyun provider - `agent/sandbox/providers/e2b.py` - E2B provider (placeholder) - `agent/sandbox/client.py` - Unified client (enforces DB-only config) - `agent/tools/code_exec.py` - Updated to use provider system - `admin/server/services.py` - SandboxMgr with registry & validation - `admin/server/routes.py` - 5 sandbox API endpoints - `conf/system_settings.json` - Default: aliyun_codeinterpreter - `api/db/db_models.py` - TextField for SystemSettings.value - `api/db/services/system_settings_service.py` - Exact match query ### Frontend (TypeScript/React) - `web/src/pages/admin/sandbox-settings.tsx` - Settings UI - `web/src/services/admin-service.ts` - Sandbox service functions - `web/src/services/admin.service.d.ts` - Type definitions - `web/src/utils/api.ts` - Sandbox API endpoints ### Documentation - `docs/develop/sandbox_spec.md` - Architecture spec - `agent/sandbox/tests/MIGRATION_GUIDE.md` - Migration guide - `agent/sandbox/tests/QUICKSTART.md` - Quick start - `agent/sandbox/tests/README.md` - Testing guide ### Configuration Scripts - `scripts/configure_aliyun_sandbox.sh` - Shell script (jq) - `scripts/configure_aliyun_sandbox.py` - Python script ### Tests - `agent/sandbox/tests/test_providers.py` - 30 unit tests - `agent/sandbox/tests/test_aliyun_codeinterpreter.py` - Provider tests - `agent/sandbox/tests/test_aliyun_codeinterpreter_integration.py` - Integration tests - `agent/sandbox/tests/verify_sdk.py` - SDK validation ## Architecture ``` Admin UI → Admin API → SandboxMgr → ProviderManager → [SelfManaged\|Aliyun\|E2B] ↓ SystemSettings ``` ## Usage ### 1. Configure Provider Via Admin UI: 1. Navigate to `/admin/sandbox-settings` 2. Select provider (Aliyun Code Interpreter / Self-Managed) 3. Fill in configuration 4. Click "Test Connection" to verify 5. Click "Save" to apply Via Configuration Scripts: ```bash # Aliyun provider export AGENTRUN_ACCESS_KEY_ID="xxx" export AGENTRUN_ACCESS_KEY_SECRET="yyy" export AGENTRUN_ACCOUNT_ID="zzz" export AGENTRUN_REGION="cn-shanghai" source scripts/configure_aliyun_sandbox.sh ``` ### 2. Restart Service ```bash cd docker docker compose restart ragflow-server ``` ### 3. Execute Code in Agent ```python from agent.sandbox.client import execute_code result = execute_code( code='def main(name: str) -> dict: return {"message": f"Hello {name}!"}', language="python", timeout=30, arguments={"name": "World"} ) print(result.stdout) # {"message": "Hello World!"} ``` ## Troubleshooting ### "Container pool is busy" (Self-Managed) - Cause: Pool exhausted (default: 1 container in `.env`) - Fix: Increase `SANDBOX_EXECUTOR_MANAGER_POOL_SIZE` to 5+ ### "Sandbox provider type not configured" - Cause: Database missing configuration - Fix: Run config script or set via Admin UI ### "gVisor not found" - Cause: runsc not installed - Fix: `go install gvisor.dev/gvisor/runsc@latest && sudo cp ~/go/bin/runsc /usr/local/bin/` ### Aliyun authentication errors - Cause: Wrong environment variable names - Fix: Use `AGENTRUN_` prefix (not `ALIYUN_`) ## Checklist - [x] All tests passing (30 unit tests + integration tests) - [x] Documentation updated (spec, migration guide, quickstart) - [x] Type definitions added (TypeScript) - [x] Admin UI implemented - [x] Configuration validation - [x] Health checks implemented - [x] Error handling with structured results - [x] Breaking changes documented - [x] Configuration scripts created - [x] gVisor requirements documented Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-28 13:28:21 +08:00
Yongteng Lei	b57c82b122	Feat: add kimi-k2.5 (#12852 ) ### What problem does this PR solve? Add kimi-k2.5 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-28 12:41:20 +08:00
Stephen Hu	3a8c848af5	Fix:OSConnection.create_idx 4 arguments (#12862 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/12858 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-28 12:41:01 +08:00
balibabu	fe99905a2b	Refactor: Remove the brute-force deduplication method for agent logs. (#12864 ) ### What problem does this PR solve? Refactor: Remove the brute-force deduplication method for agent logs. ### Type of change - [x] Refactoring	2026-01-28 12:04:30 +08:00
Jin Hai	591870eb6e	Update quickstart (#12866 ) ### What problem does this PR solve? To notify developer use the correct release. ### Type of change - [x] Documentation Update Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-28 11:06:17 +08:00
BitToby	df3d044f03	fix: enable auto-resize for chat input textarea (#12836 ) Closes #12803 ### What problem does this PR solve? The chat input textarea in the Chat UI (and Embed UI) has a fixed height and cannot be resized, causing poor UX when users type messages longer than 2 sentences. The input becomes cramped and difficult to read/edit. Root cause: The `Textarea` component in [NextMessageInput](cci:1://file:///ragflow/web/src/components/message-input/next.tsx:62:0-290:1) had `resize-none` and `field-sizing-content` CSS classes that prevented resizing, and the existing `autoSize` prop was not being utilized. Solution: - Removed `resize-none` and `field-sizing-content` classes - Added `autoSize={{ minRows: 1, maxRows: 8 }}` to enable auto-expand - Added `max-h-40` class to limit maximum height to 160px The textarea now auto-expands from 1 to 8 rows as users type longer messages. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-28 09:53:02 +08:00
Magicbook1108	ee654f08d2	Refact: update description for max_token in embedding #12792 (#12845 ) ### What problem does this PR solve? Refact: update description for max_token in embedding #12792 ### Type of change - [x] Refactoring Co-authored-by: Liu An <asiro@qq.com>	2026-01-28 09:52:32 +08:00
writinwaters	ceff119f89	Docs: Added build Ecommerce customer support guide (#12832 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2026-01-28 09:48:54 +08:00
Liu An	c2e8f90023	feat(ci): Add Redis service port configuration to test environment (#12855 ) ### What problem does this PR solve? Added Redis port calculation and environment variable export to support Redis service in test environment. The port is dynamically assigned based on runner number to prevent conflicts during parallel test execution. Removed by #12685 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-28 09:27:47 +08:00
Jin Hai	702b5b35e8	Fix error handle in RAGFlow CLI (#12829 ) ### What problem does this PR solve? As title. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-27 17:22:23 +08:00
Yongteng Lei	2a758402ad	Fix: Hunyuan cannot work properly (#12843 ) ### What problem does this PR solve? Hunyuan cannot work properly ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-27 17:04:53 +08:00
Angel98518	e77168feba	Fix: Handle whitespace-only question in /retrieval endpoint (#12831 ) ## Description This PR fixes issue #12805 by adding validation to handle whitespace-only questions in the `/retrieval` endpoint. ## Problem Sending a single space `" "` as the `question` parameter to `/retrieval` crashes the request with an `AssertionError`. This happens because: 1. The endpoint doesn't trim or validate the question parameter 2. A whitespace-only string is treated as valid input 3. The retrieval logic only checks for empty strings (which are falsy), but `" "` is truthy 4. Invalid match expressions are constructed, causing an assertion failure in the Elasticsearch layer ## Solution - Trim whitespace from the question parameter before processing - Return an empty result for whitespace-only or empty questions - Prevents the AssertionError and provides expected behavior ## Changes - Added whitespace trimming and validation in `api/apps/sdk/doc.py` - Returns empty result early if question is empty after trimming ## Testing - Tested with single space input - now returns empty result instead of crashing - Tested with empty string - returns empty result - Tested with normal questions - works as expected Fixes #12805 Co-authored-by: Daniel <daniel@example.com>	2026-01-27 15:57:47 +08:00
Stephen Hu	52da81cf9e	Fix:Redis configuration template error in v0.22.1 (#12685 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/12674 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-27 12:47:46 +08:00
Mathias Panzenböck	b36d9744ae	shortcut metadata_condition if there is none (#12835 ) ### What problem does this PR solve? If no `metadata_condition` parameter is given then don't load the metadata of all documents into memory. Instead just pass `doc_ids` as `None` to the `retrieval()` method, which means to use all documents of the given datasets. This is relevant if you have a lot of documents! ### Type of change - [x] Performance Improvement	2026-01-27 12:45:58 +08:00
Yongteng Lei	c8338dec57	Refa: convert RAGFlow MCP server from sync to async (#12834 ) ### What problem does this PR solve? Convert RAGFlow MCP server from sync to async. ### Type of change - [x] Refactoring - [x] Performance Improvement	2026-01-27 12:45:43 +08:00
Yongteng Lei	f096917eeb	Fix: overlap cannot be properly applied (#12828 ) ### What problem does this PR solve? Overlap cannot be properly applied. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-27 12:43:01 +08:00
Jonah Hartmann	413956e9dd	Feat: Add German language support for agent template and various UI elements (#12830 ) ### What problem does this PR solve? This PR updates and extends the german language support in the frontend. Additionally two more elements are handled dynamically now. The interactive Agent is also titled and described in german now. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Jakob <16180662+hauberj@users.noreply.github.com>	2026-01-27 12:42:44 +08:00
Zhichang Yu	6404af0a91	Bump to infinity v0.7.0-dev2 (#12839 ) ### What problem does this PR solve? Bump to infinity v0.7.0-dev2 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --- 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-27 11:48:02 +08:00
Lin Manhui	27a36344d4	Feat: Support PaddleOCR-VL-1.5 interface (#12819 ) ### What problem does this PR solve? This PR adds support to PaddleOCR-VL-1.5 interface to the PaddleOCR PDF Parser. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-27 09:49:46 +08:00
Kevin Hu	e20d56a34c	Fix: metadata update issue (#12815 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-26 18:02:44 +08:00
chanx	1d93519cb2	Fix: Issues with metadata parameter addition failures and single-file chunk saving failures. (#12818 ) ### What problem does this PR solve? Fix: Issues with metadata parameter addition failures and single-file chunk saving failures. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-26 18:00:40 +08:00
Yongteng Lei	13076bb87b	Fix: Parent chunking fails on DOCX files (#12822 ) ### What problem does this PR solve? Fixes parent chunking fails on DOCX files. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-26 17:55:09 +08:00
balibabu	e04cd99ae2	Feat: Add the history field to the agent's system variables. #7322 (#12823 ) ### What problem does this PR solve? Feat: Add the history field to the agent's system variables. #7322 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-26 17:54:30 +08:00
Jin Hai	41905e2569	Update RAGFlow CLI (#12816 ) ### What problem does this PR solve? Improve performance slightly. ### Type of change - [x] Refactoring - [x] Performance Improvement Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-26 12:58:04 +08:00
Stephen Hu	0782a7d3c6	Refactor: improve task cancellation checks in RAPTOR (#12813 ) ### What problem does this PR solve? Introduced a helper method _check_task_canceled to centralize and simplify task cancellation checks throughout RecursiveAbstractiveProcessing4TreeOrganizedRetrieval. This reduces code duplication and improves maintainability. ### Type of change - [x] Refactoring	2026-01-26 11:34:54 +08:00
LIRUI YU	4236a62855	Fix: Cancel tasks before document or datasets deletion to prevent queue blocking (#12799 ) ### What problem does this PR solve? When deleting the knowledge base, the records in the Document and Knowledgebase tables are immediately deleted But there are still a large number of pending task messages in the Redis queue (asynchronous queue) if you did not click on stopping tasks before deleting knowledge base. TaskService.get_task() uses a JOIN query to associate three tables (Task ← Document ← Knowledgebase) Since Document/Knowledgebase have been deleted, the JOIN returns an empty result, even though the Task records still exist task-executor considers the task does not exist ("collect task xxx is unknown"), can only skip and warn log：2026-01-23 16:43:21,716 WARNING 1190179 collect task 110fbf70f5bd11f0945a23b0930487df is unknown 2026-01-23 16:43:21,818 WARNING 1190179 collect task 11146bc4f5bd11f0945a23b0930487df is unknown 2026-01-23 16:43:21,918 WARNING 1190179 collect task 111c3336f5bd11f0945a23b0930487df is unknown 2026-01-23 16:43:22,021 WARNING 1190179 collect task 112471b8f5bd11f0945a23b0930487df is unknown 2026-01-23 16:43:26,719 WARNING 1190179 collect task 112e855ef5bd11f0945a23b0930487df is unknown 2026-01-23 16:43:26,734 WARNING 1190179 collect task 1134380af5bd11f0945a23b0930487df is unknown 2026-01-23 16:43:26,834 WARNING 1190179 collect task 1138cb2cf5bd11f0945a23b0930487df is unknown As a consequence, a large number of such tasks occupy the queue processing capacity, causing new tasks to queue and wait <img width="1910" height="947" alt="9a00f2e0-9112-4dbb-b357-7f66b8eb5acf" src="https://github.com/user-attachments/assets/0e1227c2-a2df-4ef3-ba8f-e04c3f6ef0e1" /> Solution Add logic to stop all ongoing tasks before deleting the knowledge base and Tasks ### Type of change - Bug Fix (non-breaking change which fixes an issue)	2026-01-26 10:45:59 +08:00
Da22wei	9afb5bc136	Add Copilot setting and conventions (#12807 ) ### What problem does this PR solve? Added project instructions for setting up and running the application. ### Type of change - [x] Documentation Update	2026-01-26 10:44:20 +08:00
Kevin Hu	f0fcf8aa9a	Fix: reset conversation variables. (#12814 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-26 10:43:57 +08:00
Jin Hai	274fc5ffaa	Fix RAGFlow CLI bug (#12811 ) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-25 23:08:59 +08:00
writinwaters	80a16e71df	Docs: Added webhook specific configuration tips (#12802 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2026-01-23 22:09:49 +08:00
balibabu	6220906164	Fix: Fixed the error on the login page. (#12801 ) ### What problem does this PR solve? Fix: Fixed the error on the login page. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 18:58:54 +08:00
Jimmy Ben Klieve	fa5284361c	feat: support admin assign superuser in admin ui (#12798 ) ### What problem does this PR solve? Allow superuser(admin) to grant or revoke other superuser. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-23 18:08:46 +08:00
Lynn	f3923452df	Fix: add tokenized content (#12793 ) ### What problem does this PR solve? Add tokenized content es field to query zh message. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 16:56:03 +08:00
chanx	11470906cf	Fix: Metadata time Picker (#12796 ) ### What problem does this PR solve? Fix: Metadata time Picker ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 16:55:43 +08:00
Jin Hai	e1df82946e	RAGFlow CLI: ping server before input password when login user (#12791 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-23 15:03:05 +08:00
Kevin Hu	08c01b76d5	Fix: missing parent chunk issue. (#12789 ) ### What problem does this PR solve? Close #12783 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 12:54:08 +08:00
apps-lycusinc	678392c040	feat(deepdoc): add configurable ONNX thread counts and GPU memory shrinkage (#12777 ) ### What problem does this PR solve? This PR addresses critical memory and CPU resource management issues in high-concurrency environments (multi-worker setups): GPU Memory Exhaustion (OOM): Currently, onnxruntime-gpu uses an aggressive memory arena that does not effectively release VRAM back to the system after a task completes. In multi-process worker setups ($WS > 4), this leads to BFCArena allocation failures and OOM errors as workers "hoard" VRAM even when idle. This PR introduces an optional GPU Memory Arena Shrinkage toggle to mitigate this issue. CPU Oversubscription: ONNX intra_op and inter_op thread counts are currently hardcoded to 2. When running many workers, this causes significant CPU context-switching overhead and degrades performance. This PR makes these values configurable to match the host's actual CPU core density. Multi-GPU Support: The memory management logic has been improved to dynamically target the correct device_id, ensuring stability on systems with multiple GPUs. Transparency: Added detailed initialization logs to help administrators verify and troubleshoot their ONNX session configurations. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: shakeel <shakeel@lollylaw.com>	2026-01-23 11:36:28 +08:00
Julien Deveaux	6be197cbb6	Fix: Use tiktoken for proper token counting in OpenAI-compatible endpoint #7850 (#12760 ) ### What problem does this PR solve? The OpenAI-compatible chat endpoint (`/chats_openai/<chat_id>/chat/completions`) was not returning accurate token usage in streaming responses. The token counts were either missing or inaccurate because the underlying LLM API responses weren't being properly parsed for usage data. This PR adds proper token counting using tiktoken (cl100k_base encoding) as a fallback when the LLM API doesn't provide usage data in streaming chunks. This ensures clients always receive token usage information in the response, which is essential for billing and quota management. Changes: - Add tiktoken-based token counting for streaming responses in OpenAI-compatible endpoint - Ensure `usage` field is always populated in the final streaming chunk - Add unit tests for token usage calculation Fixes #7850 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 09:36:21 +08:00
balibabu	8dd4a41bf8	Feat: Add a web search button to the chat box on the chat page. (#12786 ) ### What problem does this PR solve? Feat: Add a web search button to the chat box on the chat page. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-23 09:33:50 +08:00
chanx	e9453a3971	Fix: Metadata supports precise time selection (#12785 ) ### What problem does this PR solve? Fix: Metadata supports precise time selection ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 09:33:34 +08:00
balibabu	7c9b6e032b	Fix: The minimum size of the historical message window for the classification operator is 1. #12778 (#12779 ) ### What problem does this PR solve? Fix: The minimum size of the historical message window for the classification operator is 1. #12778 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-22 19:45:25 +08:00
Kevin Hu	3beb85efa0	Feat: enhance metadata arranging. (#12745 ) ### What problem does this PR solve? #11564 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-22 15:34:08 +08:00
LIRUI YU	bc7b864a6c	top_k parameter ignored, always returned page_size results (#12753 ) ### What problem does this PR solve? Backend \rag\nlp\search.py Before the fix The top_k parameter was not applied to limit the total number of chunks, and the rerank model also uses the exact whole valid_idx rather than assigning valid_idx = valid_idx[:top] firstly. After the fix The top_k limit is applied to the total results before pagination, using a default value of top = 1024 if top_k is not modified. session.py Before the fix: When the frontend calls the retrieval API with `search_id`, the backend only reads `meta_data_filter` from the saved `search_config`. The `rerank_id`, `top_k`, `similarity_threshold`, and `vector_similarity_weight` parameters are only taken from the direct request body. Since the frontend doesn't pass these parameters explicitly (it only passes `search_id`), they always fall back to default values: - `similarity_threshold` = 0.0 - `vector_similarity_weight` = 0.3 - `top_k` = 1024 - `rerank_id` = "" (no rerank) This means user settings saved in the Search Settings page have no effect on actual search results. After the fix: When a `search_id` is provided, the backend now reads all relevant configuration from the saved `search_config`, including `rerank_id`, `top_k`, `similarity_threshold`, and `vector_similarity_weight`. Request parameters can still override these values if explicitly provided, allowing flexibility. The rerank model is now properly instantiated using the configured `rerank_id`, making the rerank feature actually work. Frontend \web\src\pages\next-search\search-setting.tsx Before the fix search-setting.tsx file, the top_k input box is only displayed when rerank is enabled (wrapped in the rerankModelDisabled condition). If the rerank switch is turned off, the top_k input field will be hidden, but the form value will remain unchanged. In other words: - When rerank is enabled, users can modify top_k (default 1024). - When rerank is disabled, top_k retains the previous value, but it's not visible on the interface. Therefore, the backend will always receive the top_k parameter; it's just that the frontend UI binds this configuration item to the rerank switch. When rerank is turned off, top_k will not automatically reset to 1024, but will retain its original value. After the fix On the contrary, if we switch off the button rerank model, the value top-k will be reset to 1024. By the way, If we use top-k in an individual method, rather than put it into the method retrieval, we can control it separately Now all methods valid Using rerank <img width="2378" height="1565" alt="Screenshot 2026-01-21 190206" src="https://github.com/user-attachments/assets/fa2b0df0-1334-4ca3-b169-da6c5fd59935" /> Not using rerank <img width="2596" height="1559" alt="Screenshot 2026-01-21 190229" src="https://github.com/user-attachments/assets/c5a80522-a0e1-40e7-b349-42fe86df3138" /> Before fixing they are the same ### Type of change - Bug Fix (non-breaking change which fixes an issue)	2026-01-22 15:33:42 +08:00
zhanxin.xu	93091f4551	[Feat]Automatic table orientation detection and correction (#12719 ) ### What problem does this PR solve? This PR introduces automatic table orientation detection and correction within the PDF parser. This ensures that tables in PDFs are correctly oriented before structure recognition, improving overall parsing accuracy. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2026-01-22 12:47:55 +08:00
会敲代码的喵	2d9e7b4acd	Fix: aliyun oss need to use s3 signature_version (#12766 ) ### What problem does this PR solve? Aliyun OSS do not support boto s4 signature_version which will lead to an error: ``` botocore.exceptions.ClientError: An error occurred (InvalidArgument) when calling the PutObject operation: aws-chunked encoding is not supported with the specified x-amz-content-sha256 value ``` According to aliyun oss docs, oss_conn need to use s3 signature_version. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-22 11:43:55 +08:00
天海蒼灆	6f3f69b62e	Feat: API adds audio to text and text to speech functions (#12764 ) ### What problem does this PR solve? API adds audio to text and text to speech functions ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-22 11:20:26 +08:00
chanx	bfd5435087	Fix: After deleting metadata in batches, the selected items need to be cleared. (#12767 ) ### What problem does this PR solve? Fix: After deleting metadata in batches, the selected items need to be cleared. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-22 11:20:11 +08:00
balibabu	0e9fe68110	Feat: Adjust the icons in the chat page's collapsible panel. (#12755 ) ### What problem does this PR solve? Feat: Adjust the icons in the chat page's collapsible panel. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-01-22 09:48:44 +08:00
Jin Hai	89f438fe45	Add ping command to test ping API (#12757 ) ### What problem does this PR solve? As title. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-22 00:18:29 +08:00

1 2 3 4 5 ...

5159 Commits