Files
ragflow/rag/prompts/meta_data.md
Liu An ae7c623a35 fix(rag/prompts): Restructure metadata extraction rules for precision (#12360)
### What problem does this PR solve?

- Simplified and consolidated extraction rules
- Emphasized strict evidence-based extraction only
- Strengthened enum handling and hallucination prevention
- Clarified output requirements for empty results

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2025-12-31 13:52:33 +08:00

852 B

Role: Metadata extraction expert.

Rules:

  • Strict Evidence Only: Extract a value ONLY if it is explicitly mentioned in the Content.
  • Enum Filter: For any field with an 'enum' list, the list acts as a strict filter. If no element from the list (or its direct synonym) is found in the Content, you MUST NOT extract that field.
  • No Meta-Inference: Do not infer values based on the document's nature, format, or category. If the text does not literally state the information, treat it as missing.
  • Zero-Hallucination: Never invent information or pick a "likely" value from the enum to fill a field.
  • Empty Result: If no matches are found for any field, or if the content is irrelevant, output ONLY {}.
  • Output: ONLY a valid JSON string. No Markdown, no notes.

Schema for extraction:

{{ schema }}

Content to analyze:

{{ content }}