Files
ragflow/docs/guides/dataset/extract_table_of_contents.md
Jimmy Ben Klieve 867ec94258 revert white-space changes in docs (#12557)
### What problem does this PR solve?

Trailing white-spaces in commit 6814ace1aa
got automatically trimmed by code editor may causes documentation
typesetting broken.

Mostly for double spaces for soft line breaks.  

### Type of change

- [x] Documentation Update
2026-01-13 09:41:02 +08:00

1.7 KiB

sidebar_position, slug, sidebar_custom_props
sidebar_position slug sidebar_custom_props
4 /enable_table_of_contents
categoryIcon
LucideTableOfContents

Extract table of contents

Extract table of contents (TOC) from documents to provide long context RAG and improve retrieval.


During indexing, this technique uses LLM to extract and generate chapter information, which is added to each chunk to provide sufficient global context. At the retrieval stage, it first uses the chunks matched by search, then supplements missing chunks based on the table of contents structure. This addresses issues caused by chunk fragmentation and insufficient context, improving answer quality.

:::danger WARNING Enabling TOC extraction requires significant memory, computational resources, and tokens. :::

Prerequisites

The system's default chat model is used to summarize clustered content. Before proceeding, ensure that you have a chat model properly configured:

Set default models

Quickstart

  1. Navigate to the Configuration page.

  2. Enable TOC Enhance.

  3. To use this technique during retrieval, do either of the following:

    • In the Chat setting panel of your chat app, switch on the TOC Enhance toggle.
    • If you are using an agent, click the Retrieval agent component to specify the dataset(s) and switch on the TOC Enhance toggle.

Frequently asked questions

Will previously parsed files be searched using the TOC enhancement feature once I enable TOC Enhance?

No. Only files parsed after you enable TOC Enhance will be searched using the TOC enhancement feature. To apply this feature to files parsed before enabling TOC Enhance, you must reparse them.