mirror of
https://github.com/infiniflow/ragflow.git
synced 2025-12-08 20:42:30 +08:00
### What problem does this PR solve? #9869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: jinhai <haijin.chn@gmail.com> Signed-off-by: Jin Hai <haijin.chn@gmail.com> Co-authored-by: chanx <1243304602@qq.com> Co-authored-by: balibabu <cike8899@users.noreply.github.com> Co-authored-by: Lynn <lynn_inf@hotmail.com> Co-authored-by: 纷繁下的无奈 <zhileihuang@126.com> Co-authored-by: huangzl <huangzl@shinemo.com> Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com> Co-authored-by: Wilmer <33392318@qq.com> Co-authored-by: Adrian Weidig <adrianweidig@gmx.net> Co-authored-by: Zhichang Yu <yuzhichang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Yongteng Lei <yongtengrey@outlook.com> Co-authored-by: Liu An <asiro@qq.com> Co-authored-by: buua436 <66937541+buua436@users.noreply.github.com> Co-authored-by: BadwomanCraZY <511528396@qq.com> Co-authored-by: cucusenok <31804608+cucusenok@users.noreply.github.com> Co-authored-by: Russell Valentine <russ@coldstonelabs.org> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Billy Bao <newyorkupperbay@gmail.com> Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: TensorNull <129579691+TensorNull@users.noreply.github.com> Co-authored-by: TensorNull <tensor.null@gmail.com> Co-authored-by: TeslaZY <TeslaZY@outlook.com> Co-authored-by: Ajay <160579663+aybanda@users.noreply.github.com> Co-authored-by: AB <aj@Ajays-MacBook-Air.local> Co-authored-by: 天海蒼灆 <huangaoqin@tecpie.com> Co-authored-by: He Wang <wanghechn@qq.com> Co-authored-by: Atsushi Hatakeyama <atu729@icloud.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com> Co-authored-by: Mohamed Mathari <155896313+melmathari@users.noreply.github.com> Co-authored-by: Mohamed Mathari <nocodeventure@Mac-mini-van-Mohamed.fritz.box> Co-authored-by: Stephen Hu <stephenhu@seismic.com> Co-authored-by: Shaun Zhang <zhangwfjh@users.noreply.github.com> Co-authored-by: zhimeng123 <60221886+zhimeng123@users.noreply.github.com> Co-authored-by: mxc <mxc@example.com> Co-authored-by: Dominik Novotný <50611433+SgtMarmite@users.noreply.github.com> Co-authored-by: EVGENY M <168018528+rjohny55@users.noreply.github.com> Co-authored-by: mcoder6425 <mcoder64@gmail.com> Co-authored-by: lemsn <lemsn@msn.com> Co-authored-by: lemsn <lemsn@126.com> Co-authored-by: Adrian Gora <47756404+adagora@users.noreply.github.com> Co-authored-by: Womsxd <45663319+Womsxd@users.noreply.github.com> Co-authored-by: FatMii <39074672+FatMii@users.noreply.github.com>
53 lines
1.8 KiB
Markdown
53 lines
1.8 KiB
Markdown
You are an expert parser and data formatter. Your task is to analyze the provided table of contents (TOC) text and convert it into a valid JSON array of objects.
|
||
|
||
**Instructions:**
|
||
1. Analyze each line of the input TOC.
|
||
2. For each line, extract the following three pieces of information:
|
||
* `structure`: The hierarchical index/numbering (e.g., "1", "2.1", "3.2.5", "A.1"). If a line has no visible numbering or structure indicator (like a main "Chapter" title), use `null`.
|
||
* `title`: The textual title of the section or chapter. This should be the main descriptive text, clean and without the page number.
|
||
3. Output **only** a valid JSON array. Do not include any other text, explanations, or markdown code block fences (like ```json) in your response.
|
||
|
||
**JSON Format:**
|
||
The output must be a list of objects following this exact schema:
|
||
```json
|
||
[
|
||
{
|
||
"structure": <structure index, "x.x.x" or None> (string),
|
||
"title": <title of the section>
|
||
},
|
||
...
|
||
]
|
||
```
|
||
|
||
**Input Example:**
|
||
```
|
||
Contents
|
||
1 Introduction to the System ... 1
|
||
1.1 Overview .... 2
|
||
1.2 Key Features .... 5
|
||
2 Installation Guide ....8
|
||
2.1 Prerequisites ........ 9
|
||
2.2 Step-by-Step Process ........ 12
|
||
Appendix A: Specifications ..... 45
|
||
References ... 47
|
||
```
|
||
|
||
**Expected Output For The Example:**
|
||
```json
|
||
[
|
||
{"structure": null, "title": "Contents"},
|
||
{"structure": "1", "title": "Introduction to the System"},
|
||
{"structure": "1.1", "title": "Overview"},
|
||
{"structure": "1.2", "title": "Key Features"},
|
||
{"structure": "2", "title": "Installation Guide"},
|
||
{"structure": "2.1", "title": "Prerequisites"},
|
||
{"structure": "2.2", "title": "Step-by-Step Process"},
|
||
{"structure": "A", "title": "Specifications"},
|
||
{"structure": null, "title": "References"}
|
||
]
|
||
```
|
||
|
||
**Now, process the following TOC input:**
|
||
```
|
||
{{ toc_page }}
|
||
``` |