Mirror of https://github.com/infiniflow/ragflow.git, synced 2025-12-08 12:32:30 +08:00
doc: change to chunk_token num (#8590)
### What problem does this PR solve?

https://github.com/infiniflow/ragflow/issues/8556

### Type of change

- [x] Documentation Update
@@ -881,7 +881,7 @@ curl --request PUT \
      {
           "name": "manual.txt",
           "chunk_method": "manual",
-          "parser_config": {"chunk_token_count": 128}
+          "parser_config": {"chunk_token_num": 128}
      }'
 
 ```
@@ -910,7 +910,7 @@ curl --request PUT \
 - `"parser_config"`: (*Body parameter*), `object`
   The configuration settings for the dataset parser. The attributes in this JSON object vary with the selected `"chunk_method"`:
   - If `"chunk_method"` is `"naive"`, the `"parser_config"` object contains the following attributes:
-    - `"chunk_token_count"`: Defaults to `256`.
+    - `"chunk_token_num"`: Defaults to `256`.
     - `"layout_recognize"`: Defaults to `true`.
     - `"html4excel"`: Indicates whether to convert Excel documents into HTML format. Defaults to `false`.
     - `"delimiter"`: Defaults to `"\n"`.
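
For illustration, a request body using the renamed key for the `"naive"` chunk method might be assembled as in the sketch below. The helper name `build_naive_parser_config` is hypothetical and not part of RAGFlow. The attribute names and default values are taken from the documentation hunk above, and nothing here contacts a server.

```python
import json


# Hypothetical helper (not part of RAGFlow): assembles a "parser_config"
# object for the "naive" chunk method, using the documented defaults.
def build_naive_parser_config(chunk_token_num=256, layout_recognize=True,
                              html4excel=False, delimiter="\n"):
    return {
        "chunk_token_num": chunk_token_num,  # documented name after this change
        "layout_recognize": layout_recognize,
        "html4excel": html4excel,
        "delimiter": delimiter,
    }


# JSON body in the same shape as the curl example in this diff.
body = json.dumps({
    "chunk_method": "naive",
    "parser_config": build_naive_parser_config(chunk_token_num=128),
})
print(body)
```

Keeping the defaults in one place means a caller only overrides the attributes it needs, and the old `"chunk_token_count"` spelling cannot slip into the payload by accident.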
@@ -461,7 +461,7 @@ dataset = rag_object.list_datasets(id='id')
 dataset = dataset[0]
 doc = dataset.list_documents(id="wdfxb5t547d")
 doc = doc[0]
-doc.update([{"parser_config": {"chunk_token_count": 256}}, {"chunk_method": "manual"}])
+doc.update([{"parser_config": {"chunk_token_num": 256}}, {"chunk_method": "manual"}])
 ```
 
 ---
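
Callers that still build configurations with the old key could rename it before calling `update`. The following is a minimal sketch under that assumption. `migrate_parser_config` is a hypothetical helper, not part of the RAGFlow SDK, and it operates on plain dictionaries only.

```python
# Hypothetical helper (not part of the RAGFlow SDK): renames the old
# "chunk_token_count" key to "chunk_token_num" in a parser_config dict.
def migrate_parser_config(parser_config):
    migrated = dict(parser_config)  # shallow copy; the input is left unchanged
    if "chunk_token_count" in migrated:
        old_value = migrated.pop("chunk_token_count")
        # If both keys are present, the new-style value is kept.
        migrated.setdefault("chunk_token_num", old_value)
    return migrated


legacy = {"chunk_token_count": 256}
print(migrate_parser_config(legacy))
```

The migrated dictionary could then be passed in the same call shape as the example in this diff, for instance `doc.update([{"parser_config": migrate_parser_config(legacy)}])`.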