Docs: minor (#10630)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
This commit is contained in:
writinwaters
2025-10-17 11:41:19 +08:00
committed by GitHub
parent 15838a6673
commit f12290f04b
2 changed files with 32 additions and 3 deletions

View File

@ -3,15 +3,15 @@ sidebar_position: 30
slug: /parser_component slug: /parser_component
--- ---
# Message component # Parser component
A component that sets the parsing rules for your dataset. A component that sets the parsing rules for your dataset.
--- ---
A **Parser** component sets the parsing rules for various file types, including parsing methods for PDFs , fields to parse for Emails, and OCR methods for images. A **Parser** component defines how various file types should be parsed, including parsing methods for PDFs , fields to parse for Emails, and OCR methods for images.
## Scenario ## Scenario
An **parser** component is auto-populated on the ingestion pipeline canvase and always required in an ingestion pipeline workflow. A **Parser** component is auto-populated on the ingestion pipeline canvas and required in all ingestion pipeline workflows.

View File

@ -0,0 +1,29 @@
---
sidebar_position: 30
slug: /indexer_component
---
# Indexer component
A component that defines how chunks are indexed.
---
An **Indexer** component indexes chunks and configures their storage formats in the document engine.
## Scenario
An **Indexer** component is the mandatory ending component for all ingestion pipelines.
## Configurations
### Search method
This setting configures how chunks are stored in the document engine: as full-text, embeddings, or both.
### Filename embedding weight
This setting defines the filename's contribution to the final embedding, which is a weighted combination of both the chunk content and the filename. Essentially, a higher value gives the filename more influence in the final *composite* embedding.
- 0.1: Filename contributes 10% (chunk content 90%)
- 0.5 (maximum): Filename contributes 50% (chunk content 90%)