Feat: parse email (#10181)

### What problem does this PR solve?

- Add email support to the dataflow.
- Fix the old email parser.
- Add new dependencies for parsing `.msg` files.

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
- [x] Other (please describe): add new dependencies.
Lynn
2025-09-22 09:29:38 +08:00
committed by GitHub
parent b5d6a6e8f2
commit 028c2d83e9
7 changed files with 309 additions and 6 deletions

@@ -496,7 +496,7 @@ class FileService(CommonService):
             return ParserType.AUDIO.value
         if re.search(r"\.(ppt|pptx|pages)$", filename):
             return ParserType.PRESENTATION.value
-        if re.search(r"\.(eml)$", filename):
+        if re.search(r"\.(msg|eml)$", filename):
             return ParserType.EMAIL.value
         return default
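
The effect of this hunk can be illustrated with a minimal, self-contained sketch. It is not the project's actual code: the `ParserType` members and their string values, the standalone `get_parser` function, and the `"naive"` default are all assumptions made for illustration. Only the two regular expressions are taken from the diff.

```python
import re
from enum import Enum


class ParserType(Enum):
    # Hypothetical subset of the enum; names and values are assumed.
    PRESENTATION = "presentation"
    EMAIL = "email"
    NAIVE = "naive"


def get_parser(filename: str, default: str = ParserType.NAIVE.value) -> str:
    """Pick a parser type from the file extension (hypothetical helper)."""
    if re.search(r"\.(ppt|pptx|pages)$", filename):
        return ParserType.PRESENTATION.value
    # After the change, both .msg and .eml files map to the email parser.
    if re.search(r"\.(msg|eml)$", filename):
        return ParserType.EMAIL.value
    return default


if __name__ == "__main__":
    for name in ("report.eml", "report.msg", "slides.pptx", "notes.txt"):
        print(name, "->", get_parser(name))
```

Both patterns are case-sensitive as written, so in this sketch a name such as `REPORT.MSG` falls through to the default. The hunk does not show whether the caller normalizes the filename beforehand.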