### What problem does this PR solve?

Adds support for parsing images with OCR or a vision-language model (VLM).

### Type of change

- [x] New Feature (non-breaking change which adds functionality)