-
Notifications
You must be signed in to change notification settings - Fork 0
Labels
enhancementNew feature or requestNew feature or request
Description
- Handling in-line formulas and multiple bounding boxes corresponding to the same chunk/paragraph of text (e.g. "Hello" and "World" may be transcribed separately and we need to bring them together based on bounding box coords or using layout detection better)
- Improving processing FPS (making specific models more lightweight, adding customization of model size and pipeline model selection) (e.g. remove Omniparser icon detection model and only use the underlying OCR model)
Sub-issues
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request