Model pipeline performance improvements #4

@deluzhao

Description

  1. Handle inline formulas and cases where multiple bounding boxes correspond to the same chunk or paragraph of text. For example, "Hello" and "World" may be transcribed separately, and we need to merge them based on bounding box coordinates or by making better use of layout detection.
  2. Improve processing FPS by making specific models more lightweight and by adding options to customize model size and to select which models run in the pipeline. For example, remove the Omniparser icon detection model and use only the underlying OCR model.
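
As a starting point for item 1, here is a minimal sketch of coordinate-based merging. It is not the pipeline's actual code: the `Box` type, the `same_line` and `merge_fragments` helpers, and the `min_overlap` and `max_gap` thresholds are all hypothetical, and it assumes the OCR step returns axis-aligned boxes in pixel coordinates with the origin at the top left.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Hypothetical OCR fragment: text plus axis-aligned pixel corners."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def same_line(a: Box, b: Box, min_overlap: float = 0.5) -> bool:
    """True when the vertical overlap is at least min_overlap of the shorter box's height."""
    overlap = min(a.y1, b.y1) - max(a.y0, b.y0)
    shorter = min(a.y1 - a.y0, b.y1 - b.y0)
    return shorter > 0 and overlap / shorter >= min_overlap


def merge_fragments(boxes: list[Box], max_gap: float = 20.0) -> list[Box]:
    """Group fragments into lines, then join horizontal neighbours within max_gap pixels."""
    # Pass 1: walk the boxes top to bottom (by vertical centre) and group them into lines.
    lines: list[list[Box]] = []
    for box in sorted(boxes, key=lambda b: (b.y0 + b.y1) / 2):
        if lines and same_line(lines[-1][-1], box):
            lines[-1].append(box)
        else:
            lines.append([box])

    # Pass 2: within each line, read left to right and merge close neighbours.
    merged: list[Box] = []
    for line in lines:
        line.sort(key=lambda b: b.x0)
        current = line[0]
        for box in line[1:]:
            if box.x0 - current.x1 <= max_gap:
                # Join the text and take the union of the two boxes.
                current = Box(
                    current.text + " " + box.text,
                    min(current.x0, box.x0),
                    min(current.y0, box.y0),
                    max(current.x1, box.x1),
                    max(current.y1, box.y1),
                )
            else:
                merged.append(current)
                current = box
        merged.append(current)
    return merged
```

With the default thresholds, a "Hello" box at (0, 10, 50, 30) and a "World" box at (60, 11, 110, 31) merge into a single "Hello World" box spanning (0, 10, 110, 31). The thresholds would likely need to scale with font size or image resolution, and this sketch does not address inline formulas or multi-line paragraphs, where layout detection is probably the better tool.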

Labels

enhancement
