feat: OCR as enrichment for pictures in simple pipeline (docx, pptx, html, etc) #2488


Draft
dolfim-ibm wants to merge 1 commit into main from ocr-enrichment


dolfim-ibm (Contributor) commented Oct 17, 2025

This PR makes it possible to run the OCR step on pictures found in documents converted with the SimplePipeline, such as docx, pptx, and html.

Unfinished work (TODO)

  • Actually call the OCR model.
  • Each OCR model currently implements its logic in the call method. For this feature to work, it would be better to refactor and decouple some components.
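One possible shape for the decoupling mentioned in the second TODO item is sketched below. This is only an illustration, not the PR's code and not docling's API: `OcrText`, `ImageOcrEngine`, `FakeEngine`, `ocr_pages`, and `enrich_pictures` are all hypothetical names. The idea is that recognition on a single image lives behind a small interface, so that both a page-oriented caller and a picture-enrichment caller can reuse it.

```python
from dataclasses import dataclass
from typing import Dict, List, Protocol


@dataclass
class OcrText:
    """One recognized text snippet (hypothetical type, not from docling)."""
    text: str
    confidence: float


class ImageOcrEngine(Protocol):
    """Hypothetical interface: recognition only, no page or pipeline logic."""

    def recognize(self, image: bytes) -> List[OcrText]:
        ...


class FakeEngine:
    """Stand-in engine so the sketch runs without a real OCR backend."""

    def recognize(self, image: bytes) -> List[OcrText]:
        return [OcrText(text=f"<{len(image)} bytes>", confidence=1.0)]


def ocr_pages(engine: ImageOcrEngine, page_images: List[bytes]) -> List[List[OcrText]]:
    """Page-oriented caller: one result list per page image."""
    return [engine.recognize(img) for img in page_images]


def enrich_pictures(engine: ImageOcrEngine, pictures: Dict[str, bytes]) -> Dict[str, List[str]]:
    """Picture-oriented caller: maps a picture id to its recognized strings."""
    return {
        pic_id: [cell.text for cell in engine.recognize(img)]
        for pic_id, img in pictures.items()
    }


engine = FakeEngine()
print(ocr_pages(engine, [b"page-1"]))
print(enrich_pictures(engine, {"pic-1": b"abc"}))
```

With a split like this, the enrichment step would depend only on the narrow `recognize` interface rather than on each model's page-processing logic.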

Checklist:

  • Documentation has been updated, if necessary.
  • Examples have been added, if necessary.
  • Tests have been added, if necessary.

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

mergify bot commented Oct 17, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:
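The title rule above can be checked locally with a short script. This is a minimal sketch that assumes the `~=` operator behaves like a regular-expression search against the title; the helper name `is_conventional` is made up for illustration.

```python
import re

# Pattern copied verbatim from the merge protection rule.
CONVENTIONAL_TITLE = re.compile(
    r"^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:"
)


def is_conventional(title: str) -> bool:
    """Return True if the title starts with a conventional-commit prefix."""
    return CONVENTIONAL_TITLE.search(title) is not None


pr_title = "feat: OCR as enrichment for pictures in simple pipeline (docx, pptx, html, etc)"
print(is_conventional(pr_title))
print(is_conventional("Add OCR enrichment"))
```

The pattern accepts an optional scope in parentheses and an optional `!` before the colon, so a title such as `feat(ocr)!: ...` also passes.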


DCO Check Passed

Thanks @dolfim-ibm, all your commits are properly signed off. 🎉


codecov bot commented Oct 17, 2025

Codecov Report

❌ Patch coverage is 90.62500% with 3 lines in your changes missing coverage. Please review.

Files with missing lines            Patch %   Lines
docling/models/ocr_enrichment.py    89.28%    3 Missing ⚠️
