docs: Clarify pymupdf.layout import order for OCR support #4882

Deepaksaini00 · 2026-01-30T07:54:48Z

Summary

This PR improves the PyMuPDF4LLM documentation around OCR support. It clarifies how OCR actually works, what dependencies are required, and highlights an important import-order requirement that isn’t currently obvious from the docs.

Key points

Documented that pymupdf.layout must be imported before pymupdf4llm for OCR heuristics to run
Added an explanation of when and how OCR is triggered (image-only pages, unreadable text, partial OCR)
Listed the full set of requirements (layout import, use_ocr, Tesseract, OpenCV)
Added a small decision-flow diagram to make the behavior easier to understand

This is a documentation-only update — no code or runtime behavior is changed.

Fixes #4833

…uPDF4LLM

github-actions · 2026-01-30T07:55:01Z

Thank you for your submission, we really appreciate it. Like many open-source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution. You can sign the CLA by just posting a Pull Request Comment same as the below format.

I have read the CLA Document and I hereby sign the CLA

_{You can retrigger this bot by commenting recheck in this Pull Request. Posted by the CLA Assistant Lite bot.}

docs: document OCR support and required pymupdf.layout import for PyM…

40d5dbd

…uPDF4LLM

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Clarify pymupdf.layout import order for OCR support #4882

docs: Clarify pymupdf.layout import order for OCR support #4882

Deepaksaini00 commented Jan 30, 2026

Uh oh!

github-actions bot commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

docs: Clarify pymupdf.layout import order for OCR support #4882

Are you sure you want to change the base?

docs: Clarify pymupdf.layout import order for OCR support #4882

Conversation

Deepaksaini00 commented Jan 30, 2026

Summary

Key points

Uh oh!

github-actions bot commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant