Skip to content
Discussion options

You must be logged in to vote

Hi @pradeept! I'm Dosu and I’m helping the docling team.

There's no built-in Google Cloud Vision integration in Docling, but you don't need to rely on the third-party docling-google-ocr package (or downgrade Docling) — you can write your own OCR plugin using Docling's plugin system.

Docling uses a pluggy-based plugin architecture that lets you register custom OCR engines. The steps are:

  1. Create an options class extending OcrOptions with a unique kind discriminator [1].

  2. Create an OCR model class extending BaseOcrModel and implement the __call__() method to process page images and return TextCell results [2].

  3. Register your plugin via a setuptools entrypoint in your pyproject.toml [3]:

    [

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@Manamama
Comment options

Answer selected by pradeept
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants