Can Docling Be Integrated with Google Cloud Vision for OCR? #3365

pradeept · 2026-04-25T17:27:21Z


While researching online, I came across the package docling-google-ocr. However, when I tried to install it, it conflicted with the version of the Docling parser I’m using. To make it work, I would need to downgrade Docling, which would cause me to lose several features.

Is there another way to use Google Vision for OCR without downgrading Docling?

Thank you.

Answered by dosubot[bot]


@pradeept · 2026-04-25T17:31:53Z

dosubot[bot]
Bot Apr 25, 2026

Hi @pradeept! I'm Dosu and I’m helping the docling team.

There's no built-in Google Cloud Vision integration in Docling, but you don't need to rely on the third-party docling-google-ocr package (or downgrade Docling) — you can write your own OCR plugin using Docling's plugin system.

Docling uses a pluggy-based plugin architecture that lets you register custom OCR engines. The steps are:

1. Create an options class extending OcrOptions with a unique kind discriminator [1].
2. Create an OCR model class extending BaseOcrModel and implement the __call__() method to process page images and return TextCell results [2].
3. Register your plugin via a setuptools entry point in your pyproject.toml [3]:

   ```toml
   [project.entry-points."docling"]
   google_vision_ocr = "your_package.module"
   ```

4. Use it with allow_external_plugins=True:

   ```python
   from docling.datamodel.pipeline_options import PdfPipelineOptions

   # YourGoogleVisionOcrOptions is the options class you wrote in step 1.
   pipeline_options = PdfPipelineOptions(
       do_ocr=True,
       allow_external_plugins=True,
       ocr_options=YourGoogleVisionOcrOptions(lang=["en"]),
   )
   ```
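The entry point above only names a module; it does not say what that module has to expose. As far as I can tell from how Docling's bundled engines are registered, the module is expected to provide an ocr_engines() factory that returns the model classes under an "ocr_engines" key. The sketch below is a minimal illustration under that assumption. YourGoogleVisionOcrModel is a hypothetical placeholder, not a real Docling class, and the hook name should be checked against the plugin docs for your Docling version.

```python
# your_package/module.py -- the (hypothetical) module named in the entry point.


class YourGoogleVisionOcrModel:
    """Placeholder for the OCR model class.

    In a real plugin this would subclass Docling's BaseOcrModel and
    implement __call__(); it is left empty here so the sketch runs
    without Docling installed.
    """


def ocr_engines():
    # Assumption: Docling's plugin manager calls this factory and reads
    # the list of OCR model classes from the "ocr_engines" key.
    return {"ocr_engines": [YourGoogleVisionOcrModel]}
```

If the factory is missing or returns a different shape, the likely symptom is that Docling does not recognise your options class's kind even though the package is installed.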

The docling-surya plugin is a good reference implementation to follow — it's a third-party OCR plugin that follows exactly this pattern [4]. Your Google Vision plugin would work similarly, but instead of running a local model, you'd call the Google Cloud Vision API in the __call__() method and map the results to Docling's TextCell format [5].
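To make the mapping step more concrete: Google Cloud Vision reports each detected string with a bounding polygon of four vertices, while an OCR cell needs an axis-aligned rectangle in page coordinates. The sketch below shows only that geometric conversion. OcrWord and polygon_to_word are hypothetical names for illustration, not Docling's TextCell or any Vision client type, and the scale parameter assumes the page image was rendered larger than the page before being sent to the API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class OcrWord:
    """Hypothetical stand-in for one OCR result (not Docling's TextCell)."""
    text: str
    left: float
    top: float
    right: float
    bottom: float


def polygon_to_word(
    text: str, vertices: List[Tuple[float, float]], scale: float = 1.0
) -> OcrWord:
    """Convert a bounding polygon to an axis-aligned box in page units.

    vertices are (x, y) pixel coordinates in the rendered image;
    scale is the assumed image-pixels-per-page-unit render factor.
    """
    xs = [x for x, _ in vertices]
    ys = [y for _, y in vertices]
    return OcrWord(
        text=text,
        left=min(xs) / scale,
        top=min(ys) / scale,
        right=max(xs) / scale,
        bottom=max(ys) / scale,
    )


# A slightly skewed word box from an image rendered at 2x.
word = polygon_to_word("Invoice", [(30, 12), (150, 14), (149, 48), (29, 46)], scale=2.0)
```

Taking the min and max of the vertices keeps the whole word inside the box even when the scan is slightly rotated, at the cost of a somewhat larger rectangle.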

One note: there's a reported issue where the --allow-external-plugins CLI flag may not work properly [6], so using the Python API as shown above is the more reliable approach for now.


1 reply

Manamama May 26, 2026

Mind you: https://github.com/DS4SD/docling-surya is 404 as of today.
