Skip to content

feat: add Docling document converter#3066

Merged
julian-risch merged 13 commits into
deepset-ai:mainfrom
vagenas:add-docling-integration
Apr 3, 2026
Merged

feat: add Docling document converter#3066
julian-risch merged 13 commits into
deepset-ai:mainfrom
vagenas:add-docling-integration

Conversation

@vagenas
Copy link
Copy Markdown
Contributor

@vagenas vagenas commented Mar 31, 2026

Part of #3097

Proposed Changes:

Adding Docling document converter.

How did you test it?

(cd integrations/docling && hatch run test:all)

Notes for the reviewer

  • Current import path of distribution package is from docling_haystack.converter import DoclingConverter.
  • Import path introduced in this PR is from haystack_integrations.components.converters.docling import DoclingConverter

=> Presumably the PyPI package will become a shim to accordingly redirect the exposed packages for backwards compatibility?

Checklist

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
@vagenas vagenas requested a review from a team as a code owner March 31, 2026 11:10
@vagenas vagenas requested review from anakin87 and removed request for a team March 31, 2026 11:10
@github-actions github-actions Bot added topic:CI type:documentation Improvements or additions to documentation labels Mar 31, 2026
@julian-risch julian-risch self-requested a review March 31, 2026 11:12
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
@anakin87 anakin87 removed their request for review March 31, 2026 12:07
@CLAassistant
Copy link
Copy Markdown

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

Copy link
Copy Markdown
Member

@julian-risch julian-risch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you so much @vagenas for the great and super quick collaboration!
I added backward compatibility for the old import paths and a test for that. In addition, I slightly changed how serialization works. There was a small issue. I ensure backward compatibility for deserialization too.
I will transfer the open GitHub issues from the docling repository and will comment on the open PRs there too. Feel free to archive the docling repo and point to haystack-core-integrations.

@julian-risch julian-risch merged commit 9e73670 into deepset-ai:main Apr 3, 2026
14 checks passed
@vagenas vagenas deleted the add-docling-integration branch April 13, 2026 08:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

topic:CI type:documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants