Skip to content

feat: Track the table extraction method#4346

Merged
vladimir-kivi-ds merged 2 commits into
mainfrom
vk/track-table-extraction-method
Apr 28, 2026
Merged

feat: Track the table extraction method#4346
vladimir-kivi-ds merged 2 commits into
mainfrom
vk/track-table-extraction-method

Conversation

@vladimir-kivi-ds
Copy link
Copy Markdown
Contributor

@vladimir-kivi-ds vladimir-kivi-ds commented Apr 28, 2026

Note

Low Risk
Adds a new optional metadata field and wires it through PDF partitioning; behavior is additive and low risk aside from potential downstream consumers expecting a fixed metadata schema.

Overview
Adds table-provenance metadata for extracted tables. ElementMetadata now includes an optional table_extraction_method (e.g., grid, tatr, vlm) and includes it in metadata consolidation.

During PDF partitioning, the value is propagated from each LayoutElement onto the resulting element metadata, enabling downstream consumers to identify which table-extraction algorithm produced a given table. Version is bumped to 0.22.26 and the changelog is updated accordingly.

Reviewed by Cursor Bugbot for commit fe6135a. Bugbot is set up for automated code reviews on this repo. Configure here.

@vladimir-kivi-ds vladimir-kivi-ds added this pull request to the merge queue Apr 28, 2026
Merged via the queue into main with commit 199f255 Apr 28, 2026
54 checks passed
@vladimir-kivi-ds vladimir-kivi-ds deleted the vk/track-table-extraction-method branch April 28, 2026 21:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants