Commit 199f255
authored
feat: Track the table extraction method (#4346)
<!-- CURSOR_SUMMARY -->
> [!NOTE]
> **Low Risk**
> Adds a new optional metadata field and wires it through PDF
partitioning; behavior is additive and low risk aside from potential
downstream consumers expecting a fixed metadata schema.
>
> **Overview**
> **Adds table-provenance metadata for extracted tables.**
`ElementMetadata` now includes an optional `table_extraction_method`
(e.g., `grid`, `tatr`, `vlm`) and includes it in metadata consolidation.
>
> During PDF partitioning, the value is propagated from each
`LayoutElement` onto the resulting element metadata, enabling downstream
consumers to identify which table-extraction algorithm produced a given
table. Version is bumped to `0.22.26` and the changelog is updated
accordingly.
>
> <sup>Reviewed by [Cursor Bugbot](https://cursor.com/bugbot) for commit
fe6135a. Bugbot is set up for automated
code reviews on this repo. Configure
[here](https://www.cursor.com/dashboard/bugbot).</sup>
<!-- /CURSOR_SUMMARY -->1 parent f51769b commit 199f255
4 files changed
Lines changed: 14 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
1 | 7 | | |
2 | 8 | | |
3 | 9 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
213 | 213 | | |
214 | 214 | | |
215 | 215 | | |
| 216 | + | |
216 | 217 | | |
217 | 218 | | |
218 | 219 | | |
| |||
267 | 268 | | |
268 | 269 | | |
269 | 270 | | |
| 271 | + | |
270 | 272 | | |
271 | 273 | | |
272 | 274 | | |
| |||
320 | 322 | | |
321 | 323 | | |
322 | 324 | | |
| 325 | + | |
323 | 326 | | |
324 | 327 | | |
325 | 328 | | |
| |||
548 | 551 | | |
549 | 552 | | |
550 | 553 | | |
| 554 | + | |
551 | 555 | | |
552 | 556 | | |
553 | 557 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1443 | 1443 | | |
1444 | 1444 | | |
1445 | 1445 | | |
| 1446 | + | |
| 1447 | + | |
| 1448 | + | |
1446 | 1449 | | |
1447 | 1450 | | |
1448 | 1451 | | |
| |||
0 commit comments