Auto-tagging - Improve location of Caption structure element #377

Merged
MaximPlusov merged 3 commits into auto_tagging from auto-tagging-caption on Apr 10, 2026

@LonelyMidoriya
Contributor

Checklist:

  • Documentation has been updated, if necessary.
  • Examples have been added, if necessary.
  • Tests have been added, if necessary.

@LonelyMidoriya LonelyMidoriya self-assigned this Mar 31, 2026
@coderabbitai
Contributor

coderabbitai Bot commented Mar 31, 2026

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 6dff9f2d-2a26-46e4-a9ab-d27060da4ff2

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


@MaximPlusov MaximPlusov merged commit 193a3b5 into auto_tagging Apr 10, 2026
4 checks passed
@MaximPlusov MaximPlusov deleted the auto-tagging-caption branch April 10, 2026 12:35
bundolee added a commit that referenced this pull request Apr 14, 2026
…377

Replace the 2-pass caption-to-float mapping (521b274) with Raman Kakhnovich's
addKids/addCaptionIfPresent pattern from origin/auto_tagging #377 (193a3b5).

Key changes:
- addKids(): first pass collects Captions by linkedContentId, second pass
  creates non-Caption struct elements (skipping Captions)
- addCaptionIfPresent(): called by Figure/Table/List/Part after struct elem
  creation, attaches Caption as first or last child based on spatial position
- isCaptionFirstChild(): compares caption vs parent bbox to decide placement
- addStructElement() overload with isFirstKid for Caption-first insertion
- Removed: captionToFloat map, floatStructElems map, isFloatElement()
- Kept: heading normalization, Link annotations, Formula, EnrichedImageChunk

Reference: Raman Kakhnovich's origin/auto_tagging 193a3b5 (#377)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
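
The two-pass pattern described in the commit message above can be sketched roughly as follows. This is an illustrative sketch only, not the code merged in 193a3b5: the `BBox`, `Node`, and `StructElem` types, the "Content" placeholder kid, and the centre-height comparison inside `isCaptionFirstChild` are all assumptions made for the example. Only the method names and their roles come from the commit message.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class CaptionPlacementSketch {

    /** Hypothetical bounding box in PDF coordinates (larger y is higher on the page). */
    static final class BBox {
        final double left, bottom, right, top;
        BBox(double left, double bottom, double right, double top) {
            this.left = left; this.bottom = bottom; this.right = right; this.top = top;
        }
        double centerY() { return (bottom + top) / 2; }
    }

    /** Hypothetical layout object; linkedContentId is set only on captions. */
    static final class Node {
        final long id; final String type; final BBox bbox; final Long linkedContentId;
        Node(long id, String type, BBox bbox, Long linkedContentId) {
            this.id = id; this.type = type; this.bbox = bbox; this.linkedContentId = linkedContentId;
        }
    }

    /** Hypothetical structure element holding an ordered list of kids. */
    static final class StructElem {
        final String type;
        final List<StructElem> kids = new ArrayList<>();
        StructElem(String type) { this.type = type; }
    }

    /** Overload-style helper: isFirstKid inserts at index 0 instead of appending. */
    static StructElem addStructElement(StructElem parent, String type, boolean isFirstKid) {
        StructElem elem = new StructElem(type);
        if (isFirstKid) parent.kids.add(0, elem); else parent.kids.add(elem);
        return elem;
    }

    /** Assumed rule: a caption whose centre lies above the parent's centre goes first. */
    static boolean isCaptionFirstChild(BBox caption, BBox parent) {
        return caption.centerY() > parent.centerY();
    }

    /** Attaches the caption linked to this node, if any, as first or last kid. */
    static void addCaptionIfPresent(StructElem elem, Node node, Map<Long, Node> captions) {
        Node caption = captions.get(node.id);
        if (caption != null) {
            addStructElement(elem, "Caption", isCaptionFirstChild(caption.bbox, node.bbox));
        }
    }

    static void addKids(StructElem parent, List<Node> nodes) {
        // First pass: index captions by the id of the content they are linked to.
        Map<Long, Node> captions = new HashMap<>();
        for (Node n : nodes) {
            if ("Caption".equals(n.type) && n.linkedContentId != null) {
                captions.put(n.linkedContentId, n);
            }
        }
        // Second pass: create non-Caption elements, skipping captions.
        // Simplification: captions without a linked id are dropped here, and every
        // node type gets the caption check (the commit limits it to Figure/Table/List/Part).
        for (Node n : nodes) {
            if ("Caption".equals(n.type)) continue;
            StructElem elem = addStructElement(parent, n.type, false);
            addStructElement(elem, "Content", false); // stand-in for the element's own content
            addCaptionIfPresent(elem, n, captions);
        }
    }
}
```

Under these assumptions, a Figure with a caption below it would end up with kids in the order Content, Caption, while a Table with a caption above it would get Caption, Content. Nesting the Caption inside its parent, rather than emitting it as a sibling, is what makes the single `captions` map sufficient in place of the removed captionToFloat and floatStructElems maps.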
bundolee added a commit that referenced this pull request Apr 14, 2026
bundolee added a commit that referenced this pull request Apr 15, 2026
bundolee added a commit that referenced this pull request Apr 15, 2026