Skip to content

feat: support incremental scan between 2 snapshots#1470

Closed
xxchan wants to merge 4 commits intoapache:mainfrom
xxchan:xxchan/naval-heron
Closed

feat: support incremental scan between 2 snapshots#1470
xxchan wants to merge 4 commits intoapache:mainfrom
xxchan:xxchan/naval-heron

Conversation

@xxchan
Copy link
Copy Markdown
Member

@xxchan xxchan commented Jun 26, 2025

Which issue does this PR close?

close #1469

What changes are included in this PR?

Added to_snapshot_id and from_snapshot_id to the table scan API.

Are these changes tested?

unit test

Comment thread crates/iceberg/src/scan/context.rs Outdated
}
}

struct Ancestors {
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This can be handy for validation in SnapshotProducer as well, maybe we should move this to a new module util/snapshot.rs

Copy link
Copy Markdown
Member Author

@xxchan xxchan Jul 22, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

btw, what kind of validation are you talking about? It seems currently there's no such place where Ancestors can be used

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Snapshot validation is not implemented yet, I have a stale PR for that. My PR also comes with the logic of scanning ancestors, but I like your version much better and would love to reuse what you have here when we restart the snapshot validation work

xxchan added 3 commits July 22, 2025 14:09
Signed-off-by: xxchan <xxchan22f@gmail.com>
Signed-off-by: xxchan <xxchan22f@gmail.com>
Signed-off-by: xxchan <xxchan22f@gmail.com>
@xxchan xxchan force-pushed the xxchan/naval-heron branch from a3f5068 to d87a9d2 Compare July 22, 2025 06:15
This refactoring addresses the PR comment suggesting that the Ancestors
struct could be reused by SnapshotProducer and should be moved to a
dedicated utilities module for better code organization.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@xxchan xxchan force-pushed the xxchan/naval-heron branch from d87a9d2 to d5fb028 Compare July 22, 2025 06:18
@xxchan
Copy link
Copy Markdown
Member Author

xxchan commented Jul 23, 2025

cc @Xuanwo @liurenjie1024 @Fokko

CTTY added a commit to CTTY/iceberg-rust that referenced this pull request Aug 20, 2025
let (delete_file_idx, delete_file_tx) = DeleteFileIndex::new();

let manifest_list = plan_context.get_manifest_list().await?;
// For incremental scan, disable delete file processing
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we processed delete files, your PR would essentially almost be producing a changelog, like in Spark, right? Perhaps not actionable right now, but mentioning this as a possible extension in future.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cool, that would be useful

CTTY added a commit to CTTY/iceberg-rust that referenced this pull request Nov 5, 2025
@github-actions
Copy link
Copy Markdown
Contributor

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.

@github-actions github-actions Bot added the stale label Feb 26, 2026
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Mar 5, 2026

This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

@github-actions github-actions Bot closed this Mar 5, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

support incremental scan between 2 snapshots

3 participants