Skip to content

feat: integrate with Hudi using Rust-based hudi-core#6873

Draft
xushiyan wants to merge 1 commit into
Eventual-Inc:mainfrom
xushiyan:use-hudi-core-for-integration
Draft

feat: integrate with Hudi using Rust-based hudi-core#6873
xushiyan wants to merge 1 commit into
Eventual-Inc:mainfrom
xushiyan:use-hudi-core-for-integration

Conversation

@xushiyan
Copy link
Copy Markdown
Contributor

@xushiyan xushiyan commented May 4, 2026

Changes Made

  • Add hudi-core as a dep for Hudi integration APIs
  • Remove pyhudi impl

Related Issues

Closes #2154

@github-actions github-actions Bot added the feat label May 4, 2026
@xushiyan xushiyan force-pushed the use-hudi-core-for-integration branch 7 times, most recently from bd1c71c to 42f7fd3 Compare May 4, 2026 21:37
Replace the Python `pyhudi` implementation with a Rust-based HudiScanOperator
backed by `hudi-core`. Handles schema conversion (including Avro map-to-Arrow-Map
rewriting), partition spec parsing with `_hoodie_partition_path` support for
timestamp-based keygen, and S3 credentials provider resolution.

Pin hudi-core to commit 2a5291a with latest API improvements. Add v9 COW test
table fixtures from hudi-rs covering partitioned, non-partitioned, and
timestamp-based keygen tables. Add Hudi integration test suite to the CI
catalog test job with native and ray runners.
@xushiyan xushiyan force-pushed the use-hudi-core-for-integration branch from 42f7fd3 to 7d24f6e Compare May 4, 2026 23:26
@rahil-c
Copy link
Copy Markdown

rahil-c commented May 5, 2026

@madvart @samster25 @jaychia Hi Daft community hope all is well! Was wondering if someone can take a look at this PR (once its out of draft state) when they get a chance. The hudi community is very interested in working toward storing and interacting with multi-modal data.

@xushiyan
Copy link
Copy Markdown
Contributor Author

xushiyan commented May 5, 2026

Thanks. But this is still in draft state. i'll mark this ready for review once it is.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[FEAT] Support snapshot read for Hudi MOR tables

2 participants