docs(datafusion): add user documentation for DataFusion integration #2029
xbattlax wants to merge 5 commits into apache:main
Add comprehensive user documentation for the DataFusion integration that covers SQL-based table operations, catalog integration, and query optimization features.

Changes:
- Add datafusion.md documentation page with setup, SQL operations, metadata tables, partitioned tables, and configuration options
- Add datafusion_integration.rs example with annotated code sections
- Update SUMMARY.md to include new documentation page
- Add required dependencies to examples crate

Closes apache#2027
853219a to 4830a74
- Change catalog name from "iceberg" to "my_catalog" to clarify it has no special meaning
- Move external table code to separate example file
- Remove sections not suited for end-users: External Tables, Table Provider Types, Creating Partitioned Tables (Rust API), Query Optimization
- Add clarification that table properties must be set via Iceberg catalog API
68f9f95 to 6cb8fbd
6cb8fbd to 7f6bfda
This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.
@liurenjie1024 do you have any other points on this PR?
This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.
This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.
Summary
Add comprehensive user documentation for the DataFusion integration, addressing issue #2027.
Changes
- Add website/src/datafusion.md documentation page covering:
  - IcebergCatalogProvider
  - CREATE TABLE, INSERT INTO, SELECT
  - Metadata tables ($snapshots, $manifests)
  - IcebergTableProviderFactory
- Add crates/examples/src/datafusion_integration.rs working example with annotated code sections for documentation includes
- Update website/src/SUMMARY.md to include new documentation page
- Add required dependencies to crates/examples/Cargo.toml
Notes
This addresses the documentation request from #2027, which noted substantial progress in DataFusion integration (per epic #1382). The documentation covers all major features merged including:
- CREATE TABLE for DataFusion (#1972)
- insert_into for IcebergTableProvider (#1600), feat(datafusion): Support INSERT INTO partitioned tables (#1827)
Closes #2027