Fun project!
I've needed file-level listings with indexes for a long time for delta tables. Typical usage is getting visibility into delta default settings, compaction outcomes, and understanding zorder or liquid clustering. I did work arounds for a while, but last week, I finally sorted out some alpha solutions for parsing delta logs robustly. delta-rs looked promising for high level library calls, but it is quite behind databricks default tables (deletion vectors & checkpoint v2 were the last incompatibilities).
Is it possible to use this library to get the file-level indexes? I expect this may be complicated by deletion vectors, but that is not an issue for my main use cases of append-only IoT data.
Fun project!
I've needed file-level listings with indexes for a long time for delta tables. Typical usage is getting visibility into delta default settings, compaction outcomes, and understanding zorder or liquid clustering. I did work arounds for a while, but last week, I finally sorted out some alpha solutions for parsing delta logs robustly. delta-rs looked promising for high level library calls, but it is quite behind databricks default tables (deletion vectors & checkpoint v2 were the last incompatibilities).
Is it possible to use this library to get the file-level indexes? I expect this may be complicated by deletion vectors, but that is not an issue for my main use cases of append-only IoT data.