You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: wire batch_size_bytes to Python and public Rust API (#6428)
## Summary
Stacked on #6388. Please merge that PR first.
- Adds `batch_size_bytes: Option<u64>` to `FileReaderOptions` and
propagates it through all 6 `SchedulerDecoderConfig` creation sites in
the file reader
- Adds `batch_size_bytes` field + setter to `Scanner`, wired through
both `scan_fragments` (via `LanceScanConfig`) and `pushdown_scan` (via
`FileReaderOptions` in `ScanConfig`)
- Adds `batch_size_bytes` to `LanceScanConfig`, with `try_new_v2`
injecting it into `FragReadConfig` via `FileReaderOptions`
- Exposes `batch_size_bytes` in the Python API:
`LanceDataset.scanner()`, `to_table()`, `to_batches()`, `ScannerBuilder`
## Test plan
- [x] `cargo check -p lance-file -p lance --tests` — clean
- [x] `cargo clippy -p lance-file -p lance --tests -- -D warnings` —
clean
- [x] `cargo fmt --all` — applied
- [x] `cargo test -p lance-encoding -- byte_sized` — 3/3 pass
- [x] `cargo test -p lance -- test_scan` — 38/38 pass
🤖 Generated with [Claude Code](https://claude.com/claude-code)
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
0 commit comments