You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: Add CPU offload, Alpaca dataset, and benchmarking to training script
- KbitLoraModel: add cpu_offload option that wraps per-layer forward
with checkpoint_cpu_offload for inter-layer activation offloading
- train_qlora.py: support Alpaca dataset (tatsu-lab/alpaca) with tokenizer
- train_qlora.py: report tokens/sec, avg step time
- train_qlora.py: add --compare-memory mode for chunked vs unchunked
- train_qlora.py: add --cpu-offload and --grad-accum options
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
0 commit comments