You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: Update QuantLLM to v2.1 with memory optimization and training enhancements
- Added memory_optimized_tensor_order function to optimize tensor serialization order.
- Updated AutoBatchSizeFinder to use the new torch.amp.autocast context manager.
- Enhanced load_training_data function to combine instruction and output fields for specific text columns.
- Introduced architecture registration and resolution mechanisms in TurboModel for better model loading.
- Improved model saving and exporting functionalities, including support for chunked conversion and smart tensor ordering.
- Added detailed logging and warnings for quantization and GGUF repository handling.
- Refactored quantization checks and model state reporting for clarity and accuracy.
0 commit comments