Minitron Pruning — End-to-End Tutorials

End-to-end tutorials for Minitron structured pruning followed by knowledge distillation, quantization, evaluation, and vLLM deployment.

Each subdirectory covers one source model and target size, and includes the full data blend, pruning config, distillation hyperparameters, evaluation results, and throughput benchmarks.

Related