🚀 The feature, motivation and pitch
After merging the infrastructure changes, one additional piece of modeling infra that needs to be removed are the "patches" used to patch models beyond their HuggingFace definition to make them AutoDeploy compatible. The new sharding IR relies on custom modeling files, so we want to move entirely to these.
Alternatives
No response
Additional context
No response
Before submitting a new issue...
🚀 The feature, motivation and pitch
After merging the infrastructure changes, one additional piece of modeling infra that needs to be removed are the "patches" used to patch models beyond their HuggingFace definition to make them AutoDeploy compatible. The new sharding IR relies on custom modeling files, so we want to move entirely to these.
Alternatives
No response
Additional context
No response
Before submitting a new issue...