[None][docs] Update supported models matrix with AD-onboarded architectures #248

Closed
bmarimuthu-nv wants to merge 2 commits into feat/paperclip_maximizer from feat/update-supported-models-matrix-v2
Conversation

@bmarimuthu-nv

Summary

  • Add 15 new architecture entries to the model support matrix for models onboarded via AutoDeploy in this branch
  • Expand 8 existing architecture entries with broader model family coverage
  • Each new entry traces to a specific commit in the branch (see commit_model_mapping.txt)
  • Add [^7] footnote for all AutoDeploy-supported architectures

New architectures

DeepseekV2ForCausalLM, ExaoneForCausalLM, Gemma2ForCausalLM, GemmaForCausalLM, GlmMoeDsaForCausalLM, GraniteMoeHybridForCausalLM, HunYuanDenseV1ForCausalLM, HunYuanMoEV1ForCausalLM, InternLM2ForCausalLM, Olmo2ForCausalLM, OpenELMForCausalLM, Phi4FlashForCausalLM, Phi4VisionRForConditionalGeneration, SeedOssForCausalLM, Starcoder2ForCausalLM

Test plan

  • Verify markdown renders correctly
  • Cross-check architecture names against __init__.py diff with upstream/main
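
The cross-check step in the test plan could be scripted roughly as below. This is a minimal sketch, not the procedure used for this PR: the function names `extract_architectures` and `cross_check` are hypothetical, the regex assumes architecture names end in `ForCausalLM` or `ForConditionalGeneration` (so an entry such as `Cohere2`, as written in the commit message, would need manual review), and it assumes the `__init__.py` diff is available as unified-diff text.

```python
import re

# Assumption: architecture identifiers look like "<Name>ForCausalLM" or
# "<Name>ForConditionalGeneration", as in the list in this PR.
ARCH_RE = re.compile(r"\b[A-Z][A-Za-z0-9]*For(?:CausalLM|ConditionalGeneration)\b")


def extract_architectures(text: str) -> set[str]:
    """Return every architecture-like identifier found in the text."""
    return set(ARCH_RE.findall(text))


def cross_check(matrix_text: str, diff_text: str) -> tuple[set[str], set[str]]:
    """Compare names in the docs matrix with names added in a unified diff.

    Returns (docs_only, code_only): names documented but not added in the
    diff, and names added in the diff but missing from the docs.
    """
    # Keep only added lines ("+..."), skipping the "+++ b/file" header line.
    added = "\n".join(
        line[1:]
        for line in diff_text.splitlines()
        if line.startswith("+") and not line.startswith("+++")
    )
    in_docs = extract_architectures(matrix_text)
    in_code = extract_architectures(added)
    return in_docs - in_code, in_code - in_docs


if __name__ == "__main__":
    # Illustrative inputs only; real inputs would be the matrix markdown
    # and the output of a git diff against upstream/main.
    docs = "| `Gemma2ForCausalLM` | ... |\n| `Olmo2ForCausalLM` | ... |"
    diff = (
        "+++ b/__init__.py\n"
        "+from .gemma2 import Gemma2ForCausalLM\n"
        "+from .starcoder2 import Starcoder2ForCausalLM\n"
    )
    docs_only, code_only = cross_check(docs, diff)
    print("Documented but not in diff:", sorted(docs_only))
    print("In diff but not documented:", sorted(code_only))
```

Two empty result sets would indicate that the matrix and the diff agree on names matching the assumed pattern; anything else is a candidate for manual review.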

🤖 Generated with Claude Code

… architectures

Add 15 new architecture entries to the model support matrix for models
onboarded via the AutoDeploy backend, and expand existing entries to
reflect broader model family coverage from the AD sprint.

Signed-off-by: Bala Marimuthu <bmarimuthu@nvidia.com>
Signed-off-by: Balamurugan Marimuthu <246387390+bmarimuthu-nv@users.noreply.github.com>
… architectures

Add 15 new architecture entries and expand 8 existing entries in the
model support matrix to reflect models onboarded via AutoDeploy in
this branch. Each new entry traces to a specific commit in the branch.

New architectures (all AD-supported via [^7]):
- DeepseekV2ForCausalLM, ExaoneForCausalLM, Gemma2ForCausalLM,
  GemmaForCausalLM, GlmMoeDsaForCausalLM, GraniteMoeHybridForCausalLM,
  HunYuanDenseV1ForCausalLM, HunYuanMoEV1ForCausalLM,
  InternLM2ForCausalLM, Olmo2ForCausalLM, OpenELMForCausalLM,
  Phi4FlashForCausalLM, Phi4VisionRForConditionalGeneration,
  SeedOssForCausalLM, Starcoder2ForCausalLM

Expanded existing architectures:
- Cohere2, LlamaForCausalLM, MistralForCausalLM, Phi3ForCausalLM,
  Qwen2ForCausalLM, Qwen3ForCausalLM, Qwen3MoeForCausalLM,
  MiniMaxM2ForCausalLM

Signed-off-by: Bala Marimuthu <bmarimuthu@nvidia.com>
Signed-off-by: Balamurugan Marimuthu <246387390+bmarimuthu-nv@users.noreply.github.com>