[None][docs] Update supported models matrix with AD-onboarded architectures by bmarimuthu-nv · Pull Request #248 · nv-auto-deploy/TensorRT-LLM

bmarimuthu-nv · 2026-03-19T05:43:34Z

Summary

Add 15 new architecture entries to the model support matrix for models onboarded via AutoDeploy in this branch
Expand 8 existing architecture entries with broader model family coverage
Each new entry traces to a specific commit in the branch (see commit_model_mapping.txt)
Add [^7] footnote for all AutoDeploy-supported architectures

New architectures

DeepseekV2ForCausalLM, ExaoneForCausalLM, Gemma2ForCausalLM, GemmaForCausalLM, GlmMoeDsaForCausalLM, GraniteMoeHybridForCausalLM, HunYuanDenseV1ForCausalLM, HunYuanMoEV1ForCausalLM, InternLM2ForCausalLM, Olmo2ForCausalLM, OpenELMForCausalLM, Phi4FlashForCausalLM, Phi4VisionRForConditionalGeneration, SeedOssForCausalLM, Starcoder2ForCausalLM

Test plan

Verify markdown renders correctly
Cross-check architecture names against __init__.py diff with upstream/main

🤖 Generated with Claude Code

… architectures Add 15 new architecture entries to the model support matrix for models onboarded via the AutoDeploy backend, and expand existing entries to reflect broader model family coverage from the AD sprint. Signed-off-by: Bala Marimuthu <bmarimuthu@nvidia.com> Signed-off-by: Balamurugan Marimuthu <246387390+bmarimuthu-nv@users.noreply.github.com>

… architectures Add 15 new architecture entries and expand 8 existing entries in the model support matrix to reflect models onboarded via AutoDeploy in this branch. Each new entry traces to a specific commit in the branch. New architectures (all AD-supported via [^7]): - DeepseekV2ForCausalLM, ExaoneForCausalLM, Gemma2ForCausalLM, GemmaForCausalLM, GlmMoeDsaForCausalLM, GraniteMoeHybridForCausalLM, HunYuanDenseV1ForCausalLM, HunYuanMoEV1ForCausalLM, InternLM2ForCausalLM, Olmo2ForCausalLM, OpenELMForCausalLM, Phi4FlashForCausalLM, Phi4VisionRForConditionalGeneration, SeedOssForCausalLM, Starcoder2ForCausalLM Expanded existing architectures: - Cohere2, LlamaForCausalLM, MistralForCausalLM, Phi3ForCausalLM, Qwen2ForCausalLM, Qwen3ForCausalLM, Qwen3MoeForCausalLM, MiniMaxM2ForCausalLM Signed-off-by: Bala Marimuthu <bmarimuthu@nvidia.com> Signed-off-by: Balamurugan Marimuthu <246387390+bmarimuthu-nv@users.noreply.github.com>

bmarimuthu-nv added 2 commits March 18, 2026 22:11

github-actions Bot assigned bmarimuthu-nv Mar 19, 2026

bmarimuthu-nv closed this Mar 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[None][docs] Update supported models matrix with AD-onboarded architectures#248

[None][docs] Update supported models matrix with AD-onboarded architectures#248
bmarimuthu-nv wants to merge 2 commits into
feat/paperclip_maximizerfrom
feat/update-supported-models-matrix-v2

bmarimuthu-nv commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

bmarimuthu-nv commented Mar 19, 2026

Summary

New architectures

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant