Commit 31ec5b4

Merge pull request #244 from weikuo0506/update-a4x-training-readme
docs: Update A4X training benchmarks support matrix and correct paths
2 parents 16ad7f7 + 3696271 commit 31ec5b4

1 file changed

Lines changed: 19 additions & 6 deletions

README.md

@@ -51,12 +51,25 @@ Models | GPU Machine Type

 Models | GPU Machine Type | Framework | Workload Type | Orchestrator | Link to the recipe
 ------------------ | ---------------------------------------------------------------------------------------------------- | --------- | ------------- | ------------ | ------------------
-**Llama-3.1-8B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo | Pre-training | GKE | [Link](./training/a4x/llama3-1-8b/nemo-pretraining-gke/)
-**Llama-3.1-70B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo | Pre-training | GKE | [Link](./training/a4x/llama3-1-70b/nemo-pretraining-gke/)
-**Llama-3.1-405B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo | Pre-training | GKE | [Link](./training/a4x/llama3-1-405b/nemo-pretraining-gke/)
-**Nemotron-4-340B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo | Pre-training | GKE | [Link](./training/a4x/nemotron4-340B/nemo-pretraining-gke/)
-**Wan-2.1-14B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo | Pre-training | GKE | [Link](./training/a4x/wan2-1-14b/nemo-pretraining-gke/)
-**Wan-2.1-14B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo | Pre-training | Slurm | [Link](./training/a4x/wan2-1-14b/nemo-pretraining-slurm/)
+**Llama-3.1-8B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (25.07) | Pre-training | GKE | [Link](./training/a4x/llama3_8b/nemo-gke/nemo2507/)
+**Llama-3.1-8B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | GKE | [Link](./training/a4x/llama3_8b/megatron-bridge-gke/nemo2511/)
+**Llama-3.1-8B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | Slurm | [Link](./training/a4x/llama3_8b/megatron-bridge-slurm/nemo2511/)
+**Llama-3.1-70B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (25.07) | Pre-training | GKE | [Link](./training/a4x/llama3_70b/nemo-gke/nemo2507/)
+**Llama-3.1-70B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (26.02) | Pre-training | GKE | [Link](./training/a4x/llama3_70b/nemo-gke/nemo2602/)
+**Llama-3.1-405B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (25.07) | Pre-training | GKE | [Link](./training/a4x/llama31_405b/nemo-gke/nemo2507/)
+**Llama-3.1-405B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (26.02) | Pre-training | GKE | [Link](./training/a4x/llama31_405b/nemo-gke/nemo2602/)
+**Llama-3.1-405B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (26.02) | Pre-training | GKE | [Link](./training/a4x/llama31_405b/megatron-bridge-gke/nemo2602/)
+**Llama-3.1-405B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.09) | Pre-training | Slurm | [Link](./training/a4x/llama31_405b/megatron-bridge-slurm/nemo2509/)
+**Nemotron-4-340B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (25.09) | Pre-training | GKE | [Link](./training/a4x/nemotron4_340b/nemo-gke/nemo2509/)
+**Wan-2.1-14B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (25.11) | Pre-training | GKE | [Link](./training/a4x/wan_14b/nemo-gke/nemo2511/)
+**Wan-2.1-14B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (26.02) | Pre-training | GKE | [Link](./training/a4x/wan_14b/nemo-gke/nemo2602/)
+**Wan-2.1-14B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | NeMo (25.11) | Pre-training | Slurm | [Link](./training/a4x/wan_14b/nemo-slurm/nemo2511/)
+**DeepSeek-V3** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | GKE | [Link](./training/a4x/deepseek_v3/megatron-bridge-gke/nemo2511/)
+**Qwen-3-235B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | GKE | [Link](./training/a4x/qwen3_235b_a22b/megatron-bridge-gke/nemo2511/)
+**Qwen-3-235B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | Slurm | [Link](./training/a4x/qwen3_235b_a22b/megatron-bridge-slurm/nemo2511/)
+**Qwen-3-30B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | GKE | [Link](./training/a4x/qwen3_30b_a3b/megatron-bridge-gke/nemo2511/)
+**Qwen-3-30B** | [A4X (NVIDIA GB200)](https://cloud.google.com/compute/docs/accelerator-optimized-machines#a4x-vms) | Megatron-Bridge (25.11) | Pre-training | Slurm | [Link](./training/a4x/qwen3_30b_a3b/megatron-bridge-slurm/nemo2511/)
+

 ### Inference benchmarks A3 Mega
