Recipe for llama3.1-8b 16nodes with gbs 256/seq 8192 #155

Merged
junjieqian merged 9 commits into AI-Hypercomputer:non-standard-models from incredere:vishwas/recipe-llama3-1-8b
Apr 27, 2026

Conversation

@incredere
Contributor

Run ID: nemo2_training-llama3-1-8b-bf16-seq8192-gbs256-gpus128-2026-02-11_192924-b8db3f67-e0a2-453b-8832-1af78a2f44f5

@dipakg-lang
LGTM

@junjieqian junjieqian merged commit a3aadeb into AI-Hypercomputer:non-standard-models Apr 27, 2026
1 check passed
@incredere incredere deleted the vishwas/recipe-llama3-1-8b branch April 27, 2026 18:22

3 participants