@@ -80,13 +80,12 @@ The following summarizes observed runtime efficiency and scaling behaviors of Ma
8080
8181- ** Model Implementation Guides & Source Code:**
8282
83- - ** Llama** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/llama2/run_llama2.md ) | [ Llama2 and Llama3 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/llama2.py ) | [ Llama4 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/llama4.py )
84- - ** Gemma** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/gemma/Run_Gemma.md ) | [ Gemma Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/gemma.py ) | [ Gemma2 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/gemma2.py ) | [ Gemma3 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/gemma3.py )
85- - ** Mixtral** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/mixtral/Run_Mixtral.md ) | [ Mixtral Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/mixtral.py ) | [ Mistral Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/mistral.py )
86- - ** DeepSeek** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/deepseek/Run_DeepSeek.md ) | [ DeepSeek Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/deepseek.py )
87- - ** Qwen3** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/qwen/moe/run_qwen_moe.md ) | [ Qwen3 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/qwen3.py )
88- - ** Qwen3-Next** [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/qwen/next/run_qwen3_next.md ) | [ Qwen3-Next Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/qwen3.py )
89- - ** GPT-OSS** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/gpt_oss/run_gpt_oss.md ) | [ GPT-OSS Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/MaxText/layers/gpt_oss.py )
83+ - ** Llama** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/llama2/run_llama2.md ) | [ Llama2 and Llama3 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/llama2.py ) | [ Llama4 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/llama4.py )
84+ - ** Gemma** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/gemma/Run_Gemma.md ) | [ Gemma Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/gemma.py ) | [ Gemma2 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/gemma2.py ) | [ Gemma3 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/gemma3.py )
85+ - ** Mixtral** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/mixtral/Run_Mixtral.md ) | [ Mixtral Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/mixtral.py ) | [ Mistral Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/mistral.py )
86+ - ** DeepSeek** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/deepseek/Run_DeepSeek.md ) | [ DeepSeek Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/deepseek.py )
87+ - ** Qwen3** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/qwen/moe/run_qwen_moe.md ) | [ Qwen3-Next Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/qwen/next/run_qwen3_next.md ) | [ Qwen3 Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/qwen3.py ) | [ Qwen3-Next Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/qwen3.py )
88+ - ** GPT-OSS** : [ Guide] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/tests/end_to_end/tpu/gpt_oss/run_gpt_oss.md ) | [ GPT-OSS Source] ( https://github.com/AI-Hypercomputer/maxtext/blob/main/src/maxtext/models/gpt_oss.py )
9089
9190- ** Technical Explanations:**
9291
0 commit comments