Commit 228b1c8

Update
1 parent 257711e commit 228b1c8

1 file changed

Lines changed: 1 addition & 1 deletion

src/maxtext/configs/models/deepseek3-test.yml

@@ -35,7 +35,7 @@ routed_scaling_factor: 2.5
 routed_score_func: "sigmoid"
 routed_bias: True
 decoder_block: "deepseek"
-# MLA
+# MLA.
 attention_type: "mla"
 q_lora_rank: 1536
 kv_lora_rank: 512
