Commit c96502e
committed
feat(cookbook): update transformers model configuration for Qwen3.5
- Replace generic TransformersModel with Qwen3.5ForConditionalGeneration
- Set custom `_no_split_modules` to {'Qwen3_5DecoderLayer'} for FSDP compatibility
- Use specific model ID 'ms://Qwen/Qwen3.5-4B' instead of generic MODEL_ID
- Remove explicit strategy parameter as it's handled by model configuration1 parent 7816375 commit c96502e
1 file changed
Lines changed: 3 additions & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
62 | 62 | | |
63 | 63 | | |
64 | 64 | | |
65 | | - | |
66 | | - | |
67 | | - | |
68 | | - | |
69 | | - | |
70 | | - | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
71 | 68 | | |
72 | 69 | | |
73 | 70 | | |
| |||
0 commit comments