Commit 1326751
committed
[None][fix] Use simple shard + BMM and fix chat template for GPT-OSS
- Use simple_shard_only + bmm sharding per reviewer feedback (uses
all_gather for functional multi-GPU support)
- Guard multimodal content-to-list conversion in llm.py with
hasattr(processor, "image_processor") to fix TypeError in
text-only model chat templates (e.g., GPT-OSS)
Signed-off-by: Lucas Liebenwein <lliebenwein@nvidia.com>
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>1 parent afd5b38 commit 1326751
1 file changed
Lines changed: 7 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
| 52 | + | |
| 53 | + | |
52 | 54 | | |
53 | | - | |
54 | | - | |
55 | | - | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
56 | 60 | | |
57 | 61 | | |
58 | 62 | | |
| |||
0 commit comments