Commit c90f007
A2: Multi-shard safetensors + Gemma 4B support
- Multi-shard loading: detect index.json, mmap each shard separately
- Tensor name: language_model.model.* prefix for Gemma 4B
- Per-tensor data_base pointer for shard-aware weight access
- Auto-detect gemma-3-4b-it in HF cache (tq_convert)
- All existing tests pass, Qwen3.5 + Gemma 270M verified
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 781cca0 commit c90f007
3 files changed
Lines changed: 387 additions & 82 deletions
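The multi-shard loading described in the commit message can be sketched roughly as follows. This is an illustrative Python sketch, not the code from this commit (the diff content is not shown here). It assumes the Hugging Face convention of an index file named `model.safetensors.index.json` with a `weight_map` from tensor name to shard file, and the safetensors layout of an 8-byte little-endian header length followed by a JSON header. The helper names `open_shard`, `load_model`, `find_tensor`, and `tensor_bytes` are hypothetical, as is the list of candidate prefixes.

```python
import json
import mmap
import struct
from pathlib import Path


def open_shard(path):
    """mmap one .safetensors file; return (mapping, header dict, data_base)."""
    with open(path, "rb") as f:
        mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    # First 8 bytes: little-endian u64 giving the JSON header length.
    (header_len,) = struct.unpack("<Q", mm[:8])
    header = json.loads(mm[8:8 + header_len])
    # Tensor offsets are relative to the first byte after the header,
    # so each shard has its own data_base.
    return mm, header, 8 + header_len


def load_model(model_dir):
    """Return {tensor_name: (mapping, data_base, entry)} for one or many shards."""
    model_dir = Path(model_dir)
    index_path = model_dir / "model.safetensors.index.json"  # assumed file name
    if index_path.exists():
        weight_map = json.loads(index_path.read_text())["weight_map"]
        shard_names = sorted(set(weight_map.values()))
    else:
        shard_names = ["model.safetensors"]  # single-file fallback
    tensors = {}
    for shard_name in shard_names:
        mm, header, data_base = open_shard(model_dir / shard_name)
        for name, entry in header.items():
            if name == "__metadata__":
                continue
            # Store the shard's mapping and data_base alongside each tensor.
            tensors[name] = (mm, data_base, entry)
    return tensors


def find_tensor(tensors, suffix, prefixes=("model.", "language_model.model.")):
    """Try each candidate prefix (hypothetical list) and return the full name."""
    for prefix in prefixes:
        if prefix + suffix in tensors:
            return prefix + suffix
    return None


def tensor_bytes(tensors, name):
    """Slice the raw bytes of one tensor out of its own shard."""
    mm, data_base, entry = tensors[name]
    begin, end = entry["data_offsets"]
    return mm[data_base + begin:data_base + end]
```

Keeping a base offset per tensor, rather than one global base, is what lets a lookup stay correct when tensors live in different mapped files with different header sizes.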