Commit 0013cc8
Switch to fp16 ONNX model (87MB → 44MB)
No frontend code changes needed — onnxruntime handles fp16 internally
with float32 I/O. Max logit diff vs fp32 is 0.04 (negligible). Bump
model version to '3' to force re-download of cached models.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 4c2554e commit 0013cc8
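The version bump described in the commit message works because a cached model is only reused when its stored version matches the current one. The following is a minimal sketch of that idea, not the repository's actual code: `MODEL_VERSION`, `CachedModel`, and `isCacheValid` are hypothetical names, and the real frontend may store and compare the version differently.

```typescript
// Hypothetical constant: the commit sets the model version to '3'.
const MODEL_VERSION = '3'

// Hypothetical shape of a cached model entry (e.g. stored in IndexedDB).
interface CachedModel {
  version: string
  bytes: Uint8Array
}

// A cached entry is reusable only if it exists and was saved under the
// current version string. Any other value forces a fresh download.
function isCacheValid(
  entry: CachedModel | null,
  currentVersion: string,
): boolean {
  return entry !== null && entry.version === currentVersion
}

// Example: an entry saved under an older version is rejected,
// so the caller would fetch the new fp16 file and overwrite the cache.
const staleEntry: CachedModel = { version: 'old', bytes: new Uint8Array(0) }
const freshEntry: CachedModel = { version: MODEL_VERSION, bytes: new Uint8Array(0) }
const staleOk = isCacheValid(staleEntry, MODEL_VERSION)
const freshOk = isCacheValid(freshEntry, MODEL_VERSION)
```

Keying the cache on a version string keeps the invalidation logic independent of the file name, so replacing the model file in place is enough as long as the version changes with it.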
3 files changed
Lines changed: 3 additions & 3 deletions
File tree
- public/maia3
- src
  - components/Common
  - contexts
Binary file not shown.
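The commit message reports a maximum logit difference of 0.04 between the fp16 and fp32 models. A figure like that is typically the largest element-wise absolute difference between the two models' outputs on the same input. The sketch below shows one way to compute it, assuming both outputs are available as `Float32Array` values (consistent with the float32 I/O mentioned in the message); `maxAbsDiff` is a hypothetical helper, not part of the repository or of onnxruntime.

```typescript
// Largest element-wise absolute difference between two logit vectors.
// Hypothetical helper for comparing fp32 and fp16 model outputs.
function maxAbsDiff(a: Float32Array, b: Float32Array): number {
  if (a.length !== b.length) {
    throw new Error(`length mismatch: ${a.length} vs ${b.length}`)
  }
  let max = 0
  for (let i = 0; i < a.length; i++) {
    const diff = Math.abs(a[i] - b[i])
    if (diff > max) max = diff
  }
  return max
}

// Illustrative values only, not real model outputs.
const fp32Logits = new Float32Array([1.0, 2.0, 3.0])
const fp16Logits = new Float32Array([1.0, 2.5, 2.75])
const diff = maxAbsDiff(fp32Logits, fp16Logits)
```

In practice the comparison would be repeated over a representative set of inputs, since a single position may not expose the worst case.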
Diff content not shown. Lines 123 and 178 each changed (1 deletion and 1 addition per line).
Diff content not shown. Line 40 changed (1 deletion and 1 addition).