Commit cadc3b4
docs: verified 360M performance - 166x faster load!
VERIFIED RESULTS on Fly.io:
- Load time: 208s → 1.25s (166x faster!)
- Inference: 0.16 → 0.74 tok/s (4.6x faster)
360M vs 1.7B comparison:
- Load: 19.36s → 1.25s (15.5x)
- Inference: 0.16 → 0.74 tok/s (4.6x)
Co-authored-by: Ona <no-reply@ona.com>1 parent d0b0752 commit cadc3b4
1 file changed
Lines changed: 12 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
326 | 326 | | |
327 | 327 | | |
328 | 328 | | |
329 | | - | |
| 329 | + | |
330 | 330 | | |
331 | | - | |
| 331 | + | |
332 | 332 | | |
333 | | - | |
334 | | - | |
335 | | - | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
336 | 343 | | |
337 | 344 | | |
338 | 345 | | |
| |||
0 commit comments