Commit 0ea9f8b
feat(e2e): fix CLI, add streaming spec, E2E inference working
- Fix error_reporter.zig ArrayList API (Zig 0.13 compatibility)
- Fix cli.zig std.io.getStdOut() usage
- Create specs/tri/streaming_loader.vibee for large model support
- Fix tri_inference.zig missing ternary_output_norm field
- Add 10 streaming/large model tests (25 total passing)
- Update PERFORMANCE_COMPARISON.md with E2E results
E2E Results (TinyLlama-1.1B):
- Conversion: 638 MB GGUF → 497 MB TRI (22% smaller)
- Load time: 4.3 seconds
- Inference: 1.98 tok/s (needs SIMD optimization)
Co-authored-by: Ona <no-reply@ona.com>
1 parent 7ca5843 commit 0ea9f8b
5 files changed
Lines changed: 564 additions & 14 deletions
File tree
- docs
- specs/tri
- src/vibeec