Commit e770377
Antigravity Agent
feat(hslm): Patch #6 - bf16 shadow weights
Add bf16/bf16 shadow weights module for:
- 2× memory reduction vs f32 weights
- Bf16VecF16 type for SIMD operations
- Conversion utilities between bf16 and f32
Based on Railway x86_64 (Intel Xeon 6975P-C) benchmark:
- 52,732 ops/sec with 1.90 µs latency
- 2× lower latency than ARM M1/M2 NEON
See PR description for detailed results and motivation.1 parent e79438b commit e770377
1 file changed
Lines changed: 15 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
0 commit comments