Commit aad8dba
Phase-2: apply 2 KB cap to BOTH constants and activations - CCT_2 now PASSES
Earlier 'cap-on-constants-only' (a8694f7) made CCT_2 fail 10/10
because its 7x 32 KB residual / LayerNorm-output activations hit the
write-side mirror of the §10.1 codegen bug (writer layer tile-loops
output to a fixed L2 base address, every tile clobbers the previous).
The §10.1 bug is symmetric: any buffer >2 KB tiled across the
producer's output dimension corrupts under promotion, regardless of
whether it's a static-PI_L2 ConstantBuffer (read-side) or a
dynamically-allocated MEMORYARENA_L2 VariableBuffer (write-side).
Apply the cap uniformly.
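
A minimal sketch of the uniform cap as a promotion predicate follows. It is an illustration only, not the code changed in this commit: `Buffer`, `may_promote`, `PROMOTION_CAP_BYTES`, and the `cap_activations` switch are hypothetical names, and treating exactly 2 KB as safe is an assumption drawn from the ">2 KB" wording above.

```python
from dataclasses import dataclass

# The 2 KB cap named in this commit (the constant name is hypothetical).
PROMOTION_CAP_BYTES = 2 * 1024


@dataclass(frozen=True)
class Buffer:
    """Hypothetical stand-in for a buffer considered for promotion."""
    name: str
    size_bytes: int
    is_constant: bool  # True: static ConstantBuffer; False: dynamically allocated VariableBuffer


def may_promote(buf: Buffer, cap_activations: bool = True) -> bool:
    """Return True if the buffer passes the size cap.

    cap_activations=False models the earlier 'cap-on-constants-only'
    behaviour, where activations were never size-checked.
    """
    if buf.is_constant or cap_activations:
        return buf.size_bytes <= PROMOTION_CAP_BYTES
    return True


weights = Buffer("small_weights", 1024, is_constant=True)
split = Buffer("x3_tensor_split", 16 * 1024, is_constant=False)

print(may_promote(weights))                       # passes the cap
print(may_promote(split))                         # blocked by the uniform cap
print(may_promote(split, cap_activations=False))  # allowed under const-only
```

With the cap applied to both kinds of buffer, the 16 KB `x3_tensor_split` activation is rejected, which matches the cost described below.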
Cost: IC100 / miniMobileNet lose their large-activation gains
(x3_tensor_split 16 KB blocked), back to const-only level.
microLlama1 retains its activation gains (all activations <2 KB
already). All 5 models PASS:
| Model          |      const-only   | act+cap+users>=2  | Δ vs baseline |
|----------------|------------------:|------------------:|--------------:|
| IC100          |   2 389 072 cyc   |   2 389 072 cyc   |       -3.9 %  |
| microLlama1    |   3 475 840 cyc   |   3 095 426 cyc   |      -26.8 %  |
| miniMobileNet  |     134 946 cyc   |     134 946 cyc   |      -29.6 %  |
| AnomalyDet 200 |     509 558 cyc   |     509 738 cyc   |      -15.3 %  |
| CCT_2          | 309 888 350 cyc   | 309 893 862 cyc   |  ~0 % (PASS)  |
Correctness wins over peak perf for IC100 / miniMobileNet: recovering
the large-activation gains there requires the proper Phase-3 fix
(per-tile L1↔L2 source/destination advancement when the buffer is
at the closure's externalMemory level).
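
The write-side clobbering and the per-tile advancement that Phase 3 calls for can be illustrated with a toy model. This is a sketch under stated assumptions, not the generated code: `write_tiles` and its `advance` flag are hypothetical, and plain Python lists stand in for the L1 tiles and the L2 buffer.

```python
def write_tiles(tiles, advance):
    """Copy each L1 output tile into a flat L2 buffer and return the buffer.

    advance=False models the bug: every tile is written to the same fixed
    L2 base address. advance=True models per-tile destination advancement.
    """
    l2 = [0] * sum(len(t) for t in tiles)
    offset = 0
    for tile in tiles:
        l2[offset:offset + len(tile)] = tile
        if advance:
            offset += len(tile)  # move the destination past the tile just written
    return l2


tiles = [[1, 2], [3, 4], [5, 6]]

clobbered = write_tiles(tiles, advance=False)
advanced = write_tiles(tiles, advance=True)

print(clobbered)  # [5, 6, 0, 0, 0, 0]  only the last tile survives
print(advanced)   # [1, 2, 3, 4, 5, 6]
```

A buffer small enough to fit in a single tile never takes the second loop iteration, so a fixed base address cannot overwrite an earlier tile; that is one way to read why the 2 KB cap sidesteps the bug.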
1 file changed
Lines changed: 16 additions & 11 deletions