Commit 678f4ba
Add excludeNamePatterns to PromoteTensorsToL2Greedy
Default-skip Conv-weight buffers ("_conv_layers_", "_conv_weight",
"_conv_bias"). Phase-1 bisection showed that promoting one specific
FP32 Conv weight (tokenizer_conv_layers_0_0_weight, 6912B) breaks
CCT_2_32_32_128 output equality 10/10 by itself, while all 21 other
small constants in the candidate set promote safely.
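
The bisection mentioned above can be pictured with a rough sketch. It assumes there is exactly one culprit and that it fails on its own, as the message reports. `find_single_culprit`, `fake_breaks`, and the `hypothetical_const_*` names are illustrative and not part of the repository; in practice the oracle would build the network with only the given subset promoted and compare outputs.

```python
from typing import Callable, List, Sequence


def find_single_culprit(
    candidates: Sequence[str],
    breaks: Callable[[Sequence[str]], bool],
) -> str:
    """Bisect to the one candidate whose promotion makes `breaks` return True.

    Assumption: exactly one culprit exists and it fails by itself.
    """
    pool: List[str] = list(candidates)
    if not breaks(pool):
        raise ValueError("the full candidate set does not reproduce the failure")
    while len(pool) > 1:
        half = pool[: len(pool) // 2]
        # Keep whichever half still reproduces the failure.
        pool = half if breaks(half) else pool[len(half):]
    return pool[0]


CULPRIT = "tokenizer_conv_layers_0_0_weight"  # name taken from the commit message

# 21 made-up constant names plus the reported culprit (22 candidates in total).
names = [f"hypothetical_const_{i}" for i in range(21)] + [CULPRIT]


def fake_breaks(subset: Sequence[str]) -> bool:
    """Stand-in oracle; a real one would run the model and compare outputs."""
    return CULPRIT in subset


culprit = find_single_culprit(names, fake_breaks)
```

With 22 candidates this needs only a handful of oracle calls, which matters when each call is a full build-and-run cycle.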
This is a partial defensive workaround — the AnomalyDetection int8
bug and the larger CCT proj-bias / positional-embedding cases hit the
same per-tile weight DMA codegen bug from a different angle and are
NOT fixed by this filter. See AI_AGENT Phase-0 doc §10.1 for the
full minimal-repro table and the proposed kernel-side fix.

1 parent 95a097c
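
The diff body is not shown here, so the following is only a sketch of what a name-based exclusion filter of this kind might look like. It assumes the patterns are matched as plain substrings (the real pass may use regexes or globs). `filter_promotion_candidates` and `hypothetical_small_constant` are illustrative names, not the actual `PromoteTensorsToL2Greedy` API.

```python
from typing import Iterable, List, Sequence

# Default patterns quoted from the commit message.
DEFAULT_EXCLUDE_NAME_PATTERNS = ("_conv_layers_", "_conv_weight", "_conv_bias")


def filter_promotion_candidates(
    names: Iterable[str],
    exclude_name_patterns: Sequence[str] = DEFAULT_EXCLUDE_NAME_PATTERNS,
) -> List[str]:
    """Return the names that contain none of the excluded substrings.

    Assumption: plain substring matching, not regex or glob matching.
    """
    return [
        name
        for name in names
        if not any(pattern in name for pattern in exclude_name_patterns)
    ]


candidates = [
    "tokenizer_conv_layers_0_0_weight",  # buffer named in the commit message
    "hypothetical_small_constant",       # made-up name for illustration
]
kept = filter_promotion_candidates(candidates)
```

Passing an empty pattern list disables the filter. A filter like this only hides buffers by name, which fits the message's point that the other failing cases are not covered by it.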
1 file changed
Lines changed: 13 additions & 1 deletion
Diff line content was not preserved; hunk ranges recovered from the line-number columns:
@@ -98,7 +98,8 @@ (1 deletion, 2 additions)
@@ -113,6 +114,15 @@ (9 additions)
@@ -173,6 +183,8 @@ (2 additions)