Commit ecab71e
committed
[NE16] enable --enableStrides for model tests: stride-2 convs on NE16
With the engine-aware DW NHWC fixup pass in place, the previous saturation
failure on stride-2 NE16 dispatch is gone — the root cause was the global
NHWC swap forcing wrong layout, not a HAL-level bug. Add --enableStrides
alongside --enable-3x3 to the model fixture so all 27 MobileNetV1 convs go
to NE16 (no cluster fallback).
gvsoc gap9.evk:
- PW-only: 1 847 256 cyc MAC/Cyc 4.05
- PW + DW-s1 (--enable-3x3): 1 190 437 cyc MAC/Cyc 6.29
- All convs (--enable-3x3 + Strides): 845 217 cyc MAC/Cyc 8.86
Final speedup vs PW-only baseline: 2.19x (-54.2% cycles).
NE16 dispatch count goes from 14 -> 28 (all 27 Convs + the final Gemm-as-PW),
cluster path runs only the residual MaxPool.
All 10 NE16 tests still pass (9 kernels + MobileNetV1).1 parent b8a518a commit ecab71e
1 file changed
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1057 | 1057 | | |
1058 | 1058 | | |
1059 | 1059 | | |
1060 | | - | |
| 1060 | + | |
1061 | 1061 | | |
1062 | 1062 | | |
1063 | 1063 | | |
| |||
0 commit comments