Skip to content

Commit f1c9b89

Browse files
committed
debug: add PW_block_0_Cout5 single-kernel test for integration bug
New test matching the integrated blocks.0.pw first-tile shape (C_in=8, C_out=5, HW=48×48) to isolate whether the 40% dW[0] drift in integrated MobileNet is a kernel-level bug or integration bug. Result: standalone test passes bit-exact (2/40 err at FP32 noise level 1e-6 rel). Kernel itself works correctly. Bug is integration-specific.
1 parent f28e418 commit f1c9b89

1 file changed

Lines changed: 3 additions & 0 deletions

File tree

debug/gen_kernel_tests.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -423,6 +423,9 @@ def build_gradw_generic(out_dir: Path, Ci: int, Co: int, Hi: int, Wi: int,
423423
# block_0 PW 8→16 HW=48 (biggest PW shape in MobileNet — most stressful for L1)
424424
("PW_block_0", "X", 8, 16, 48, 48, 48, 48, 1, 1, 1, 0, 1),
425425
("PW_block_0", "W", 8, 16, 48, 48, 48, 48, 1, 1, 1, 0, 1),
426+
# blocks.0.pw integrated C_out tile shape — reproduces integration-only bug
427+
# where C_out=5 < NUM_CORES=8 produces ~40% off dW[0]
428+
("PW_block_0_Cout5","W", 8, 5, 48, 48, 48, 48, 1, 1, 1, 0, 1),
426429
# block_1 DW stride-2 (C=16 HW 48→24)
427430
("DW_block_1_s2", "X", 16, 16, 48, 48, 24, 24, 3, 3, 2, 1, 16),
428431
("DW_block_1_s2", "W", 16, 16, 48, 48, 24, 24, 3, 3, 2, 1, 16),

0 commit comments

Comments
 (0)