This note records the current best interpretation of the scoped mixed-family result on the held-out mixed_trig_hidden probe.
- Default failure: root-collapse.
- Penalty rescue: early frontier reshaping.
- Solved and unsolved rows share the same first penalized action.
- Separation happens in the next 1-3 steps through hidden-branch access.
This is the strongest current explanation for why recovered HyperKAN improves from 0/60 to 36/60 on held-out mixed composition under beam search with an unconditional root penalty of 2.0.
Baseline recovered HyperKAN on held-out mixed_trig_hidden:
- default greedy:
0/60 - default beam-4:
0/60 - unconditional root penalty
2.0, greedy:24/60 - unconditional root penalty
2.0, beam-4:36/60
From artifacts/scoped_structural_probe_search_checkpoints/hyperkan/penalty_rescue_analysis_v2.json:
- Default top-3 on every mixed-family row:
expr@root::expandexpr@root::cancelexpr@root::together
- Under unconditional root penalty
2.0, the first action changes on all 60 rows, and it changes both site and op every time. - But solved and unsolved rows still share the same penalized top-1 action:
expr@0::trigsimp.
So the rescue is not explained by first-step localization alone.
Early Hidden-Branch Metric
The useful discriminator is early hidden-branch access:
- Solved beam rows reaching any hidden-block site within the first 3 actions:
36/36 - Solved beam rows reaching
expr@2::cancelwithin the first 3 actions:36/36 - Unsolved beam rows reaching any hidden-block site within the first 3 actions:
12/24 - Unsolved beam rows reaching
expr@2::cancelwithin the first 3 actions:12/24
This suggests the bottleneck is not merely selecting a local site. It is getting beam search onto the hidden branch early enough.
A simple inference-time bonus for hidden-branch actions in the first 3 steps was tested on top of the unconditional root penalty:
- config: no retraining, recovered HyperKAN
best_search.pt - search condition:
root_action_penalty = 2.0early_hidden_bonus = 0.5early_hidden_bonus_steps = 3
Result:
- validation beam-4:
14/16with mean expansions57.06 - held-out mixed greedy:
24/60with mean expansions24.8 - held-out mixed beam-4:
36/60with mean expansions56.38
Interpretation:
- This first sequencing bonus does not improve the held-out mixed solve count beyond the current best
36/60. - It does reduce beam expansions on the mixed family relative to the unconditional-penalty baseline (
56.38vs66.67). - It hurts seen-family validation, so it is not yet a better overall eval condition.
- The unconditional root penalty remains the best mixed-family rescue condition.
- Early hidden-branch access is now an explicit, measured mechanism.
- The next useful intervention should target subgoal sequencing / early hidden-branch access more precisely, rather than another generic localization loss.