Skip to content

Commit 951be6e

Browse files
committed
BENCHMARKS: split-schedule validation — hydrophobes rescued, polars marginally improved
Methane pred at split schedule: +2.05 kcal/mol, expt +2.00, residual +0.05. A 3.9 kcal/mol fix vs the M5 Max pilot-1 run's -1.85 that started this investigation. At 11x1000 smoke params, well within FreeSolv production-run noise. Ethanol pred at split schedule: +8.49, residual +13.5 — a ~1.5 kcal/mol improvement over the old schedule's +10.06, but most of the overshoot survives. The split schedule closes the water- penetration channel cleanly for nonpolar solutes; polars expose a second pathology. Best hypothesis: slow H-bond network equilibration in stage B of the split schedule (waters reorienting around partially-charged ligand as lambda_elec sweeps 0->1 with full LJ repulsion already on). At 1000 production steps = 1 ps per window, water hasn't had time to rearrange. Test running autonomously: ethanol at 11x5000 and 11x10000 on split schedule. Monotonic decrease with sampling = slow convergence (fix by running longer). Flat = third bug.
1 parent b89dd51 commit 951be6e

1 file changed

Lines changed: 26 additions & 0 deletions

File tree

BENCHMARKS.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -669,6 +669,32 @@ This is what yank/perses use. Our code couples them linearly,
669669
which is the source of the water-penetration artifact. Fix is a
670670
sampling.py schedule change, not a FF or sampler rewrite.
671671

672+
**Split-schedule validation (commit b89dd51), 2026-04-24:**
673+
674+
| compound | old pred | split pred | expt | split residual |
675+
|---|---:|---:|---:|---:|
676+
| methane | −1.85 (M5 Max) | **+2.05** | **+2.00** | **+0.05**|
677+
| ethanol | +10.06 (post-sign-fix) | +8.49 | −5.01 | +13.50 |
678+
679+
Hydrophobes: **fully rescued**. Methane 0.05 kcal/mol from expt
680+
at 11×1000 smoke sampling — a 3.9 kcal/mol fix vs the M5 Max
681+
run that triggered this investigation.
682+
683+
Polars: **marginally improved** (~1.5 kcal/mol on ethanol), most
684+
of the overshoot survives. The split schedule closes the water-
685+
penetration channel but exposes a second polar-specific
686+
pathology — most likely slow H-bond network equilibration during
687+
stage B, where waters need to reorient around the partially-
688+
charged ligand as λ_elec sweeps 0 → 1 with full LJ already on.
689+
At 1000 production steps per window (1 ps), that reorganisation
690+
hasn't happened yet.
691+
692+
Test queued autonomously: ethanol at 11×5000 and 11×10000 on the
693+
split schedule. If residual drops monotonically with sampling
694+
time, polar is "just slow convergence" — needs the production
695+
25 000-step budget on GPU. If residual stays flat at ~+13, a
696+
third bug exists that we haven't found.
697+
672698
---
673699

674700
## 1.3 Alchemical FEP — Milestone B scaffold (binding ΔG / ΔΔG)

0 commit comments

Comments
 (0)