benchmarks: convergence suite v1 (Ipopt + MadNLP on X gate) by jack-champagne · Pull Request #93 · harmoniqs/DirectTrajOpt.jl

jack-champagne · 2026-05-16T05:06:21Z

Summary

Adds benchmark/convergence/ — a small DTO benchmark environment that
exercises HarmoniqsBenchmarks.jl v0.2.0's new convergence API
(InfidelityConvergence, ipopt_capture, compare_convergence).

Scope is deliberately tight (one IPOPT + one MadNLP testitem on the same
X-gate state-transfer); atoms-demo / spin-qubit-demo / bosonic-demo problems
land in separate follow-up PRs once this harness is proven green.

Depends on: HarmoniqsBenchmarks.jl main (v0.2.0 + convergence schema, not yet registered).

What's included

benchmark/convergence/Project.toml — isolated env with HBJ (main),
Ipopt, MadNLP, NamedTrajectories. DirectTrajOpt pulled in as a path dep
to ../.. so we test the package under PR HEAD, not General.
benchmark/convergence/problem_utils.jl — _make_xgate_prob (bilinear
4D rep + terminal cost + control regularizers) and
_build_convergence_result (rebuilds a BenchmarkResult with a
ConvergenceCriterion attached).
benchmark/convergence/convergence.jl — two @testitems:
- X gate convergence: Ipopt — ipopt_capture() extracts final
  iter_count + inf_pr, builds InfidelityConvergence with
  target_infidelity = 1e-3, feas_tol = 1e-6, asserts converged(...).
- X gate convergence: MadNLP — same problem, MadNLP solver; uses
  post-solve evaluator.constraint_violation as the primal-infeasibility
  proxy (no MadNLP capture analogue yet).
benchmark/convergence/README.md — usage instructions.
.github/workflows/benchmark-convergence.yml — Julia 1.12, 30 min
timeout, JLD2 artifact upload.

Local results

=== X gate convergence (Ipopt) ===
  iters=22  final_inf=4.430e-11  inf_pr=3.307e-09  wall=12.363s  converged=true
=== X gate convergence (MadNLP) ===
  final_inf=3.164e-14  cviol=1.537e-10  wall=13.647s  converged=true
Test Summary: | Pass  Total   Time
Package       |    6      6  37.2s

JLD2 artifacts produced:

xgate_convergence_ipopt_N51_<commit>.jld2
xgate_convergence_madnlp_N51_<commit>.jld2

Test plan

benchmark-convergence workflow green on this PR
Both testitems assert converged(result.convergence) == true
compare_convergence([result]) returns one row per converging benchmark
JLD2 artifacts uploaded as workflow artifact

Out of scope (follow-up PRs)

Atoms-demo convergence (Piccolo / QuantumCollocation downstream)
Spin-qubit-demo convergence
NYU bosonic-demo convergence
MadNLP capture-callback analogue to ipopt_capture()

codecov · 2026-05-20T05:27:02Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

After the deterministic+sweep rework no testitems use :experimental anymore, and the sweeps add ~5 seconds in aggregate on top of a ~10-minute matrix — not worth a parallel filter framework. Drops both INCLUDE_EXPERIMENTAL and INCLUDE_ROBUSTNESS env vars and the ti.tags closure. The robustness sweeps now run on every PR (along with their deterministic baselines). Keeps a `/benchmark/` path filter following Piccolissimo.jl's convention — the benchmark subtree has its own Project.toml + deps + workflow, so its @testItems shouldn't be discovered by `@run_package_tests`. This also preemptively unbreaks PR #93's CI (which fails today because benchmark/convergence/convergence.jl is being picked up). Verified locally on Julia 1.12: 355 pass / 1 broken / 0 fail. TimeConsistency sweep: pass_rate = 1.0 (20/20) NonlinearKnotPoint sweep: pass_rate = 0.8 (16/20, threshold 0.65) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Re-cut of #93 onto current main. The branch had gone stale (it predated #75 + src changes), so a direct merge would have deleted #75's timing suite/docs and reverted src/ — this re-cut adds ONLY the convergence suite + its workflow. - benchmark/convergence/: Ipopt + MadNLP X-gate convergence testitems (InfidelityConvergence + ipopt_capture), report.jl emitting the /bench-convergence dashboard JSON + job-summary table (reuses the shared benchmark/BenchmarkUtils.jl from #75). README frames it as a sanity baseline. - .github/workflows/benchmark-convergence.yml. - Dropped the runtests.jl change: main already excludes benchmark/ testitems (the filter landed with #75). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

… from timing run) Re-cut onto current main (post #75 + #101). Adds the Ipopt+MadNLP X-gate convergence suite (own benchmark-convergence.yml + /bench-convergence dashboard, reusing the shared benchmark/BenchmarkUtils.jl) and extends the timing benchmark.yml filter to skip BOTH alloc_profile and benchmark/convergence/ — those suites have their own workflows + project envs, so the timing run picking them up used the wrong env / blew the 60-min budget (the prior failure here). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

The docs job had no timeout, so it defaulted to 360 min. On 2026-06-08 a deploydocs gh-pages push stalled and ran 90+ min, and because docs.yml's concurrency group does not cancel in-progress main builds, it queued the next main docs deploy behind it. makedocs is ~5 min; 30 min bounds build+deploy with margin and fails fast on a deploy stall instead of blocking the queue. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

This was referenced May 20, 2026

test: deterministic seeds + multi-seed robustness sweeps for flaky tests #45

Draft

bench: Ipopt + MadNLP allocation profile script #71

Closed

jack-champagne force-pushed the benchmarks/convergence-suite-v1 branch from b1a48b8 to 248dde9 Compare June 8, 2026 05:14

jack-champagne force-pushed the benchmarks/convergence-suite-v1 branch from 248dde9 to d106b7a Compare June 8, 2026 06:32

jack-champagne merged commit b481744 into main Jun 8, 2026
2 of 7 checks passed

jack-champagne deleted the benchmarks/convergence-suite-v1 branch June 8, 2026 06:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmarks: convergence suite v1 (Ipopt + MadNLP on X gate)#93

benchmarks: convergence suite v1 (Ipopt + MadNLP on X gate)#93
jack-champagne merged 2 commits into
mainfrom
benchmarks/convergence-suite-v1

jack-champagne commented May 16, 2026

Uh oh!

codecov Bot commented May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jack-champagne commented May 16, 2026

Summary

What's included

Local results

Test plan

Out of scope (follow-up PRs)

Uh oh!

codecov Bot commented May 20, 2026

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant