OptimizationAdjoint for constrained optimization solution sensitivities by jClugstor · Pull Request #1444 · SciML/SciMLSensitivity.jl

jClugstor · 2026-05-18T16:16:56Z

Checklist

Appropriate tests were added
Any code changes were done in a way that does not break public API
All documentation related to code changes were updated
The new code follows the contributor guidelines, in particular the SciML Style Guide and COLPRAC.
Any new documentation only uses public API


jClugstor · 2026-05-19T16:42:14Z

Summary

Adds a new OptimizationAdjoint sensitivity algorithm that computes $dG/dp$ for a downstream loss G evaluated at the optimum $x^*(p)$ of a (possibly constrained) OptimizationProblem, without re-solving the optimization. Handles equality constraints, two-sided inequality constraints (lcons/ucons), and variable box bounds (lb/ub). Sits alongside the existing UnconstrainedOptimizationAdjoint.

opt_sol = solve(prob, NLopt.LD_SLSQP())                # solve once
dgdu!(out, u, _, _, _) = (out .= dG_dx)                # cotangent of loss w.r.t. x*
dp = adjoint_sensitivities(opt_sol, nothing;
                           sensealg = OptimizationAdjoint(),
                           dgdu = dgdu!)

jClugstor · 2026-05-19T17:16:24Z

The Math

In short, this is the same approach as SteadyStateAdjoint, except that the implicit differentiation is done on the nonlinear system made up of the gradient (with respect to $x$) of the Lagrangian

$$ \mathcal{L}(x,y,z,p) = f(x,p) + y^\top g(x,p) + z^\top h(x,p) $$

and the constraint equations all set to zero.

First, though, we need the dual variables (also called KKT multipliers or Lagrange multipliers): the variables that multiply the constraint functions in the Lagrangian and that satisfy the KKT conditions. The forward optimization pass gives us the optimal $x^*$, but not every solver returns the dual variables, so we recover them from the stationarity condition:

$$ [J_g; J_{h_I}]^\top \mu = -\nabla f $$

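Here $\mu = (y, z_I)$ stacks the multipliers of the equality constraints and of the active inequality constraints. The system is typically overdetermined (more x's than active constraints), so we solve it in the least-squares sense via LinearSolve.QRFactorization; under LICQ this recovers the exact multipliers. LICQ (the linear independence constraint qualification) says that at the optimal point, the gradients of the active constraints are all linearly independent.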

QR decomposition finds the least-squares solution, so it is solving

$$ \min_{\mu} \| J^\top \mu + \nabla f \|^2 $$

where $J = [J_g; J_{h_I}]$. If we found a valid local minimum and LICQ holds, the KKT theorem states that a $\mu$ satisfying stationarity exists, so the residual at the minimizer is zero. LICQ also means that $J^\top$ has full column rank, so $\mu$ is unique. The QR solve therefore finds the unique multipliers.
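The multiplier recovery can be illustrated on a toy problem. This is a minimal sketch, not the PR's implementation: the problem and the variable names are assumptions made for illustration, and it uses `qr` from the LinearAlgebra standard library directly rather than LinearSolve.QRFactorization.

```julia
using LinearAlgebra

# Toy problem (an assumption for illustration):
#   minimize  x1^2 + x2^2 + x3^2   subject to  x1 + x2 + x3 - 3 = 0
# Its optimum is x* = (1, 1, 1).
xstar = [1.0, 1.0, 1.0]
grad_f = 2 .* xstar                 # gradient of the objective at x*
J = [1.0 1.0 1.0]                   # Jacobian of the active constraints (1 x 3)

# Stationarity: J' * mu = -grad_f. J' is 3 x 1, so the system is
# overdetermined and is solved in the least-squares sense via QR.
A = permutedims(J)                  # materialize J' as a dense 3 x 1 matrix
mu = qr(A) \ (-grad_f)

# The residual vanishes because exact multipliers exist at a KKT point.
residual = norm(A * mu + grad_f)
```

For this problem the single multiplier comes out as -2, and the least-squares residual is zero up to rounding.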

If any returned component of $z_I$ is negative (a KKT violation), the corresponding constraint was wrongly classified as active; drop it and redo the multiplier solve, repeating until no negative multipliers remain. The sign check plus redo handles boundary degeneracy that proximity-based active-set detection alone gets wrong.
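The sign check and re-solve can be sketched as a small loop. The helper name `refine_active_set`, its signature, and the input data below are hypothetical and chosen only for illustration. The sketch assumes LICQ, so the transposed Jacobian is square or tall, and it tests the sign against exactly zero where a real implementation would likely use a tolerance.

```julia
using LinearAlgebra

# Hypothetical helper, not the PR's code. Rows 1:n_eq of J are equality
# constraints (never dropped); the remaining rows are the inequality
# constraints that were classified as active.
function refine_active_set(J::AbstractMatrix, grad_f::AbstractVector, n_eq::Integer)
    active = collect(1:size(J, 1))
    while true
        isempty(active) && return Float64[], active
        A = permutedims(J[active, :])      # J_active' as a dense matrix
        mu = qr(A) \ (-grad_f)             # least-squares multiplier solve
        # Positions within `active` of inequality rows with a negative multiplier
        bad = [k for k in eachindex(active) if active[k] > n_eq && mu[k] < 0]
        isempty(bad) && return mu, active
        deleteat!(active, bad)             # drop them and solve again
    end
end

# Synthetic data: two candidate active inequalities, no equalities.
J = [1.0 0.0; 0.0 -1.0]
grad_f = [-2.0, -0.1]
mu, active = refine_active_set(J, grad_f, 0)
```

With this synthetic input the first solve assigns the second constraint a negative multiplier, so it is dropped and the second solve keeps only the first constraint.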

Now for the sensitivity. The idea is that a solution of the constrained optimization problem satisfies the Karush-Kuhn-Tucker (KKT) conditions:

$$\nabla_x f(x,p) + \sum_{i=1}^{m_e} y_i \nabla_x g_i(x,p) + \sum_{i=1}^{m_i}z_i \nabla_x h_i(x,p) = 0$$

$$g(x,p) = 0, \quad h(x,p) \le 0, \quad z \ge 0, \quad z^\top h(x,p) = 0$$

Taking only the set of inequality constraints that are actually active, $h_I$, the KKT conditions define a nonlinear system of equations:

$$F(w, p) = \begin{pmatrix} \nabla_x \mathcal{L}(w, p) \\ g(w,p) \\ h_\mathcal{I}(w,p) \end{pmatrix} = 0$$ where $w$ is $$w := (x, y, z_\mathcal{I})$$

The solution of the optimization problem satisfies this nonlinear system of equations.
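As a concrete instance, the KKT residual can be written out for a small equality-constrained problem and checked at its known optimum. The problem, the function name `kkt_residual`, and the closed-form solution are assumptions for illustration only.

```julia
# Toy problem (an assumption for illustration):
#   minimize  0.5 * ((x1 - p1)^2 + (x2 - p2)^2)   subject to  x1 + x2 - 1 = 0
# Lagrangian: L = f + y * g, so w = (x1, x2, y) and there are no inequalities.
function kkt_residual(w, p)
    x1, x2, y = w
    return [
        (x1 - p[1]) + y,    # dL/dx1
        (x2 - p[2]) + y,    # dL/dx2
        x1 + x2 - 1,        # equality constraint g(x, p)
    ]
end

p = [2.0, 1.0]
shift = (p[1] + p[2] - 1) / 2                  # closed-form multiplier y*
wstar = [p[1] - shift, p[2] - shift, shift]    # closed-form (x1*, x2*, y*)
F = kkt_residual(wstar, p)                     # should be the zero vector
```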
Differentiate with respect to $p$, using implicit differentiation:

$$ \frac{\partial F}{\partial w}\frac{\partial w}{\partial{p}} + \frac{\partial F}{\partial p } = 0 $$

The quantity we need is $\frac{\partial w}{\partial p}$, the sensitivity of the solution $w$ of the optimization problem with respect to the parameters.

Solving the equation above for it,

$$\frac{\partial w}{\partial p} = -\left(\frac{\partial F}{\partial w}\right)^{-1}\frac{\partial F}{\partial p}$$
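A small worked example of this formula, under stated assumptions: the toy problem is to minimize $\frac{1}{2}\|x - p\|^2$ subject to $x_1 + x_2 = 1$, with $w = (x_1, x_2, y)$, and the two Jacobians are written out by hand rather than obtained from automatic differentiation.

```julia
using LinearAlgebra

# For the toy problem, F(w, p) = (x1 - p1 + y, x2 - p2 + y, x1 + x2 - 1).
F_w = [1.0 0.0 1.0;
       0.0 1.0 1.0;
       1.0 1.0 0.0]          # dF/dw, the KKT matrix
F_p = [-1.0  0.0;
        0.0 -1.0;
        0.0  0.0]            # dF/dp

# dw/dp = -(dF/dw)^{-1} dF/dp, computed with a linear solve, not an inverse.
dw_dp = -(F_w \ F_p)         # 3 x 2: rows are (x1, x2, y), columns are (p1, p2)
```

The first two rows match the derivative of the closed-form solution, which is the projection of $p$ onto the constraint line. The cost of this forward approach is one linear solve per parameter.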

Now suppose we have a scalar cost function $C(w, p)$ that depends on the solution of the optimization problem and possibly directly on the parameters. By the chain rule,

$$\frac{dC}{dp} = \frac{\partial C}{\partial w}\frac{\partial w}{\partial p } + \frac{\partial C}{\partial p} = \frac{\partial C}{\partial w}\left(- \left(\frac{\partial F}{\partial w}\right)^{-1} \frac{\partial F}{\partial p} \right) + \frac{\partial C}{\partial p}$$

$$\frac{dC}{dp} = \frac{\partial C}{\partial p} - \left(\frac{\partial C}{\partial w} \left(\frac{\partial F}{\partial w}\right)^{-1} \right)\frac{\partial F}{\partial p} $$

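The row vector $\frac{\partial C}{\partial w} \left(\frac{\partial F}{\partial w}\right)^{-1}$ does not require an explicit inverse. Define $\lambda$ as the solution of the linear system

$$\left(\frac{\partial F}{\partial w}\right)^\top \lambda = \left(\frac{\partial C}{\partial w}\right)^\top$$

so that $\lambda^\top = \frac{\partial C}{\partial w} \left(\frac{\partial F}{\partial w}\right)^{-1}$ and

$$\frac{dC}{dp} = \frac{\partial C}{\partial p} - \lambda^\top \frac{\partial F}{\partial p}$$

The product $\lambda^\top \frac{\partial F}{\partial p}$ is a vector-Jacobian product, which is what vecjacobian! does.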
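The adjoint route can be checked numerically against the forward route. This is a minimal sketch, not the PR's code. It assumes a toy problem (minimize $\frac{1}{2}\|x - p\|^2$ subject to $x_1 + x_2 = 1$), hand-written Jacobians, and a hypothetical cost $C(w) = 2x_1 + 3x_2$ with no direct dependence on $p$.

```julia
using LinearAlgebra

# Hand-written Jacobians of F(w, p) = (x1 - p1 + y, x2 - p2 + y, x1 + x2 - 1)
F_w = [1.0 0.0 1.0;
       0.0 1.0 1.0;
       1.0 1.0 0.0]
F_p = [-1.0  0.0;
        0.0 -1.0;
        0.0  0.0]
C_w = [2.0, 3.0, 0.0]        # dC/dw for the hypothetical cost C = 2*x1 + 3*x2

# Adjoint route: one linear solve with the transposed KKT matrix. This
# matrix is symmetric, so F_w' \ C_w coincides with F_w \ C_w here.
lambda = F_w' \ C_w
dC_dp = -(F_p' * lambda)     # vector-Jacobian product; dC/dp partial is zero

# Forward route for comparison: solve for dw/dp, then apply dC/dw.
dw_dp = -(F_w \ F_p)
dC_dp_forward = dw_dp' * C_w
```

Both routes give the same gradient, but the adjoint route needs a single linear solve regardless of the number of parameters, which is why it is the natural choice for a scalar loss.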

jClugstor added 19 commits May 20, 2026 15:32

add OptimizationNLopt to test dependencies

7254ae3

add optimization_adjoint file

ea05457

add OptimizationAdjoint and implement adjoint

1805050

add test

528f5eb

add more tests

db1547a

implement adjoint_sensitivities interface

7697cf4

use appropriate derivatives from OptimizationFunction if available

53c6855

account for lb / ub in optimization problem

86cb4a5

better test

0c3a7c4

finding dual variables doesn't need kwargs

d2d9cba

DifferentiationInterface gradients, with vecjacobian

a7efe25

introduce wrapper types, use type parameters for AD choosing

ee5857a

format

48ad1d7

drop the lazybuffer cache

00f8379

fix tests

6480329

fix compat for QA

a4a15e4

add docs page

561a921

Iterate active-set refinement until no negative multipliers

71a4187

fix local variable declarations

9c6375c

jClugstor force-pushed the full_optimization_adjoint branch from 908f0f0 to 9c6375c Compare May 20, 2026 19:32

jClugstor wants to merge 19 commits into SciML:master from jClugstor:full_optimization_adjoint
