Change default almost_fair_crps_alpha from 0.95 to 1.0 by mcgibbon · Pull Request #1139 · ai2cm/ace

mcgibbon · 2026-05-07T18:14:02Z

Changes the default almost_fair_crps_alpha in EnsembleLoss from 0.95 (almost-fair CRPS) to 1.0 (fair CRPS). The almost-fair modification was originally motivated by avoiding unconstrained ensemble members when one member exactly matches the target, but in practice the loss is smooth in this regime and the gradient signal is sufficient without the modification.

Changes:

fme.core.loss.EnsembleLoss: change almost_fair_crps_alpha default from 0.95 to 1.0
Tests added
If dependencies changed, "deps only" image rebuilt and "latest_deps_only_image.txt" file updated

Depends on #1138

Ran benchmark on 4-degree daily era5-only no-co2 training:

alpha=0.95: https://wandb.ai/ai2cm/ace/runs/injiirnf
alpha=1.0: https://wandb.ai/ai2cm/ace/runs/ozsyoxtz

Adds FiniteDifferenceCRPSLoss which computes CRPS on spatial finite differences, with optional multi-level coarsening via avg_pool2d. Integrates into EnsembleLoss via finite_difference_crps_weight and finite_difference_crps_levels parameters. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

mcgibbon · 2026-05-07T18:14:24Z

This is motivated by a) never having shown that afCRPS is helpful for our use case, and b) the theoretical motivation for afCRPS breaking down when we include energy score or finite difference CRPS as part of the loss, and c) the original theoretical motivation for afCRPS not being fully convincing, and the paper not clearly expressing it was done to fix an encountered issue vs just chosen for theoretical reasons.

Regarding b), afCRPS exists for the case where one of the predictions equals exactly the target, making it so the CRPS doesn't constrain the other target at all. However, we never only use CRPS in practice, and the other loss terms (energy score and finite difference CRPS) require many outputs to exactly equal the target, which is not a risk.

Regarding c), the failure mode really just means that sample won't contribute to gradient updates - the behavior for epsilon differences from the target is smooth, with small gradients/updates that reduce to zero as one of the two ensemble values approaches the target.

Finally, I'm a little worried that our SSR metrics are consistently a bit uncalibrated. It would be nice to remove this as a potential source of that mis-calibration.

jpdunc23 · 2026-05-07T18:53:15Z

        finite_difference_crps_weight: float = 0.0,
        finite_difference_crps_levels: int = 1,
-        almost_fair_crps_alpha: float = 0.95,
+        almost_fair_crps_alpha: float = 1.0,


If we do decide this is the better approach, I lean toward making this "opt-in" and adding it to the baseline configs, rather than changing the pre-existing default behavior.

My hesitance with that option is that in the past when Troy and I have had that scenario (we have a new config set we agree to use, and we update the baseline configs) we invariably forget to add it to several experiments. I think we're still missing affine_norms: true in a lot of our experimental configs.

Arcomano1234 · 2026-05-07T19:16:15Z

It would be nice to just launch a quick experiment to verify this doesn't lead to any noticeable degradation.

mcgibbon · 2026-05-08T16:25:19Z

It would be nice to just launch a quick experiment to verify this doesn't lead to any noticeable degradation.

Yeah we can hold off for this - it's a one-line PR that isn't likely to develop merge conflicts.

mcgibbon · 2026-05-26T19:17:50Z

Ran benchmark on 4-degree daily era5-only no-co2 training:

alpha=0.95: https://wandb.ai/ai2cm/ace/runs/injiirnf
alpha=1.0: https://wandb.ai/ai2cm/ace/runs/ozsyoxtz

The runs show slightly lower CRPS across many variables while having slightly higher RMSE, and SSRs are slightly higher (which is good as most variables generally go low-biased on signal later in training.

A stronger sign that this is a good change is that long-inference power spectral biases are improved for many variables, despite this not being directly optimized by the change:

… default of 1.0 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Arcomano1234

Yeah I tend to agree that the "best" values for these types of parameters should be the default because we will inevitably be copying and pasting old configs and forget to explicitly set this. It also seems to not hurt most metrics and in some cases as Jeremy pointed out it improves them (marginally).

mcgibbon and others added 5 commits May 7, 2026 17:02

Add almost_fair_crps_alpha parameter to EnsembleLoss

4a2be7f

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Use ceil_mode in avg_pool2d to include partial edge windows

a7006b1

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Default almost_fair_crps_alpha to 0.95

0e5754e

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Change default almost_fair_crps_alpha from 0.95 to 1.0

7b4dfdb

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

jpdunc23 reviewed May 7, 2026

View reviewed changes

jpdunc23 mentioned this pull request May 7, 2026

Add finite difference CRPS loss for horizontal stochastic structure #1138

Merged

2 tasks

Base automatically changed from feature/finite-difference-crps to main May 11, 2026 14:36

Merge origin/main, resolving conflicts to keep almost_fair_crps_alpha…

cda61a9

… default of 1.0 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Arcomano1234 approved these changes May 26, 2026

View reviewed changes

Merge branch 'main' into feature/fair-crps-default

28333ca

mcgibbon enabled auto-merge (squash) May 27, 2026 17:26

mcgibbon merged commit ddcb9d5 into main May 27, 2026
7 checks passed

mcgibbon deleted the feature/fair-crps-default branch May 27, 2026 17:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change default almost_fair_crps_alpha from 0.95 to 1.0#1139

Change default almost_fair_crps_alpha from 0.95 to 1.0#1139
mcgibbon merged 7 commits into
mainfrom
feature/fair-crps-default

mcgibbon commented May 7, 2026 •

edited

Loading

Uh oh!

mcgibbon commented May 7, 2026

Uh oh!

jpdunc23 May 7, 2026

Uh oh!

mcgibbon May 8, 2026

Uh oh!

Arcomano1234 commented May 7, 2026

Uh oh!

mcgibbon commented May 8, 2026

Uh oh!

mcgibbon commented May 26, 2026

Uh oh!

Arcomano1234 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mcgibbon commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mcgibbon commented May 7, 2026

Uh oh!

jpdunc23 May 7, 2026

Choose a reason for hiding this comment

Uh oh!

mcgibbon May 8, 2026

Choose a reason for hiding this comment

Uh oh!

Arcomano1234 commented May 7, 2026

Uh oh!

mcgibbon commented May 8, 2026

Uh oh!

mcgibbon commented May 26, 2026

Uh oh!

Arcomano1234 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mcgibbon commented May 7, 2026 •

edited

Loading