Naive 1.0 prompts + calibration by bkorycki · Pull Request #1469 · mlcommons/modelbench

bkorycki · 2026-01-26T22:48:44Z

This PR changes the "naive" test+hazard+benchmark to use the new 1.0 naive prompts.

The naive prompts are 10% of the general holdback set. This subset isn’t used anywhere else, so we created a new file for it, which I added to SECURITY_NAIVE_PROMPT_SETS.
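For readers unfamiliar with this kind of split, here is a minimal sketch of carving a reproducible 10% subset out of a larger prompt pool. It is an illustration only: the PR does not say how the subset was selected, so the random sampling, the fixed seed, and the name `carve_out_subset` are all assumptions rather than modelbench code.

```python
import random


def carve_out_subset(prompt_ids, fraction=0.10, seed=0):
    """Hypothetical helper: split prompt_ids into (subset, remainder).

    The subset is a reproducible random sample of roughly `fraction`
    of the prompts; the remainder is everything else, so the two
    lists never overlap.
    """
    if not 0.0 < fraction < 1.0:
        raise ValueError("fraction must be strictly between 0 and 1")
    ids = sorted(prompt_ids)  # sort so the result does not depend on input order
    rng = random.Random(seed)  # local RNG, leaves global random state untouched
    k = round(len(ids) * fraction)
    chosen = set(rng.sample(ids, k))
    remainder = [p for p in ids if p not in chosen]
    return sorted(chosen), remainder


if __name__ == "__main__":
    holdback = [f"prompt-{i:04d}" for i in range(200)]  # made-up IDs
    naive, rest = carve_out_subset(holdback)
    print(len(naive), len(rest))
```

Keeping the subset disjoint from the remainder is what would make the "isn’t used anywhere else" property hold under this assumed approach.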

Important note: security benchmarks are now meaningless because we don't have 1.0 attack prompts yet. If you ran a security benchmark, it would combine 1.0 naive prompts with 0.5 jailbreaks. I changed the benchmark version to "0.0" in case someone runs it anyway (via baas, or one of us by accident), but maybe we should disable the security benchmark CLI entirely?
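The version mismatch described above can be made concrete with a small sketch. This is not what the PR does (the PR sets the version to "0.0" by hand); it is a hypothetical alternative in which the function name `effective_benchmark_version` and the dictionary layout are assumptions for illustration.

```python
def effective_benchmark_version(prompt_set_versions, placeholder="0.0"):
    """Hypothetical check: return the shared version when every prompt
    set agrees, otherwise a placeholder marking the result as not
    meaningful."""
    versions = set(prompt_set_versions.values())
    if len(versions) == 1:
        return versions.pop()
    return placeholder


if __name__ == "__main__":
    # The situation in this PR: 1.0 naive prompts mixed with 0.5 jailbreaks.
    print(effective_benchmark_version({"naive": "1.0", "jailbreak": "0.5"}))
```

With mixed inputs such as these, the sketch falls back to the placeholder, which mirrors the intent of labelling the security benchmark "0.0".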

github-actions · 2026-01-26T22:48:54Z

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

wpietri

Seems like a reasonable response to a weird situation.

bkorycki added 3 commits January 26, 2026 13:40

modelgauge naive test is now 1.0

6a5a6a8

update modelbench code

c9a6394

calibrate

6781c1c

bkorycki requested review from rogthefrog and wpietri January 26, 2026 22:48

bkorycki requested a review from a team as a code owner January 26, 2026 22:48

bkorycki temporarily deployed to Scheduled Testing with GitHub Actions January 26, 2026 22:48 (inactive)

rogthefrog approved these changes Jan 26, 2026

wpietri approved these changes Jan 26, 2026

bkorycki merged commit ce52d5d into main Jan 27, 2026
2 checks passed

bkorycki deleted the naive-1.0 branch January 27, 2026 00:16

github-actions Bot locked and limited conversation to collaborators Jan 27, 2026
