Use independent seeds in makeSizedByteStrings by zeme-wana · Pull Request #7735 · IntersectMBO/plutus

zeme-wana · 2026-04-23T10:11:17Z

Summary

Fixes the `-- FIXME: this is terrible` comment in `plutus-core/cost-model/budgeting-bench/Generators.hs`.

`makeSizedByteStrings` passed the same `H.Seed` to every call of `makeSizedByteString`, so every generated ByteString was drawn from the same deterministic byte sequence: smaller ones were literal prefixes of larger ones. This produced correlated inputs for benchmarks that depend on byte content (equality, hashing, string ops).

`makeSizedByteStrings` used the same `H.Seed` for every element, so each generated ByteString was a prefix of the same deterministic byte sequence. Use `unfoldr (Just . Seed.split)` to produce a stream of independent SplitMix seeds instead, giving uncorrelated content across sizes.
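
The prefix behaviour and the fix can be sketched in a toy model. The block below is an illustration only, not the code in `Generators.hs`: `Seed`, `mix64`, `nextByte`, `split`, `sizedBytes`, `sharedSeed`, `independentSeeds`, and `prefixChain` are hypothetical stand-ins written against `base` alone so the example runs without Hedgehog, and the real `H.Seed` and `Seed.split` differ in detail.

```haskell
import Data.Bits (shiftR, xor, (.|.))
import Data.List (isPrefixOf, unfoldr)
import Data.Word (Word64, Word8)

-- Toy splittable seed (an assumption for this sketch, not Hedgehog's type):
-- a state value plus an odd increment.
data Seed = Seed Word64 Word64
  deriving (Eq, Show)

-- Simple 64-bit mixer used to scramble the state.
mix64 :: Word64 -> Word64
mix64 z0 =
  let z1 = (z0 `xor` (z0 `shiftR` 33)) * 0xff51afd7ed558ccd
      z2 = (z1 `xor` (z1 `shiftR` 33)) * 0xc4ceb9fe1a85ec53
   in z2 `xor` (z2 `shiftR` 33)

-- Advance the seed and produce one pseudo-random byte.
nextByte :: Seed -> (Word8, Seed)
nextByte (Seed v g) =
  let v' = v + g
   in (fromIntegral (mix64 v' `shiftR` 56), Seed v' g)

-- Split one seed into the advanced original and a freshly derived seed.
split :: Seed -> (Seed, Seed)
split (Seed v g) =
  let v1 = v + g
      v2 = v1 + g
   in (Seed v2 g, Seed (mix64 v1) (mix64 v2 .|. 1))

-- Hypothetical stand-in for makeSizedByteString: n bytes from one seed.
sizedBytes :: Seed -> Int -> [Word8]
sizedBytes seed n = take n (unfoldr (Just . nextByte) seed)

-- Old behaviour: every size reuses the same seed.
sharedSeed :: Seed -> [Int] -> [[Word8]]
sharedSeed seed = fmap (sizedBytes seed)

-- New behaviour: each size gets its own seed from an infinite stream.
independentSeeds :: Seed -> [Int] -> [[Word8]]
independentSeeds seed = zipWith sizedBytes (unfoldr (Just . split) seed)

-- True when each list is a prefix of the one that follows it.
prefixChain :: Eq a => [[a]] -> Bool
prefixChain xs = and (zipWith isPrefixOf xs (drop 1 xs))

main :: IO ()
main = do
  let root = Seed 42 0x9e3779b97f4a7c15
      sizes = [2, 4, 8]
  print (sharedSeed root sizes)
  print (prefixChain (sharedSeed root sizes)) -- True by construction
  print (independentSeeds root sizes)
  print (prefixChain (independentSeeds root sizes))
```

The first check holds by construction, since `take m` of a stream is always a prefix of `take n` of the same stream when `m <= n`. The second check is where the two approaches are expected to differ, because each size then draws from a different seed.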

zeme-wana · 2026-04-23T10:14:59Z

/benchmark nofib

github-actions · 2026-04-23T11:25:35Z

Click here to check the status of your benchmark.

zliu41

Use independent seeds

Looks reasonable, but are you sure this is the reason for the "FIXME: this is terrible"? Who added the FIXME?

zliu41 · 2026-04-23T11:54:12Z

> /benchmark nofib

This has nothing to do with nofib. It updates the costing benchmark.

github-actions · 2026-04-23T12:28:20Z

Comparing benchmark results of 'nofib' on '0468c1c57d' (base) and '5a6c9917d0' (PR)

Results table

| Script | `0468c1c` | `5a6c991` | Change |
|---|---|---|---|
| clausify/formula1 | 2.029 ms | 2.042 ms | +0.6% |
| clausify/formula2 | 2.738 ms | 2.759 ms | +0.8% |
| clausify/formula3 | 7.494 ms | 7.544 ms | +0.7% |
| clausify/formula4 | 16.46 ms | 16.56 ms | +0.6% |
| clausify/formula5 | 36.38 ms | 36.55 ms | +0.5% |
| knights/4x4 | 12.06 ms | 12.14 ms | +0.7% |
| knights/6x6 | 30.13 ms | 30.38 ms | +0.8% |
| knights/8x8 | 52.74 ms | 53.15 ms | +0.8% |
| primetest/05digits | 5.752 ms | 5.793 ms | +0.7% |
| primetest/10digits | 11.39 ms | 11.47 ms | +0.7% |
| primetest/30digits | 33.03 ms | 33.43 ms | +1.2% |
| primetest/50digits | 53.91 ms | 54.76 ms | +1.6% |
| queens4x4/bt | 3.569 ms | 3.590 ms | +0.6% |
| queens4x4/bm | 4.668 ms | 4.700 ms | +0.7% |
| queens4x4/bjbt1 | 4.340 ms | 4.372 ms | +0.7% |
| queens4x4/bjbt2 | 4.067 ms | 4.092 ms | +0.6% |
| queens4x4/fc | 9.196 ms | 9.239 ms | +0.5% |
| queens5x5/bt | 48.81 ms | 48.96 ms | +0.3% |
| queens5x5/bm | 54.42 ms | 54.49 ms | +0.1% |
| queens5x5/bjbt1 | 57.86 ms | 58.05 ms | +0.3% |
| queens5x5/bjbt2 | 55.90 ms | 56.07 ms | +0.3% |
| queens5x5/fc | 115.8 ms | 116.5 ms | +0.6% |

| | `0468c1c` | `5a6c991` | Change |
|---|---|---|---|
| TOTAL | 622.7 ms | 626.6 ms | +0.6% |

Unisay

Two suggestions, neither blocking:

Could `seeds` be a top-level binding? `unfoldr (Just . Seed.split)` is enough of a head-scratcher that I'd give it a name, and the same stream is useful for the next sized-list helper:
```
-- | Infinite stream of independent seeds derived from a root seed.
splitSeeds :: H.Seed -> [H.Seed]
splitSeeds = unfoldr (Just . Seed.split)
```

The same bug is in `makeSizedTextStrings` (line 108) and `makeSizedUtf8ByteStrings` (line 118). Both `fmap` the same seed across all sizes, so string and `decodeUtf8` benchmarks get correlated inputs too. With `splitSeeds` extracted, each fix is one line:

```
makeSizedTextStrings :: H.Seed -> [Integer] -> [Text]
makeSizedTextStrings seed sizes =
  zipWith makeSizedTextString (splitSeeds seed) (fmap fromInteger sizes)

makeSizedUtf8ByteStrings :: H.Seed -> [Integer] -> [ByteString]
makeSizedUtf8ByteStrings seed sizes =
  zipWith makeSizedUtf8ByteString (splitSeeds seed) (fmap fromInteger sizes)
```

If you'd rather keep this PR scoped, I'm happy to do these in a follow-up.

`makeSizedTextStrings` and `makeSizedUtf8ByteStrings` had the same bug as `makeSizedByteStrings`: they fmap'd a single seed across all sizes, producing correlated string and decodeUtf8 benchmark inputs. Extract the seed stream into a named top-level binding `splitSeeds` and reuse it across all three sized-list helpers.

kwxm · 2026-05-12T12:42:43Z

> Looks reasonable, but are you sure this is the reason for the "FIXME: this is terrible"? Who added the FIXME?

That was me. It was a quick implementation and I meant to go back and improve it some time. This seems to do the job though.

kwxm · 2026-05-12T13:36:56Z

> If you'd rather keep this PR scoped, I'm happy to do these in a follow-up.

@Unisay Please do that!

kwxm

Sorry for the long delay in reviewing this! It seems to do what's required. I don't think the original version was actually problematic, but it certainly wasn't ideal, and this should make it less easy to get things wrong in the future. I ran costing benchmarks for some of the bytestring functions with both this branch and the version in master and the results weren't significantly different.

The generators for the costing benchmarks have grown over time and they're a bit repetitive and unsystematic. There's a long-standing issue to rework them all and we should do that when we get the chance. This improves things for the time being though.

zeme-wana added the No Changelog Required label Apr 23, 2026

zliu41 reviewed Apr 23, 2026


Unisay reviewed Apr 27, 2026


kwxm self-requested a review May 7, 2026 13:44

kwxm approved these changes May 12, 2026


zeme-wana merged commit 59541d1 into master May 12, 2026
12 checks passed

zeme-wana deleted the affectionate-kowalevski-0a435e branch May 12, 2026 16:00

zeme-wana merged 2 commits into master from affectionate-kowalevski-0a435e
