1.5.x cloud benchmarks by kmatasfp · Pull Request #3615 · golemcloud/golem

kmatasfp · 2026-06-05T00:18:48Z

Also contains a replacement for the memory semaphore and a fix for an eviction deadlock.

For the memory semaphore replacement, see:

golem-worker-executor/src/services/active_workers/admission/mod.rs
golem-worker-executor/src/services/active_workers/component_charge/mod.rs
golem-worker-executor/src/services/active_workers/memory_probe.rs


kill_all() is called after cloud_preflight_warmup completes. ProvidedShardManager wraps an already-running process we don't own, so neither kill nor restart should crash the binary. Both are now silent no-ops, matching UnavailableShardManager.
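
The no-op behaviour described above can be sketched roughly as follows. This is only an illustration: the `ShardManager` trait, its method names, `SpawnedShardManager`, and the `endpoint` field are assumptions made for the example, not the harness's actual interface.

```rust
/// Hypothetical trait for this sketch; the real interface may differ.
pub trait ShardManager {
    fn kill(&mut self);
    fn restart(&mut self);
    fn is_running(&self) -> bool;
}

/// A shard manager process that the benchmark binary started itself (hypothetical).
pub struct SpawnedShardManager {
    pub running: bool,
}

impl ShardManager for SpawnedShardManager {
    fn kill(&mut self) {
        self.running = false;
    }
    fn restart(&mut self) {
        self.running = true;
    }
    fn is_running(&self) -> bool {
        self.running
    }
}

/// Wraps an already-running process that the binary does not own.
pub struct ProvidedShardManager {
    pub endpoint: String, // assumed field, for illustration only
}

impl ShardManager for ProvidedShardManager {
    // We do not own the process, so kill and restart are silent no-ops
    // instead of panicking.
    fn kill(&mut self) {}
    fn restart(&mut self) {}
    fn is_running(&self) -> bool {
        // Assumption for the sketch: the external process is taken to be up.
        true
    }
}

/// Kills every manager; safe to call regardless of who owns the process.
pub fn kill_all(managers: &mut [Box<dyn ShardManager>]) {
    for manager in managers.iter_mut() {
        manager.kill();
    }
}
```

With this shape, `kill_all` can run unconditionally after warmup: an owned process is stopped, while a provided one is left untouched.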


kmatasfp · 2026-06-27T09:12:24Z


    assert_eq!(result1.len(), 2, "G1002"); // TODO: this is temporarily not working because of using the dynamic invoke API and not having structured information in the oplog
-    assert_eq!(result2.len(), 2, "imported-function");
+    assert_eq!(result2.len(), 1, "imported-function");


This and other similar test changes are needed because we no longer call a host function to get agent metadata.

kmatasfp · 2026-06-27T09:13:53Z

+    if matches!(get_asyncness(&constructor_method.sig), Asyncness::Future) {
+        return syn::Error::new_spanned(
+            &constructor_method.sig.ident,
+            "Agent constructors must be synchronous. Async constructors can call host APIs during snapshot restore and make recovery fall back to full replay.",


Probably needs a better message here. Regardless, agent constructors should not be async IMHO: that adds more complexity than value, unless I am missing something.

vigoo · 2026-06-29

I disagree. I think that would be too limiting (no awaitable RPC, no HTTP calls, etc.), and whether a constructor performs side effects (it has to be able to!) does not actually depend on its "asyncness": in p2 all host functions are sync, and in p3 we get async ones alongside them.

But I see how this interferes with load-snapshot initializing the agent. I need to think more about it :)

One real-world example: it has been a useful pattern to spawn child agents and trigger some background operation in the constructor. During recovery, of course, this does not get triggered again because it is persisted in the oplog. But if load-snapshot calls the same constructor, it is going to re-trigger the same background operation with a fresh idempotency key (or crash if we detect it as a disallowed side effect). Both are wrong. The assumption so far was that "it's the user's responsibility", but it's probably too easy to get wrong.
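
To make the failure mode concrete, here is a toy model of the difference between oplog replay and a live constructor call during snapshot load. It is a sketch under heavy assumptions: `Host`, `spawn_background_job`, `constructor`, and `demo` are invented for illustration and do not correspond to the actual executor types.

```rust
/// Toy host: records side-effect results in an oplog and replays them on recovery.
#[derive(Default)]
pub struct Host {
    pub oplog: Vec<String>,     // persisted results of side effects
    pub replay_cursor: usize,   // next oplog entry to serve during replay
    pub replaying: bool,        // true while recovering via oplog replay
    pub triggered: Vec<String>, // idempotency keys of jobs that really ran
    next_key: u64,
}

impl Host {
    /// Side-effecting host call: starts a background job, returns its idempotency key.
    pub fn spawn_background_job(&mut self) -> String {
        if self.replaying && self.replay_cursor < self.oplog.len() {
            // Replay: serve the persisted result, do not run the job again.
            let key = self.oplog[self.replay_cursor].clone();
            self.replay_cursor += 1;
            return key;
        }
        // Live execution: a fresh key is generated and the job really starts.
        self.next_key += 1;
        let key = format!("key-{}", self.next_key);
        self.triggered.push(key.clone());
        self.oplog.push(key.clone());
        key
    }
}

/// An agent constructor that triggers a background operation.
pub fn constructor(host: &mut Host) -> String {
    host.spawn_background_job()
}

/// Returns how many jobs have really run after (replay recovery, snapshot load).
pub fn demo() -> (usize, usize) {
    let mut host = Host::default();
    constructor(&mut host); // initial creation: job runs once

    // Recovery by oplog replay: the constructor runs, but the job is not re-triggered.
    host.replaying = true;
    host.replay_cursor = 0;
    constructor(&mut host);
    let after_replay = host.triggered.len();

    // Snapshot load calling the same constructor live: the job runs again
    // with a fresh idempotency key.
    host.replaying = false;
    constructor(&mut host);
    let after_snapshot_load = host.triggered.len();

    (after_replay, after_snapshot_load)
}
```

In this model `demo()` yields one real job after replay recovery and two after the live constructor call, which is the duplicated background operation described in the comment.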

kmatasfp · 2026-06-27T09:18:30Z

 pub mod websocket;

+#[cfg(any(test, feature = "test-support"))]
+pub mod snapshot_recovery_test_support {


I could not think of a better way to do this: I need a hook inside the snapshot recovery code that records whether or not we recovered from a snapshot. Rust makes it easier than other languages to use a test spy via conditional compilation.
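
For readers unfamiliar with the pattern, a test spy gated by conditional compilation might look roughly like this. Only the `cfg` attribute and the module name come from the diff above; `RecoveryPath`, `recover`, and the counter functions are hypothetical stand-ins for the real recovery code.

```rust
/// Which path a (hypothetical) recovery took.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum RecoveryPath {
    Snapshot,
    FullReplay,
}

// The spy only exists in test builds or when the `test-support` feature is
// enabled, so regular builds carry no extra state.
#[cfg(any(test, feature = "test-support"))]
pub mod snapshot_recovery_test_support {
    use std::sync::atomic::{AtomicUsize, Ordering};

    static SNAPSHOT_RECOVERIES: AtomicUsize = AtomicUsize::new(0);

    /// Called from the recovery code each time a snapshot is used.
    pub fn record_snapshot_recovery() {
        SNAPSHOT_RECOVERIES.fetch_add(1, Ordering::SeqCst);
    }

    /// Read by tests to check that recovery really went through a snapshot.
    pub fn snapshot_recovery_count() -> usize {
        SNAPSHOT_RECOVERIES.load(Ordering::SeqCst)
    }
}

/// Hypothetical stand-in for the real recovery entry point.
pub fn recover(snapshot: Option<&[u8]>) -> RecoveryPath {
    match snapshot {
        Some(_bytes) => {
            // The hook is compiled in only when the spy module exists.
            #[cfg(any(test, feature = "test-support"))]
            snapshot_recovery_test_support::record_snapshot_recovery();
            RecoveryPath::Snapshot
        }
        None => RecoveryPath::FullReplay,
    }
}
```

A test would call `recover` and then compare `snapshot_recovery_count()` before and after; in builds without `test` or the feature, both the module and the hook call are compiled out.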

kmatasfp and others added 3 commits June 4, 2026 17:05

feat: cloud-mode TestMode::Cloud for benchmarks with best-effort cleanup (#3596)

981191f

feat: add run specific details to perf tests

341bab3

fix(benchmark): make --builtin-plugin-owner-account-id and --default-plan-id optional

b1764ec

kmatasfp requested a review from a team June 5, 2026 00:18

mschuwalow approved these changes Jun 5, 2026


kmatasfp added 24 commits June 5, 2026 10:44

feat(benchmark): enable all tests

5b9902b

feat: retry connectivity to shard manager

742a669

chore: fmt

18d5af6

investigation: run echo test first to see if they get stuck again

395bcd2

feat(benchmark): lower number of conccurent live apps

dac3c69

feat: more observability, make memory component coefficient configurable

2256623

feat(benchmark): run only throughput-echo test

02e527a

feat(bench): try 200 apps after tuning

faeb651

feat: try 250 again

f8dd565

feat(benchmark): run all the tests again

1bf0063

fix: metric description

2e53af6

feat: proper load for our cluster

32ef9e5

feat(benchmark): run only benchmark tests

bc11779

feat: enable all tests again

5347626

feat(benchmark): increase max number of concurrent compilations

9e582a2

feat(worker-executor): add measured-headroom memory admission gate

e7b44bf

feat(worker-executor): charge component module size once per resident component

817c672

fix(worker-executor): disable measured admission when executor does not own its memory environment

35874d3

feat(benchmark): add throughput-under-memory-saturation benchmarks

acb9968

test(worker-executor): exercise admission reserve under maximum concurrent overlap

bfe1b14

feat(benchmark): longer sustained load, bumpt the number of agents

c3af739

fix: add empty workspace

7dcb2d3

fix: use snake case as method names

139aed5

chore: 300 already saturates, no need for 500

442c1c5

kmatasfp added 15 commits June 23, 2026 23:13

Reduce long-lived tracing spans

ed33d0a

load only 4000 in case of oplog replay test

45f6676

load only 3000 in case of oplog replay test

73e79e0

feat: better client timeout handling

0630070

Make ephemeral archive drain async

9be5c84

Fix ephemeral cleanup order

af26c30

speed up scenario 4 warmup

2a15d83

feat: reworked scenarios

f43ba7d

feat: wait for the executor pod to be ready after restart/crash before continuing the test

2bdc851

feat: better canary agent for secanrio 4

dc936f3

feat: also restart worker-service to clear routing cache

8ffb8d3

feat: add density probe diagnostics

d16bdd2

feat: revert custom debug log

296cf08

feat: more logging

50b6bf5

degug: only log snapshot related extra telemetry

617b89c

kmatasfp force-pushed the 1.5.x-cloud-benchmarks branch 7 times, most recently from c5df097 to 2556fa3 Compare June 27, 2026 08:03

kmatasfp commented Jun 27, 2026


kmatasfp force-pushed the 1.5.x-cloud-benchmarks branch 2 times, most recently from 54ce165 to 63aff3d Compare June 30, 2026 04:00

Fix snapshot recovery

13df001

kmatasfp force-pushed the 1.5.x-cloud-benchmarks branch from 63aff3d to 13df001 Compare June 30, 2026 04:06

feat: snapshot recovery tests only

ce3e72d

kmatasfp wants to merge 141 commits into 1.5.x from 1.5.x-cloud-benchmarks

Comments: kmatasfp (Jun 5 and Jun 27, 2026), vigoo (Jun 29, 2026)

3 participants
