`async`-ify Wasmtime (and v8) execution by gefjon · Pull Request #3263 · clockworklabs/SpacetimeDB

gefjon · 2025-09-22T17:22:16Z

Description of Changes

In service of adding procedures, which will need to execute WASM code in an async environment so as to suspend execution while making HTTP requests and whatnot.

Prior to this PR, JobCores worked by spawning an OS thread for each database. These threads would then each be pinned to a specific core, and in multi-tenant deployments multiple threads could be pinned to the same core. Now, instead, we spawn one thread per available core at startup. Each of these threads runs a single-threaded Tokio executor. Each database is assigned to one of these executors, and runs tasks on it via tokio::spawn.

When we run without core pinning (usually due to having too few hardware cores), we won't spawn any additional threads or Tokio runtimes at all; instead we will run database jobs on the "global" Tokio executor. These jobs may block Tokio worker threads, which might be an issue if a very core-constrained device runs multiple databases with very long-running reducers. If this is an issue, we could in this case instead build a second Tokio runtime only for running database jobs, and let the OS scheduler figure things out like it did previously.

Previously, we implemented load-balancing among the database cores by occasionally instructing per-database threads to re-pin themselves. Now, we instead periodically send the database a new wasmtime::runtime::Handle, which they will spawn future jobs into.

Previously, it was possible for a database thread to become canceled, most likely as a result of ModuleHost::exit, after which calls would fail with NoSuchModule. Cancellation is no longer meaningful, as the database holds a Handle to a long-lived tokio::runtime::Runtime, which should always outlive the ModuleHost. I have added an AtomicBool flag to ModuleHost which is flipped by exit and checked by calls to maintain the previous functionality.

Within this PR, the jobs run on the database-execution Tokio tasks are not actually asynchronous; they will never yield. This is important because these jobs may (will) hold a transaction open, and attempting to swap execution to another job which wants a transaction on the same database would be undesirable.

Note that this may regress our multi-tenant performance / fairness: previously, in multi-tenant environments, the OS scheduler would divide the database cores' time between the per-database threads, potentially causing one high-load database to be interrupted in the middle of a reducer in order to run other databases pinned to the same core. Now, a high-load database will instead run its entire reducer to completion before any other database gets to run.

We could, in the future, change this by instructing Wasmtime to yield periodically, either via epoch interruption or fuel, both of which we're already configuring Wasmtime to track. We'd need (or at least want) to (re-)introduce a queue s.t. we only attempt to run one job for each database at a time. I have chosen not to do so within this patch because I felt the changeset was complex enough already, and we have so far not treated fairness in multi-tenant environments as a high priority.

I have also reworked our module host machinery to no longer use dynamic dispatch and trait polymorphism to manage modules and their instances, and instead introduced enum Module and enum Instance, each of which has a variant for Wasm and another for V8.

During this rewrite, I reworked AutoReplacingModuleInstance, which previously used type-erased trait generics in a way that was brittle and hard to re-use in the new async context. (Specifically, the module instance no longer lives on the job thread, rather, the database grabs the instance and sends it to the job thread, then gets it back when the job exits. This is necessary to allow the re-worked load balancing described above, as we can't have a single long-lived async task.) While refactoring, I replaced it with ModuleInstanceManager, which can now maintain multiple instances of the same module. This is not yet useful, but will become necessary with procedures, as each concurrent procedure will need its own instance. Relatedly, I changed ModuleHost::on_module_thread (used by one-off and initial subscription queries) to no longer acquire the/an instance. I discussed this with the team, and consensus was that "locking" the module instance in that path was not a useful behavior, just an artifact of the previous implementation.

I have also switched our Wasmtime configuration to set async_support(true). This causes a variety of methods, notably InstancePre::instantiate and TypedFunc::call, to panic, and requires that we instead call their _async variants. As mentioned above, I have not yet introduced any actual asynchronicity or concurrency, so these methods should never yield. Rather than .awaiting their futures, I have defined a degenerate async executor, poll_once_executor, which polls a future exactly once, failing if it does not return Poll::Ready. This means that we will panic if one of these futures returns Poll::Pending unexpectedly.

The previous trait Module had a method initial_instances. Module is now a concrete type, and I gave it this method, but it appears to be unused. This is causing lints to fail. I am unsure what, if anything, that method was for.

The previous AutoReplacingModuleInstance called create_instance on the job thread. I am unsure if this was intentional, or just an artifact of the previous implementation, where the AutoReplacingModuleInstance lived on the job thread. I have written the new ModuleInstanceManager to call create_instance on the calling thread, but it would be easy to move that call into the job executor if that behavior is desired.

API and ABI breaking changes

None user-facing

Expected complexity level and risk

4: significant rewrite of performance-sensitive fiddly concurrency code.

Note specifically in above description:

Running database jobs on the global Tokio runtime when not using core pinning.
Multi-tenant fairness issue: no longer possible to interrupt a performance-intensive database mid-reducer to run another database pinned to the same core.
Unused method module_instances.
Running create_instance on the calling thread rather than the database thread.

Testing

Will arrange for a bot test.
Determine to what extent we can run with real or synthetic multi-tenant load in a test or staging environment.

At this stage, changes are not integrated, so build is broken.

But these are never supposed to block, so use a custom "executor" which only calls `poll` once.

Centril · 2025-09-24T09:38:29Z

Could you extract b26d447 to its own PR? We can approve and merge that quickly.

gefjon · 2025-09-24T14:27:21Z

Could you extract b26d447 to its own PR? We can approve and merge that quickly.

@Centril #3284 .

This reverts commit b26d447. These changes are now in a separate branch, `phoebe/skippable-tests`.

@Centril

# Description of Changes In environments without dotnet installed, these tests were the only thing preventing `cargo test --all -- --skip csharp` from completing successfully. I also included `rust` in the name, though running tests in an environment without cargo/rustc installed seems less likely to work. Extracted from #3263 at request of @Centril . # API and ABI breaking changes N/a # Expected complexity level and risk 1 # Testing N/a

joshua-spacetime

This looks reasonable. I don't believe we'll see performance regressions, but we should definitely block merging until we've run a full scale performance test.

Personally I'd like to see us lower the threshold (currently 10) for core pinning and probably isolate db work in its own runtime when not pinning, but I also don't think it's that important as long as the performance is acceptable when we do pin.

Centril

This looks good in general. If you don't mind, this seems like it would conflict a lot with the changes in v8/mod.rs, so it would be neat if this could go after my PR.

Co-authored-by: Mazdak Farrokhzad <twingoow@gmail.com> Co-authored-by: joshua-spacetime <josh@clockworklabs.io> Signed-off-by: Phoebe Goldman <phoebe@goldman-tribe.org>

…etimeDB into phoebe/wasmtime-async

So that we can determine whether we should be running this on the database core or not

And fix typo that mis-categorized `Module::Js` as `ModuleType::Wasm`.

It's not used in private either, and no one knew what it was for, so I'm cutting it.

coolreader18

Seems pretty reasonable!

Prior to this commit, when shutting down SpacetimeDB-standalone via C-c / SIGINT, the process would print a panic message and backtrace (if enabled) due to dropping nested Tokio runtimes. This commit avoids doing so in SpacetimeDB-standalone, and adds a note to `JobCores` instructing how to avoid it in other uses.

joshua-spacetime

Perf test showed no regressions

# Description of Changes This got a little bigger than I had hoped, but I think it's still pretty manageable. This PR partially reverts back to before #3263: cores are re-balanced with a `watch::Receiver<CoreId>` that each database thread will listen for updates on in order to repin itself, and multiple OS threads (each matched to a database) can be pinned to one core. As I understand it, that second part is something Phoebe was trying to avoid, but given that there's no way to asyncify a JS module, it's kind of necessary. JS is single-threaded, and uses cooperative rather than preemptive multitasking (callbacks/async, not green threads). That means that if a JS function has an infinite loop, no other event handlers would be able to run unless that loop were to exit. Coupled with the fact that we can't `Send` a v8 isolate across threads, it makes more sense to keep the module host on one thread and repin that thread as needed. An alternative option, as was brought up, would be to deconstruct and reconstruct the module onto a different thread when needed, since load-balancing won't be happening often anyway. # Expected complexity level and risk 3 - reworks the threadpool that databases run on, and so could lead to deadlocks or other concurrency bugs. However, that seems unlikely, since this separates databases each onto their own thread, and as such decreases the likelihood of them interacting poorly with each other. # Testing Not sure if there's anything specific I should do, since this doesn't change behavior. - [ ]  --------- Co-authored-by: Mazdak Farrokhzad <twingoow@gmail.com>

gefjon added 8 commits September 12, 2025 08:42

Rewrite jobs.rs to use single-threaded Tokio runtimes

f26e20f

At this stage, changes are not integrated, so build is broken.

Thread jobs.rs changes through all of core crate.

54b7d5a

Merge remote-tracking branch 'origin/master' into phoebe/wasmtime-async

ba27106

Fix standalone main tyck

4c088f2

Clean up testing JobCores initialization

0fedef3

Wrap some tests in a mod so that I can --skip them

b26d447

Use Wasmtime async functions where necessary

e54e728

But these are never supposed to block, so use a custom "executor" which only calls `poll` once.

Fix clippy lints (except unused method I can't explain)

21830cd

gefjon marked this pull request as ready for review September 22, 2025 21:12

gefjon requested review from Centril, coolreader18 and joshua-spacetime September 22, 2025 21:12

coolreader18 reviewed Sep 22, 2025

View reviewed changes

Comment thread crates/core/src/util/poll_once_executor.rs Outdated

Add futures-util and use their now_or_never

56c89e6

gefjon requested a review from coolreader18 September 23, 2025 13:48

Centril reviewed Sep 24, 2025

View reviewed changes

Comment thread crates/core/src/host/v8/mod.rs

gefjon mentioned this pull request Sep 24, 2025

Wrap some tests in a mod so that I can --skip them #3284

Merged

Revert "Wrap some tests in a mod so that I can --skip them"

9bb6df0

This reverts commit b26d447. These changes are now in a separate branch, `phoebe/skippable-tests`.

joshua-spacetime assigned gefjon Sep 24, 2025

joshua-spacetime reviewed Sep 24, 2025

View reviewed changes

Comment thread crates/core/src/host/module_host.rs

Comment thread crates/core/src/util/jobs.rs Outdated

Comment thread crates/core/src/util/jobs.rs Outdated

Comment thread crates/core/src/host/module_host.rs

Centril reviewed Sep 25, 2025

View reviewed changes

Comment thread crates/core/src/util/jobs.rs

Comment thread crates/core/src/host/module_host.rs

Comment thread crates/core/src/host/module_host.rs Outdated

Comment thread crates/core/src/host/module_host.rs

Comment thread crates/core/src/host/module_host.rs Outdated

gefjon and others added 5 commits September 25, 2025 12:43

Fix typo'd comments revealed in PR review

c1da1b8

Co-authored-by: Mazdak Farrokhzad <twingoow@gmail.com> Co-authored-by: joshua-spacetime <josh@clockworklabs.io> Signed-off-by: Phoebe Goldman <phoebe@goldman-tribe.org>

Expand comments as requested in code review

db90e6d

Merge branch 'phoebe/wasmtime-async' of github.com:clockworklabs/Spac…

b27c69d

…etimeDB into phoebe/wasmtime-async

Merge branch 'master' into phoebe/wasmtime-async

a4f5d50

Add metric for time spent in create_instance

f1c5938

So that we can determine whether we should be running this on the database core or not

gefjon requested a review from Centril September 25, 2025 17:19

gefjon requested a review from joshua-spacetime September 25, 2025 17:19

Centril reviewed Sep 25, 2025

View reviewed changes

Comment thread crates/core/src/host/module_host.rs Outdated

Centril reviewed Sep 25, 2025

View reviewed changes

Comment thread crates/core/src/host/module_host.rs Outdated

gefjon added 2 commits September 25, 2025 14:00

Use existing HostType enum

492bd93

And fix typo that mis-categorized `Module::Js` as `ModuleType::Wasm`.

Remove unused initial_instances method

81c6366

It's not used in private either, and no one knew what it was for, so I'm cutting it.

bfops added the release-any To be landed in any release window label Sep 29, 2025

coolreader18 approved these changes Oct 1, 2025

View reviewed changes

gefjon added 5 commits October 2, 2025 12:42

Merge remote-tracking branch 'origin/master' into phoebe/wasmtime-async

9dbb4e7

Merge remote-tracking branch 'origin/master' into phoebe/wasmtime-async

438cb98

Merge remote-tracking branch 'origin/master' into phoebe/wasmtime-async

e22d3cd

Clippy

af591f8

joshua-spacetime approved these changes Oct 7, 2025

View reviewed changes

gefjon added this pull request to the merge queue Oct 8, 2025

Merged via the queue into master with commit aa29582 Oct 8, 2025
26 of 27 checks passed

coolreader18 mentioned this pull request Jan 26, 2026

Rework JobCores in order to core-pin v8 instance threads #4128

Merged

1 task

Conversation

gefjon commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of Changes

API and ABI breaking changes

Expected complexity level and risk

Testing

Uh oh!

Uh oh!

Centril commented Sep 24, 2025

Uh oh!

Uh oh!

gefjon commented Sep 24, 2025

Uh oh!

joshua-spacetime left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Centril left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coolreader18 left a comment

Choose a reason for hiding this comment

Uh oh!

joshua-spacetime left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

gefjon commented Sep 22, 2025 •

edited

Loading