Support local ephemeral nvme disks by djeebus · Pull Request #1411 · e2b-dev/infra

djeebus · 2025-10-27T23:13:53Z

Local benchmarks indicate that this drop base sandbox latency ~20-30ms

Downsides:

Live migrations will probably get slower for longer.

Note

Replaces PD cache disks with configurable local NVMe SSDs for client/build nodes, provisioning them via startup script and wiring new disk-count variables through Terraform.

Compute templates (client/build):
- Replace PD cache disks with dynamic local-ssd NVMe scratch disks (375GB) using dynamic "disk" and range(var.*_cluster_cache_disk_count).
- Set boot disk auto_delete on client; keep root disk sizing via build_cluster_root_disk_size_gb.
Startup script (scripts/start-client.sh):
- Partition local NVMe disks and assemble RAID0 (mdadm) when multiple; persist config.
- Format XFS, add to /etc/fstab, mount at /orchestrator; create sandbox, template, build dirs.
- Add 100G swap, persist swap and sysctl tuning; remove sudo usage; minor robustness (retry loops).
- Template var LOCAL_CACHE_DISK_COUNT passed from Terraform.
Terraform variables/wiring:
- Add build_cluster_cache_disk_count and client_cluster_cache_disk_count (with validation) in root and module; pass through in main.tf.
- Remove cache disk size/type vars from nomad-cluster module (PD-based settings).

^{Written by Cursor Bugbot for commit 4e008cd. This will update automatically on new commits. Configure here.}

ValentaTomas · 2025-10-27T23:15:41Z

  }

  disk {
+    auto_delete  = true


Do we want this?

No reason to keep a boot or cache disk around after we delete the VM instance, right?

Maybe in some strange debugging circumstances, but agree here.
Does this mean we are accumulating the disk somewhere right now?

No, turns out auto_delete is set to true by default. I can remove this if we don't want it be clear.

ValentaTomas · 2025-10-27T23:16:10Z

+echo "persisting array configuration"
+sudo mdadm --detail --scan --verbose | sudo tee -a /etc/mdadm/mdadm.conf
+%{ else }
 DISK="/dev/disk/by-id/google-persistent-disk-1"


Is this relevant anymore?

I wasn't sure if we want to commit to this, or a/b test this in production. I can assume we'll commit and remove the old stuff.

ValentaTomas · 2025-10-27T23:17:46Z

Will the local cache cleaner in orchestrator work ok with the RAID?

djeebus · 2025-10-27T23:23:04Z

Will the local cache cleaner in orchestrator work ok with the RAID?

I'll double check, but assuming it's just looking at files and folders, shouldn't be a problem.

djeebus · 2025-10-28T22:17:28Z

Yup, unix.Statfs works regardless of how the path is mounted.

Support local ephemeral nvme disks

d24357a

djeebus requested review from ValentaTomas, dobrac and jakubno as code owners October 27, 2025 23:13

e2b-request-same-site-reviewers Bot assigned ValentaTomas Oct 27, 2025

This comment was marked as outdated.

Sign in to view

ValentaTomas reviewed Oct 27, 2025

View reviewed changes

Comment thread iac/provider-gcp/nomad-cluster/nodepool-client.tf Outdated

ValentaTomas reviewed Oct 27, 2025

View reviewed changes

append, don't overwrite

e137aad

This comment was marked as outdated.

Sign in to view

remove support for persistent disks

4e008cd

ValentaTomas approved these changes Oct 29, 2025

View reviewed changes

djeebus merged commit 8f2c2de into main Oct 30, 2025
27 checks passed

djeebus deleted the support-local-ephemeral-nvme-disks branch October 30, 2025 22:27

ValentaTomas pushed a commit that referenced this pull request May 4, 2026

Support local ephemeral nvme disks (#1411)

25506b9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support local ephemeral nvme disks#1411

Support local ephemeral nvme disks#1411
djeebus merged 3 commits into
mainfrom
support-local-ephemeral-nvme-disks

djeebus commented Oct 27, 2025 •

edited by cursor Bot

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

ValentaTomas Oct 27, 2025

Uh oh!

djeebus Oct 27, 2025 •

edited

Loading

Uh oh!

ValentaTomas Oct 28, 2025

Uh oh!

djeebus Oct 28, 2025

Uh oh!

ValentaTomas Oct 27, 2025

Uh oh!

djeebus Oct 27, 2025

Uh oh!

ValentaTomas commented Oct 27, 2025

Uh oh!

djeebus commented Oct 27, 2025

Uh oh!

This comment was marked as outdated.

Uh oh!

djeebus commented Oct 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

djeebus commented Oct 27, 2025 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

Uh oh!

ValentaTomas Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

djeebus Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ValentaTomas Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

djeebus Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

ValentaTomas Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

djeebus Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

ValentaTomas commented Oct 27, 2025

Uh oh!

djeebus commented Oct 27, 2025

Uh oh!

This comment was marked as outdated.

Uh oh!

djeebus commented Oct 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

djeebus commented Oct 27, 2025 •

edited by cursor Bot

Loading

djeebus Oct 27, 2025 •

edited

Loading