[MWG-1605] feat: add cube and gpu support via templates by lucasl0st · Pull Request #351 · ionos-cloud/cluster-api-provider-ionoscloud

lucasl0st · 2026-04-01T14:14:00Z

What is the purpose of this pull request/Why do we need it?

We need kubernetes nodes with GPUs.

Description of changes:

GPU servers are similar to cubes, they use templates.
Therefore I added support for templates and since CUBE is just another server type, I added support for it in addition to GPU as well.
Added an e2e test that tests with cubes because testing with GPUs requires an image with UEFI and would also just be too expensive.

Checklist:

Documentation updated
Unit Tests added
E2E Tests added
Includes emojis

lucasl0st · 2026-04-02T11:20:29Z

@jriedel-ionos I want to add an additional e2e test for cubes, could you please set the variable IONOSCLOUD_CUBE_TEMPLATE_ID to 72e73b81-8551-4e74-b398-fc63b39994af (smallest cube XS)

lucasl0st · 2026-04-02T11:32:27Z

+
+	// when using templates (cubes or gpu servers) we cannot delete the boot volume
+	// the whole server must be deleted at once
+	if !deleteVolumes && bootVolumeID != nil && (server.Properties != nil && server.Properties.TemplateUuid == nil) {


for this change here I am not 100% sure if this works as expected.
when using templates you must delete the whole server including the boot volume at once, you cannot detach or delete the boot volume by itself.

but we also dont want to delete the attached volumes from PVCs.
in testing I noticed that CAPI (or CAPIC, not sure) waits until all PVCs are detached, I could perform a node rebuild/deletion without loosing the PVC volumes. but I am not sure if this a guarantee

lpape-ionos · 2026-04-02T14:30:17Z

Note to reviewers: I have this running on our teams sandbox here: https://github.com/ionos-cloud/mwg-deployment/tree/main/projects/sandbox-cluster/capi/templates

Copilot

Pull request overview

This PR adds support for provisioning IONOS Cloud CUBE and GPU Kubernetes nodes via server templates, including new clusterctl templates, CRD/schema updates, and tests (with e2e coverage using CUBE as a cheaper proxy for GPU template behavior).

Changes:

Add templateID plus new server/disk types (CUBE/GPU, DAS) to the API types and CRDs, including validation rules.
Update server reconciliation to set template-backed server properties correctly and handle template-specific boot volume constraints.
Add new cluster templates (cube/gpu) and extend e2e coverage with a CUBE flavor test.

Reviewed changes

Copilot reviewed 16 out of 17 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
`internal/service/cloud/server.go`	Applies template-specific server/boot-volume property rules and skips boot-volume deletion for template-backed servers.
`internal/service/cloud/server_test.go`	Adds unit tests for CUBE/GPU template provisioning and template deletion behavior.
`api/v1alpha1/ionoscloudmachine_types.go`	Introduces `templateID`, new `ServerType` values (CUBE/GPU), and `DAS` disk type + validation annotations.
`api/v1alpha1/ionoscloudmachine_types_test.go`	Adds/extends validation tests for new server types and `templateID` rules.
`config/crd/bases/infrastructure.cluster.x-k8s.io_ionoscloudmachines.yaml`	Updates CRD schema/enum/validations for template-backed server types and DAS.
`config/crd/bases/infrastructure.cluster.x-k8s.io_ionoscloudmachinetemplates.yaml`	Same as above for machine templates CRD.
`templates/cluster-template-cube.yaml`	Adds clusterctl flavor template for CUBE servers using `templateID` (and DAS).
`templates/cluster-template-gpu.yaml`	Adds clusterctl flavor template for GPU servers using `templateID`.
`test/e2e/data/infrastructure-ionoscloud/cluster-template-cube.yaml`	Adds e2e cluster template for the `cube` flavor.
`test/e2e/config/ionoscloud.yaml`	Registers the new e2e template and adds `IONOSCLOUD_CUBE_TEMPLATE_ID` variable.
`test/e2e/capic_test.go`	Adds an e2e QuickStartSpec covering the `cube` flavor.
`.github/workflows/e2e.yaml`	Plumbs `IONOSCLOUD_CUBE_TEMPLATE_ID` into the e2e workflow environment.
`docs/quickstart.md`	Documents new server types and the new `cube`/`gpu` templates and variables.
`docs/custom-image.md`	Documents EFI/UEFI requirements for GPU usage and updated build guidance.
`envfile.example`	Adds example env vars for cube/gpu template IDs.
`go.mod`, `go.sum`	Bumps `github.com/ionos-cloud/sdk-go/v6` to `v6.3.6`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lucasl0st · 2026-04-02T15:20:14Z

~~TODO~~: I need to check this:

E0402 15:17:44.288900       1 controller.go:324] "Reconciler error" err=<
	error in step ReconcileIPFailover: failed to patch LAN 1: request to Cloud API has failed: 422 Unprocessable Entity {
	  "httpStatus" : 422,
	  "messages" : [ {
	    "errorCode" : "345",
	    "message" : "[(root).properties.ipFailover] NICs of a Cube instance are not allowed to be added to an IP Failover setup"
	  } ]
	}

Edit: this essentially means that cubes should not be used as control plane nodes.

For #351 I need the cube template id added to the e2e tests. For the tests to run I need to merge this to main because I am coming from a fork.

…udMachineSpec

sonarqubecloud · 2026-04-08T10:50:38Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

lucasl0st temporarily deployed to e2e April 1, 2026 14:14 — with GitHub Actions Inactive

lucasl0st force-pushed the gpus branch from 63f46e7 to 816e538 Compare April 2, 2026 09:48

lucasl0st had a problem deploying to e2e April 2, 2026 09:48 — with GitHub Actions Failure

lucasl0st force-pushed the gpus branch from 816e538 to ca234b2 Compare April 2, 2026 09:49

lucasl0st had a problem deploying to e2e April 2, 2026 09:49 — with GitHub Actions Failure

lucasl0st force-pushed the gpus branch from ca234b2 to 53b6a4b Compare April 2, 2026 09:51

lucasl0st had a problem deploying to e2e April 2, 2026 09:51 — with GitHub Actions Failure

lucasl0st force-pushed the gpus branch from 53b6a4b to 509b68c Compare April 2, 2026 09:57

lucasl0st had a problem deploying to e2e April 2, 2026 09:57 — with GitHub Actions Failure

lucasl0st changed the title ~~feat: add gpu support via templates~~ feat: add cube and gpu support via templates Apr 2, 2026

lucasl0st force-pushed the gpus branch from 509b68c to 604bd6e Compare April 2, 2026 10:12

lucasl0st had a problem deploying to e2e April 2, 2026 10:12 — with GitHub Actions Failure

lucasl0st force-pushed the gpus branch from 604bd6e to 30c579c Compare April 2, 2026 10:39

lucasl0st had a problem deploying to e2e April 2, 2026 10:39 — with GitHub Actions Failure

lucasl0st force-pushed the gpus branch from 30c579c to 8d00c93 Compare April 2, 2026 10:56

lucasl0st temporarily deployed to e2e April 2, 2026 10:57 — with GitHub Actions Inactive

lucasl0st had a problem deploying to e2e April 2, 2026 11:23 — with GitHub Actions Failure

lucasl0st commented Apr 2, 2026

View reviewed changes

lucasl0st had a problem deploying to e2e April 2, 2026 11:41 — with GitHub Actions Failure

lucasl0st force-pushed the gpus branch from 38c16ca to 58cb4a4 Compare April 2, 2026 11:49

lucasl0st had a problem deploying to e2e April 2, 2026 11:49 — with GitHub Actions Failure

lpape-ionos mentioned this pull request Apr 2, 2026

ci: add IONOSCLOUD_CUBE_TEMPLATE_ID var for e2e tests #355

Merged

lucasl0st force-pushed the gpus branch from 58cb4a4 to a45be88 Compare April 2, 2026 14:04

lucasl0st had a problem deploying to e2e April 2, 2026 14:04 — with GitHub Actions Failure

lucasl0st had a problem deploying to e2e April 2, 2026 14:23 — with GitHub Actions Failure

lpape-ionos requested a review from Copilot April 2, 2026 14:33

Copilot started reviewing on behalf of lpape-ionos April 2, 2026 14:33 View session

Copilot AI reviewed Apr 2, 2026

View reviewed changes

Comment thread api/v1alpha1/ionoscloudmachine_types.go

lucasl0st had a problem deploying to e2e April 2, 2026 15:55 — with GitHub Actions Failure

lpape-ionos added a commit that referenced this pull request Apr 7, 2026

ci: add IONOSCLOUD_CUBE_TEMPLATE_ID var for e2e tests (#355)

e3d3ef4

For #351 I need the cube template id added to the e2e tests. For the tests to run I need to merge this to main because I am coming from a fork.

lucasl0st added 8 commits April 7, 2026 12:29

feat: update ionos-cloud sdk-go to v6.3.6

0323949

feat: add cube and gpu support via templates

6eed06d

fix: move cpuFamily CEL validation from IonosCloudMachine to IonosClo…

cb33ef8

…udMachineSpec

test: add CEL validation tests for GPU and CUBE server types

968eadb

test: add unit tests for CUBE server creation and deletion

abcca87

test: add e2e test for CUBE server type

d95c783

docs: add CUBE and GPU server type documentation and templates

40594e8

fix: boot volume disk type cannot be set for GPU servers

0cc78b8

lucasl0st force-pushed the gpus branch from a66a0c7 to 7a50a78 Compare April 7, 2026 10:29

lucasl0st had a problem deploying to e2e April 7, 2026 10:29 — with GitHub Actions Failure

fix: cubes cannot be used as control planes

fe99b46

lucasl0st force-pushed the gpus branch from 7a50a78 to fe99b46 Compare April 8, 2026 08:41

lucasl0st had a problem deploying to e2e April 8, 2026 08:41 — with GitHub Actions Failure

lucasl0st marked this pull request as ready for review April 8, 2026 08:42

lucasl0st requested review from gfariasalves-ionos, jriedel-ionos, mcbenjemaa, piepmatz and wikkyk as code owners April 8, 2026 08:42

fix: do not delete volumes separately for template servers

413d702

lucasl0st temporarily deployed to e2e April 8, 2026 10:50 — with GitHub Actions Inactive

mspoeri changed the title ~~feat: add cube and gpu support via templates~~ [MWG-1605] feat: add cube and gpu support via templates Apr 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MWG-1605] feat: add cube and gpu support via templates#351

[MWG-1605] feat: add cube and gpu support via templates#351
lucasl0st wants to merge 10 commits into
ionos-cloud:mainfrom
lucasl0st:gpus

lucasl0st commented Apr 1, 2026 •

edited by lpape-ionos

Loading

Uh oh!

lucasl0st commented Apr 2, 2026

Uh oh!

lucasl0st Apr 2, 2026

Uh oh!

lpape-ionos commented Apr 2, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

lucasl0st commented Apr 2, 2026 •

edited by lpape-ionos

Loading

Uh oh!

sonarqubecloud Bot commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

lucasl0st commented Apr 1, 2026 • edited by lpape-ionos Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lucasl0st commented Apr 2, 2026

Uh oh!

lucasl0st Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

lpape-ionos commented Apr 2, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

lucasl0st commented Apr 2, 2026 • edited by lpape-ionos Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sonarqubecloud Bot commented Apr 8, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lucasl0st commented Apr 1, 2026 •

edited by lpape-ionos

Loading

lucasl0st commented Apr 2, 2026 •

edited by lpape-ionos

Loading