---
stage: Verify
group: Runner Core
info: To determine the technical writer assigned to the Stage/Group associated with this page, see <https://handbook.gitlab.com/handbook/product/ux/technical-writing/#assignments>
title: Using Graphical Processing Units (GPUs)
---

{{< details >}}

- Tier: Free, Premium, Ultimate
- Offering: GitLab.com, GitLab Self-Managed, GitLab Dedicated

{{< /details >}}

{{< history >}}

- Introduced in GitLab Runner 13.9.

{{< /history >}}

GitLab Runner supports the use of Graphical Processing Units (GPUs). The following sections describe the configuration required to enable GPUs for each executor.

## Shell executor

No runner configuration is needed.
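Because jobs run directly on the host, you can confirm that jobs will see the GPUs by checking the runner machine itself. A minimal check, assuming the NVIDIA driver is already installed on the host:

```shell
# Run on the runner host; jobs executed by the shell executor see the same devices.
nvidia-smi --list-gpus
```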

## Docker executor

Warning

If you're using Podman as the container runtime engine, GPUs are not detected. For more information, see issue 39095.

Prerequisites:

Use the `gpus` or `service_gpus` configuration option in the `[runners.docker]` section:

```toml
[runners.docker]
    gpus = "all"
    service_gpus = "all"
```
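For context, this is a sketch of how those options might sit in a complete `config.toml`. The runner name, URL, token, and image below are placeholders, not values from this documentation:

```toml
concurrent = 1

[[runners]]
  name = "gpu-docker-runner"           # hypothetical runner name
  url = "https://gitlab.example.com"   # your GitLab instance URL
  token = "RUNNER_TOKEN"               # replace with your runner authentication token
  executor = "docker"
  [runners.docker]
    image = "nvidia/cuda:12.2.0-base-ubuntu22.04"  # example image with the CUDA user-space tools
    gpus = "all"          # expose all host GPUs to build containers
    service_gpus = "all"  # expose GPUs to service containers as well
```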

## Docker Machine executor

See the documentation for the GitLab fork of Docker Machine.

## Kubernetes executor

Prerequisites:

To enable GPU support, configure the runner to request GPU resources in the pod specification. For example:

```toml
[[runners.kubernetes.pod_spec]]
  name = "gpu"
  patch = '''
    containers:
    - name: build
      resources:
        requests:
          nvidia.com/gpu: 1
        limits:
          nvidia.com/gpu: 1
  '''
  patch_type = "strategic" # <--- `strategic` patch_type
```

Adjust the GPU count in `requests` and `limits` based on your job requirements.
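As a further sketch, the patch can be combined with other `[runners.kubernetes]` settings, for example to pin GPU jobs to dedicated nodes. The namespace, image, node label, and GPU count here are assumptions for illustration, not documented defaults:

```toml
[runners.kubernetes]
  namespace = "gitlab-runner"                     # hypothetical namespace
  image = "nvidia/cuda:12.2.0-base-ubuntu22.04"   # example default job image

  # Schedule jobs only on GPU nodes; the label is an assumption about your cluster.
  [runners.kubernetes.node_selector]
    "accelerator" = "nvidia"

  [[runners.kubernetes.pod_spec]]
    name = "gpu"
    patch = '''
      containers:
      - name: build
        resources:
          requests:
            nvidia.com/gpu: 2
          limits:
            nvidia.com/gpu: 2
    '''
    patch_type = "strategic"
```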

GitLab Runner has been tested on Amazon Elastic Kubernetes Service with GPU-enabled instances.

## Validate that GPUs are enabled

You can use runners with NVIDIA GPUs. One way to confirm that a GPU is enabled for a CI job is to run `nvidia-smi` at the beginning of the script. For example:

```yaml
train:
  script:
    - nvidia-smi
```

If GPUs are enabled, the output of `nvidia-smi` displays the available devices. In the following example, a single NVIDIA Tesla P4 is enabled:

```plaintext
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.51.06    Driver Version: 450.51.06    CUDA Version: 11.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla P4            Off  | 00000000:00:04.0 Off |                    0 |
| N/A   43C    P0    22W /  75W |      0MiB /  7611MiB |      3%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
```
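To keep a record of the detected hardware, you can also save the output as a job artifact. This is only a sketch; the artifact path and the training step are illustrative:

```yaml
train:
  script:
    - nvidia-smi | tee gpu-info.txt   # record the detected GPUs alongside the job log
    - ./run-training.sh               # hypothetical training step
  artifacts:
    paths:
      - gpu-info.txt
```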

If the hardware does not have a GPU, `nvidia-smi` should fail, either because the tool is missing or because it can't communicate with the driver:

```plaintext
modprobe: ERROR: could not insert 'nvidia': No such device
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
```
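If you want the job to fail with a clearer message in that situation, you can add an explicit check before the main script. This is a sketch; the error message and job layout are illustrative:

```yaml
train:
  before_script:
    # Fail fast with an explicit message when no working NVIDIA driver is present.
    - nvidia-smi || (echo "No usable GPU detected on this runner" && exit 1)
  script:
    - nvidia-smi
```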