Using the NVIDIA GPU Operator

You can use the NVIDIA GPU Operator with {VirtProductName} to accelerate the deployment of worker nodes for running GPU-enabled virtual machines (VMs). The NVIDIA GPU Operator manages NVIDIA GPU resources in an {product-title} cluster and automates tasks when preparing nodes for GPU workloads.

The NVIDIA GPU Operator can also facilitate provisioning complex artificial intelligence and machine learning (AI/ML) workloads.

Procedure

Configure your ClusterPolicy manifest. Your ClusterPolicy manifest must match the provided example:

apiVersion: nvidia.com/v1
kind: ClusterPolicy
metadata:
  name: gpu-cluster-policy
spec:
  daemonsets:
    updateStrategy: RollingUpdate
  dcgm:
    enabled: true
  dcgmExporter: {}
  devicePlugin: {}
  driver:
    enabled: false
    kernelModuleType: auto
  gfd: {}
  mig:
    strategy: single
  migManager:
    enabled: true
  nodeStatusExporter:
    enabled: true
  operator:
    defaultRuntime: crio
    initContainer: {}
    runtimeClass: nvidia
    use_ocp_driver_toolkit: true
  sandboxDevicePlugin:
    enabled: true
  sandboxWorkloads:
    defaultWorkload: vm-vgpu
    enabled: true
  toolkit:
    enabled: true
    installDir: /usr/local/nvidia
  validator:
    plugin:
      env:
      - name: WITH_WORKLOAD
        value: "true"
  vfioManager:
    enabled: true
  vgpuDeviceManager:
    config:
      default: default
      name: vgpu-devices-config
    enabled: true
  vgpuManager:
    enabled: true
    image: <vgpu_image_name>
    repository: <vgpu_container_registry>
    version: <nvidia_vgpu_manager_version>

where:

<vgpu_image_name>: Specifies the vGPU image name.
<vgpu_container_registry>: Specifies the vGPU container registry value.
<nvidia_vgpu_manager_version>: Specifies the version of the vGPU driver you have downloaded from the NVIDIA website and used to build the image.

Use the NVIDIA GPU Operator to configure mediated devices. For more information see NVIDIA GPU Operator with OpenShift Virtualization.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using the NVIDIA GPU Operator

FilesExpand file tree

about-using-gpu-operator.adoc

Latest commit

History

about-using-gpu-operator.adoc

File metadata and controls

Using the NVIDIA GPU Operator