@@ -21,6 +21,7 @@ High-performance inference of [OpenAI's Whisper](https://github.com/openai/whisp
 - [Vulkan support](#vulkan-gpu-support)
 - Support for CPU-only inference
 - [Efficient GPU support for NVIDIA](#nvidia-gpu-support)
+- [AMD ROCm GPU support](#amd-rocm-gpu-support)
 - [OpenVINO Support](#openvino-support)
 - [Ascend NPU Support](#ascend-npu-support)
 - [Moore Threads GPU Support](#moore-threads-gpu-support)
@@ -340,6 +341,27 @@ cmake -B build -DGGML_VULKAN=1
 cmake --build build -j --config Release
 ```
 
+## AMD ROCm GPU support
+
+With AMD GPUs the processing can be accelerated via HIP/ROCm.
+First, make sure you have installed [ROCm](https://rocm.docs.amd.com/en/latest/).
+
+Now build `whisper.cpp` with HIP support:
+
+```
+cmake -B build -DGGML_HIP=1 -DAMDGPU_TARGETS="gfx1201"
+cmake --build build -j --config Release
+```
+
+Replace `gfx1201` with your GPU architecture. You can find it with:
+
+```
+rocminfo | grep "gfx"
+```
+
+Common architectures: `gfx1100` (RX 7900 XTX), `gfx1101` (RX 7800 XT), `gfx1201` (RX 9070 XT).
+For multiple GPUs with different architectures: `-DAMDGPU_TARGETS="gfx1100;gfx1201"`.
+
 ## BLAS CPU support via OpenBLAS
 
 Encoder processing can be accelerated on the CPU via OpenBLAS.
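
The two steps in the added ROCm section (look up the architecture, then pass it to CMake) could be combined roughly as follows. This is only a sketch: `gfx_targets` is a hypothetical helper name, not part of `whisper.cpp`, and it assumes that every `gfx…` name in the `rocminfo` output belongs to a GPU you want to build for, which may not hold if an integrated GPU is also listed.

```shell
# Hypothetical helper (not part of whisper.cpp): read rocminfo-style text
# on stdin and print the unique gfx architecture names, joined with ';'
# in the form the diff passes to -DAMDGPU_TARGETS.
gfx_targets() {
    grep -oE 'gfx[0-9a-f]+' | sort -u | paste -sd ';' -
}

# Usage, assuming ROCm's rocminfo is on PATH:
#   targets="$(rocminfo | gfx_targets)"
#   cmake -B build -DGGML_HIP=1 -DAMDGPU_TARGETS="$targets"
#   cmake --build build -j --config Release
```

Deduplicating with `sort -u` matters because `rocminfo` typically reports the same architecture more than once per device. It is worth checking the resulting list by eye before building, since each extra target lengthens the compile.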