Skip to content

Commit 44a50ca

Browse files
authored
readme : add AMD ROCm/HIP GPU build instructions (#3823)
Signed-off-by: Kaihui-AMD <Kaihui.Tang@amd.com>
1 parent 865ec17 commit 44a50ca

1 file changed

Lines changed: 22 additions & 0 deletions

File tree

README.md

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,7 @@ High-performance inference of [OpenAI's Whisper](https://github.com/openai/whisp
2121
- [Vulkan support](#vulkan-gpu-support)
2222
- Support for CPU-only inference
2323
- [Efficient GPU support for NVIDIA](#nvidia-gpu-support)
24+
- [AMD ROCm GPU support](#amd-rocm-gpu-support)
2425
- [OpenVINO Support](#openvino-support)
2526
- [Ascend NPU Support](#ascend-npu-support)
2627
- [Moore Threads GPU Support](#moore-threads-gpu-support)
@@ -340,6 +341,27 @@ cmake -B build -DGGML_VULKAN=1
340341
cmake --build build -j --config Release
341342
```
342343

344+
## AMD ROCm GPU support
345+
346+
With AMD GPUs the processing can be accelerated via HIP/ROCm.
347+
First, make sure you have installed [ROCm](https://rocm.docs.amd.com/en/latest/).
348+
349+
Now build `whisper.cpp` with HIP support:
350+
351+
```
352+
cmake -B build -DGGML_HIP=1 -DAMDGPU_TARGETS="gfx1201"
353+
cmake --build build -j --config Release
354+
```
355+
356+
Replace `gfx1201` with your GPU architecture. You can find it with:
357+
358+
```
359+
rocminfo | grep "gfx"
360+
```
361+
362+
Common architectures: `gfx1100` (RX 7900 XTX), `gfx1101` (RX 7800 XT), `gfx1201` (RX 9070 XT).
363+
For multiple GPUs with different architectures: `-DAMDGPU_TARGETS="gfx1100;gfx1201"`.
364+
343365
## BLAS CPU support via OpenBLAS
344366

345367
Encoder processing can be accelerated on the CPU via OpenBLAS.

0 commit comments

Comments
 (0)