Skip to content

Add ROCm detection and rocm variant support#214

Open
leonardopereira10 wants to merge 1 commit into
open-webui:mainfrom
leonardopereira10:main
Open

Add ROCm detection and rocm variant support#214
leonardopereira10 wants to merge 1 commit into
open-webui:mainfrom
leonardopereira10:main

Conversation

@leonardopereira10
Copy link
Copy Markdown

@leonardopereira10 leonardopereira10 commented May 25, 2026

#208 Detect AMD/ROCm GPUs and expose a rocm variant. On Linux, detection checks for /opt/rocm or /usr/lib/rocm. On Windows, dxdiag is invoked and parsed for Adapter Family gfx identifiers (supported gfx list added) to identify AMD GPUs; temporary dxdiag_output.txt is cleaned up and protected with try/catch. Also added a Windows rocm asset name in getAssetPattern and added the 'rocm' option to the inference runtime UI.

Description

I included the autodetect implementation, but I should at least keep the option to choose ROCm as the backend, as it offers better stability in contexts with more than 20,000 tokens.

Autodetect is based on a parsing of Windows DxDiag to identify the supported GPU for ROCm (as per the list provided in the HIP documentation).

The download and extraction of HIP versions from llama.cpp has been adjusted by adding the item to the dictionary.

An option has been added to the backend configuration screen to allow selecting llama.cpp HIP/ROCm.

Related Issues

Closes #208


Contributor License Agreement

Note

Deleting the CLA section will lead to immediate closure of your PR and it will not be merged in.

open-webui#208  Detect AMD/ROCm GPUs and expose a rocm variant. On Linux, detection checks for /opt/rocm or /usr/lib/rocm. On Windows, dxdiag is invoked and parsed for Adapter Family gfx identifiers (supported gfx list added) to identify AMD GPUs; temporary dxdiag_output.txt is cleaned up and protected with try/catch. Also added a Windows rocm asset name in getAssetPattern and added the 'rocm' option to the inference runtime UI.
@leonardopereira10
Copy link
Copy Markdown
Author

I included the autodetect implementation, but I should at least keep the option to choose ROCm as the backend, as it offers better stability in contexts with more than 20,000 tokens.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add AMD HIP support on windows

1 participant