#

rdna3

Here are 29 public repositories matching this topic...

zinc

zolotukhin / zinc

Zig INferenCe Engine — Local LLM inference on AMD GPUs and Apple Silicon

amd gpu zig pytorch transformer openai gpt amdgpu rdna3 rdna4 qwen3

Updated Jun 7, 2026
Zig

patientx-cfz / comfyui-rocm

Windows-only version of ComfyUI which uses AMD's official ROCm and PyTorch libraries to get better performance with AMD GPUs. [auto-installation and popular performance enhancing packages like triton * sage-attention * flash-attention * bitsandbytes included ]

windows triton rdna rocm miopen bitsandbytes flash-attention rdna3 rdna2 rdna4 sage-attention rdna1

Updated Jun 7, 2026
Python

0ptiscaler4linux

ind4skylivey / 0ptiscaler4linux

The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.

vulkan proton linux-tools shell-scripting upscaling mesa gaming-performance gpu-optimization dlss linux-gaming xess steam-deck rdna3 frame-generation rdna4 optiscaler fsr4 amd-fsr

Updated Jan 27, 2026
Shell

GPUOpen-Tools / isa_spec_manager

Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.

amd gpu specification isa rdna cdna mi300 rdna3 rdna2 cdna3 cdna2 rdna4

Updated Apr 9, 2026
C++

Mateusz-Dera / ROCm-AI-Installer

Installation script for an AI applications using ROCm on Linux.

audio music linux video ai debian amd voice tts image-generation 3d amdgpu rocm radeon voice-generation comfyui rdna3

Updated May 31, 2026
Shell

CalebisGross / TRELLIS-AMD

TRELLIS (Microsoft's Image-to-3D generator) running on AMD GPUs with ROCm. Includes Gaussian splatting, mesh extraction, and GLB export. Tested on RX 7800 XT.

trellis hip rocm 3d-generation amd-gpu image-to-3d amd-gpus gaussian-splatting rdna3 mesh-extraction nvdiffrast

Updated May 9, 2026
Jupyter Notebook

Dev-next-gen / diffusers-rocm-parallel

Multi-GPU tensor/context parallel diffusion on AMD ROCm — with the patch that makes it actually work.

flux amd pytorch text-to-image multi-gpu rocm tensor-parallelism diffusers rdna3 ring-attention

Updated Apr 19, 2026
Python

rocm-7700xt-pytorch

thejeangenie18 / rocm-7700xt-pytorch

Working ROCm 6.1 + PyTorch environment for RDNA3 with QLoRA training. Built after fighting pipeline rats and wondering, “Lisa Su, girl… what are they doing down there.”

machine-learning ai deep-learning amd transformers pytorch gpu-computing lora rocm radeon fine-tuning amd-gpu huggingface local-inference pytorch-rocm local-llm qlora rdna3 gfx1101

Updated May 30, 2026
Python

Dev-next-gen / flux-amd-rocm

FLUX.1-dev on AMD Radeon consumer GPUs — fast, low-VRAM, and shippable. Backport patches + benchmarks for torchao + diffusers group_offload on ROCm.

flux amd pytorch text-to-image rocm int8 diffusers rdna3 torchao group-offload

Updated Apr 19, 2026
Python

hec-ovi / rocm-strix-docker

Docker infrastructure for AMD Strix Halo (RDNA 3.5 / gfx1151): PyTorch + ROCm base container and a separate Ollama LLM service. Two folders, two Compose files, one Strix Halo box.

docker ubuntu docker-compose amd gpu self-hosted pytorch rocm llm ollama rdna3 ryzen-ai strix-halo gfx1151

Updated Apr 26, 2026
Shell

GetNyrex / strix-halo-guide

Unlock fast, local LLM inference on AMD-powered mini PCs delivering 65-87 t/s for large models without cloud or subscription costs

amd optimization inference rocm mini-pc asus-rog linux-gaming unified-memory beelink cachyos llm llama-cpp local-llm ollama gguf rdna3 strix-halo gfx1151

Updated Jun 7, 2026
Shell

Peterc3-dev / pytorch-gfx1150

PyTorch built from source for AMD RDNA 3.5 (gfx1150) — Radeon 890M/880M GPU acceleration

pytorch gpu-acceleration rocm amd-gpu build-from-source rdna3 gfx1150 radeon-890m

Updated Mar 30, 2026
Shell

uulong950 / qingming-flat

High-performance exact vector search on NVIDIA & AMD GPUs. 9354 QPS (RTX 5090D), 6275 QPS (RX 7900 XTX). Open-source, deterministic, and recall=1.0 — enabling Trustworthy AI at the edge.

cuda brute-force gpu-acceleration cuda-kernels hip amdgpu explainable-ai vector-search edge-ai vector-search-engine trustworthy-ai exact-searching rdna3 secure-ai-practices deterministic-ai rocml ann-benchmarks

Updated Jan 22, 2026
C++

M64GitHub / whisper-rocm

Local speech-to-text with AMD ROCm GPU acceleration

linux privacy self-hosted pytorch voice-recognition speech-to-text transcription whisper rocm radeon amd-gpu fastapi openai-whisper rdna3 strix-point

Updated Dec 29, 2025
Python

Peterc3-dev / miopen-gfx1150

MIOpen research for AMD RDNA 3.5 (gfx1150) — whitelist patch and three-bug analysis for Strix Point APU training

deep-learning amd gpu rocm miopen rdna3 gfx1150

Updated May 29, 2026

blockfeed / llama-swap_homelab

llama-swap config for MTP speculative decoding on AMD RX 7900 XTX (ROCm). Qwen3.6-35B-A3B-MTP and Gemma 4 26B-A4B-MTP with VRAM-tuned context sizing.

amd gpu inference multi-model mtp rocm llm llama-cpp qwen rdna3 llama-swap

Updated Jun 4, 2026

slb350 / strix-benchmarks

Local LLM benchmarks on AMD Strix Halo — 26+ models tested across RADV, AMDVLK, and ROCm with llama.cpp

amd vulkan benchmarks rocm radeon llm llama-cpp local-llm rdna3 strix-halo

Updated Jun 6, 2026
Astro

sammyjoyce / rocm-nightly-flake

ROCm nightly monolithic tarball packaged as a Nix flake (gfx1151)

nix nixos amd gpu hip rocm nix-flake rdna3 strix-halo gfx1151

Updated Jun 7, 2026
Nix

thejeangenie18 / rocm-7700xt-qlora

A reproducible QLoRA training pipeline for Qwen2.5‑3B on AMD ROCm. Built after four days of being an absolute pipeline rat debugging kernels, dtype mismatches, and ROCm quirks until the model finally trained clean. Includes Quanto 4‑bit, LoRA, templates, and accessibility‑focused examples.

python open-source machine-learning ai deep-learning transformers pytorch rocm fine-tuning peft amd-gpu local-llm llm-training rdna3 nf4 qwen2-5 local-first-ai gfx1101 aceelerate

Updated Jun 6, 2026
Python

doublemover / RNS8

RNS8 explores exact integer matrix multiplication on AMD GPU matrix engines.

hpc amd matrix hip rdna crt amdgpu rocm ck rns cdna matrix-engine rdna3 rdna4 hipblaslt rocwmma

Updated Jun 7, 2026
C++

Improve this page

Add a description, image, and links to the rdna3 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rdna3 topic, visit your repo's landing page and select "manage topics."