rdna3
Here are 29 public repositories matching this topic...
Windows-only version of ComfyUI that uses AMD's official ROCm and PyTorch libraries for better performance on AMD GPUs. Includes auto-installation and popular performance-enhancing packages such as triton, sage-attention, flash-attention, and bitsandbytes.
Updated Jun 7, 2026 - Python
The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.
Updated Jan 27, 2026 - Shell
TRELLIS (Microsoft's Image-to-3D generator) running on AMD GPUs with ROCm. Includes Gaussian splatting, mesh extraction, and GLB export. Tested on RX 7800 XT.
Updated May 9, 2026 - Jupyter Notebook
Multi-GPU tensor/context parallel diffusion on AMD ROCm — with the patch that makes it actually work.
Updated Apr 19, 2026 - Python
Working ROCm 6.1 + PyTorch environment for RDNA3 with QLoRA training. Built after fighting pipeline rats and wondering, “Lisa Su, girl… what are they doing down there.”
Updated May 30, 2026 - Python
FLUX.1-dev on AMD Radeon consumer GPUs — fast, low-VRAM, and shippable. Backport patches + benchmarks for torchao + diffusers group_offload on ROCm.
Updated Apr 19, 2026 - Python
Docker infrastructure for AMD Strix Halo (RDNA 3.5 / gfx1151): PyTorch + ROCm base container and a separate Ollama LLM service. Two folders, two Compose files, one Strix Halo box.
Updated Apr 26, 2026 - Shell
Fast, local LLM inference on AMD-powered mini PCs, delivering 65-87 t/s for large models without cloud or subscription costs.
Updated Jun 7, 2026 - Shell
PyTorch built from source for AMD RDNA 3.5 (gfx1150) — Radeon 890M/880M GPU acceleration
Updated Mar 30, 2026 - Shell
High-performance exact vector search on NVIDIA & AMD GPUs. 9354 QPS (RTX 5090D), 6275 QPS (RX 7900 XTX). Open-source, deterministic, and recall=1.0 — enabling Trustworthy AI at the edge.
Updated Jan 22, 2026 - C++
Local speech-to-text with AMD ROCm GPU acceleration
Updated Dec 29, 2025 - Python
Local LLM benchmarks on AMD Strix Halo — 26+ models tested across RADV, AMDVLK, and ROCm with llama.cpp
Updated Jun 6, 2026 - Astro
A reproducible QLoRA training pipeline for Qwen2.5‑3B on AMD ROCm. Built after four days of being an absolute pipeline rat debugging kernels, dtype mismatches, and ROCm quirks until the model finally trained clean. Includes Quanto 4‑bit, LoRA, templates, and accessibility‑focused examples.
Updated Jun 6, 2026 - Python