- compile.sh: target Vulkan 1.1 (SPIR-V 1.3) so the 12 subgroup-using
quantized kernels (matmul_q*k*, matmul_gpuq*) actually compile; the default
`glslangValidator -V` emits SPIR-V 1.0, which does not support subgroup
operations, so those kernels failed to compile. Added `set -euo pipefail`
and a non-zero exit on any failure.
- vulkan_engine.cpp: drop the hardcoded /home/raz/... shader path (leaked a
developer path and broke on every other machine). Fall back to the
TORCH_VULKAN_SHADER_DIR env var instead.
- __init__.py: export the resolved bundled-shader dir into
TORCH_VULKAN_SHADER_DIR so the lazily-constructed VulkanEngine resolves the
same shaders. Aligned the module docstring with the README (.to("vulkan")
is the supported path; torch.randn(device=) / .vulkan() are only partially
supported).
- tests: removed hardcoded /home/raz/builds/pytorch-gfx1150 sys.path inserts;
honour TORCH_VULKAN_PYTORCH_PATH instead. Fixed the misleading "relu isn't
implemented" fallback test (relu IS wired) and split it into a real relu
correctness test plus a genuine CPU-fallback test using an unimplemented op.
- Removed dead imports (numpy, time, sys) and unused locals flagged by ruff.
- Added pyproject.toml (ruff config) and .github/workflows/ci.yml that
compiles all shaders and runs ruff + py_compile. The GPU build/test suite is
not run in CI (no GPU on hosted runners).
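
The two environment-variable hooks above could look roughly like the sketch
below. It is illustrative only, not the code in this commit: the helper names
export_shader_dir and add_pytorch_path are hypothetical, and keeping a
pre-existing TORCH_VULKAN_SHADER_DIR value (via setdefault) is an assumption
about the intended precedence.

```python
import os
import sys
from pathlib import Path


def export_shader_dir(bundled_dir, environ=None):
    """Publish the bundled shader dir through TORCH_VULKAN_SHADER_DIR.

    Hypothetical helper. Assumption: a value the user already exported
    wins, which is why setdefault is used instead of plain assignment.
    """
    env = os.environ if environ is None else environ
    return env.setdefault("TORCH_VULKAN_SHADER_DIR", str(Path(bundled_dir)))


def add_pytorch_path(environ=None, path_list=None):
    """Prepend TORCH_VULKAN_PYTORCH_PATH to the import path if it is set.

    Hypothetical helper standing in for the removed hardcoded sys.path
    inserts. Returns the configured path, or None when the variable is unset.
    """
    env = os.environ if environ is None else environ
    paths = sys.path if path_list is None else path_list
    custom = env.get("TORCH_VULKAN_PYTORCH_PATH")
    if custom and custom not in paths:
        paths.insert(0, custom)
    return custom
```

Both helpers take the environment and path list as optional arguments so
they can be exercised without mutating the real process state.
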
Co-authored-by: Peter Clemente III <peterc3@live.com>
Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>