Hoist W4A8 activation quantization out of GEMM K-loop #13118

Triggered via pull request May 7, 2026 11:01
Status Cancelled
Total duration 12s

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Waiting for pending jobs
Matrix: test-cuda-builds
Waiting for pending jobs
test-models-cuda / job
unittest-cuda / job
Matrix: test-cuda-pybind
Waiting for pending jobs
Matrix: test-model-cuda-e2e
Waiting for pending jobs
check-all-cuda-builds

Annotations

1 error
Test CUDA Builds
Canceling since a higher priority waiting request for Test CUDA Builds-19209-false-false exists