Skip to content

[CUDA] Branchless NF4/FP4 kDequantizeBlockwise kernel for faster dequantization #1306

[CUDA] Branchless NF4/FP4 kDequantizeBlockwise kernel for faster dequantization

[CUDA] Branchless NF4/FP4 kDequantizeBlockwise kernel for faster dequantization #1306

Status Success
Total duration 3m 50s
Artifacts 1

build_pr_documentation.yml

on: pull_request
build  /  build_pr_documentation
3m 42s
build / build_pr_documentation
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
doc-build-artifact Expired
339 KB
sha256:0b4121e7fd04f48a0ba79de406081882bb10790c45a2bd8de7c226c71d8424a8