Skip to content

Update SPIRVIntrinsics version to 1#713

Merged
christiangnrd merged 1 commit into
mainfrom
spriv
Jun 18, 2026
Merged

Update SPIRVIntrinsics version to 1#713
christiangnrd merged 1 commit into
mainfrom
spriv

Conversation

@christiangnrd

Copy link
Copy Markdown
Member

@christiangnrd christiangnrd requested a review from maleadt June 18, 2026 14:08
@christiangnrd christiangnrd merged commit a8022b2 into main Jun 18, 2026
40 of 44 checks passed
@christiangnrd christiangnrd deleted the spriv branch June 18, 2026 14:21
@github-actions

Copy link
Copy Markdown
Contributor

Benchmark Results

main 17deba4... main / 17deba4...
saxpy/default/Float32/1024 0.0743 ± 0.031 ms 0.0759 ± 0.03 ms 0.979 ± 0.56
saxpy/default/Float32/1048576 0.459 ± 0.024 ms 0.462 ± 0.026 ms 0.993 ± 0.076
saxpy/default/Float32/16384 0.062 ± 0.032 ms 0.0618 ± 0.031 ms 1 ± 0.73
saxpy/default/Float32/2048 0.0714 ± 0.03 ms 0.0742 ± 0.028 ms 0.961 ± 0.55
saxpy/default/Float32/256 0.0652 ± 0.033 ms 0.0672 ± 0.033 ms 0.971 ± 0.69
saxpy/default/Float32/262144 0.166 ± 0.03 ms 0.166 ± 0.03 ms 1 ± 0.26
saxpy/default/Float32/32768 0.0684 ± 0.03 ms 0.0689 ± 0.03 ms 0.993 ± 0.62
saxpy/default/Float32/4096 0.0722 ± 0.026 ms 0.0581 ± 0.028 ms 1.24 ± 0.75
saxpy/default/Float32/512 0.0692 ± 0.033 ms 0.071 ± 0.031 ms 0.975 ± 0.63
saxpy/default/Float32/64 0.0535 ± 0.033 ms 0.0565 ± 0.034 ms 0.947 ± 0.81
saxpy/default/Float32/65536 0.0809 ± 0.031 ms 0.0825 ± 0.031 ms 0.981 ± 0.52
saxpy/default/Float64/1024 0.0716 ± 0.032 ms 0.0748 ± 0.03 ms 0.957 ± 0.58
saxpy/default/Float64/1048576 0.56 ± 0.081 ms 0.553 ± 0.087 ms 1.01 ± 0.22
saxpy/default/Float64/16384 0.0604 ± 0.027 ms 0.0603 ± 0.026 ms 1 ± 0.61
saxpy/default/Float64/2048 0.0566 ± 0.032 ms 0.0609 ± 0.031 ms 0.929 ± 0.71
saxpy/default/Float64/256 0.0647 ± 0.033 ms 0.0659 ± 0.033 ms 0.982 ± 0.71
saxpy/default/Float64/262144 0.193 ± 0.036 ms 0.193 ± 0.037 ms 0.997 ± 0.27
saxpy/default/Float64/32768 0.0745 ± 0.029 ms 0.0736 ± 0.028 ms 1.01 ± 0.55
saxpy/default/Float64/4096 0.0718 ± 0.026 ms 0.0618 ± 0.028 ms 1.16 ± 0.68
saxpy/default/Float64/512 0.0684 ± 0.033 ms 0.072 ± 0.031 ms 0.949 ± 0.62
saxpy/default/Float64/64 0.0513 ± 0.033 ms 0.0537 ± 0.033 ms 0.954 ± 0.85
saxpy/default/Float64/65536 0.0946 ± 0.03 ms 0.0908 ± 0.029 ms 1.04 ± 0.47
saxpy/static workgroup=(1024,)/Float32/1024 0.0724 ± 0.031 ms 0.072 ± 0.03 ms 1 ± 0.6
saxpy/static workgroup=(1024,)/Float32/1048576 0.457 ± 0.024 ms 0.449 ± 0.022 ms 1.02 ± 0.073
saxpy/static workgroup=(1024,)/Float32/16384 0.0594 ± 0.031 ms 0.0589 ± 0.03 ms 1.01 ± 0.73
saxpy/static workgroup=(1024,)/Float32/2048 0.0682 ± 0.03 ms 0.0713 ± 0.028 ms 0.956 ± 0.56
saxpy/static workgroup=(1024,)/Float32/256 0.0552 ± 0.029 ms 0.0573 ± 0.029 ms 0.964 ± 0.7
saxpy/static workgroup=(1024,)/Float32/262144 0.164 ± 0.028 ms 0.163 ± 0.028 ms 1.01 ± 0.24
saxpy/static workgroup=(1024,)/Float32/32768 0.0647 ± 0.028 ms 0.0654 ± 0.028 ms 0.989 ± 0.6
saxpy/static workgroup=(1024,)/Float32/4096 0.0691 ± 0.027 ms 0.0682 ± 0.028 ms 1.01 ± 0.58
saxpy/static workgroup=(1024,)/Float32/512 0.0677 ± 0.03 ms 0.068 ± 0.03 ms 0.995 ± 0.62
saxpy/static workgroup=(1024,)/Float32/64 0.0521 ± 0.03 ms 0.0542 ± 0.029 ms 0.96 ± 0.76
saxpy/static workgroup=(1024,)/Float32/65536 0.0785 ± 0.03 ms 0.0793 ± 0.03 ms 0.99 ± 0.54
saxpy/static workgroup=(1024,)/Float64/1024 0.0682 ± 0.032 ms 0.0699 ± 0.031 ms 0.975 ± 0.63
saxpy/static workgroup=(1024,)/Float64/1048576 0.547 ± 0.089 ms 0.52 ± 0.086 ms 1.05 ± 0.24
saxpy/static workgroup=(1024,)/Float64/16384 0.0593 ± 0.027 ms 0.0568 ± 0.025 ms 1.04 ± 0.66
saxpy/static workgroup=(1024,)/Float64/2048 0.055 ± 0.032 ms 0.0547 ± 0.031 ms 1 ± 0.82
saxpy/static workgroup=(1024,)/Float64/256 0.0639 ± 0.029 ms 0.0634 ± 0.03 ms 1.01 ± 0.66
saxpy/static workgroup=(1024,)/Float64/262144 0.191 ± 0.033 ms 0.186 ± 0.033 ms 1.02 ± 0.26
saxpy/static workgroup=(1024,)/Float64/32768 0.0707 ± 0.027 ms 0.0677 ± 0.026 ms 1.04 ± 0.57
saxpy/static workgroup=(1024,)/Float64/4096 0.0711 ± 0.027 ms 0.0599 ± 0.028 ms 1.19 ± 0.71
saxpy/static workgroup=(1024,)/Float64/512 0.0667 ± 0.031 ms 0.0641 ± 0.03 ms 1.04 ± 0.68
saxpy/static workgroup=(1024,)/Float64/64 0.0549 ± 0.032 ms 0.0498 ± 0.03 ms 1.1 ± 0.92
saxpy/static workgroup=(1024,)/Float64/65536 0.0896 ± 0.028 ms 0.0862 ± 0.027 ms 1.04 ± 0.46
time_to_load 0.989 ± 0.011 s 1.01 ± 0.015 s 0.978 ± 0.018

Benchmark Plots

A plot of the benchmark results have been uploaded as an artifact to the workflow run for this PR.
Go to "Actions"->"Benchmark a pull request"->[the most recent run]->"Artifacts" (at the bottom).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants