Commit 11bf290
ex_voxelBlockManager_host_cuda: add -fopenmp-simd compile flag
Activates #pragma omp simd hints in decodeInverseMaps and
buildMaskPrefixSums without requiring the OpenMP runtime library,
yielding ~10-15% faster CPU decode throughput.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Efty Sifakis <esifakis@nvidia.com>1 parent 8c33baf commit 11bf290
1 file changed
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
117 | 117 | | |
118 | 118 | | |
119 | 119 | | |
120 | | - | |
121 | | - | |
| 120 | + | |
| 121 | + | |
122 | 122 | | |
123 | 123 | | |
124 | 124 | | |
| |||
0 commit comments