
Commit 516983e

Disallow programmatic stream serialization in JIT kernel launches (#1932)
This option allows the driver to overlap kernels on the same stream. The cuVS kernel model does not use it, and we expect no overlap between kernels on the same stream. Leaving it enabled may cause downstream issues in mixed stream usage.

Authors:
- Divye Gala (https://github.com/divyegala)

Approvers:
- Jinsol Park (https://github.com/jinsolp)

URL: #1932
1 parent af70615 commit 516983e

1 file changed

Lines changed: 1 addition & 6 deletions

cpp/src/detail/jit_lto/AlgorithmLauncher.cpp

Lines changed: 1 addition & 6 deletions
@@ -37,17 +37,12 @@ AlgorithmLauncher& AlgorithmLauncher::operator=(AlgorithmLauncher&& other) noexc
 void AlgorithmLauncher::call(
   cudaStream_t stream, dim3 grid, dim3 block, std::size_t shared_mem, void** kernel_args)
 {
-  cudaLaunchAttribute attribute[1];
-  attribute[0].id = cudaLaunchAttributeProgrammaticStreamSerialization;
-  attribute[0].val.programmaticStreamSerializationAllowed = 1;
-
   cudaLaunchConfig_t config;
   config.gridDim = grid;
   config.blockDim = block;
   config.stream = stream;
-  config.attrs = attribute;
-  config.numAttrs = 1;
   config.dynamicSmemBytes = shared_mem;
+  config.numAttrs = 0;
 
   RAFT_CUDA_TRY(cudaLaunchKernelExC(&config, kernel, kernel_args));
 }
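
For readers who want to try the post-change launch pattern outside cuVS, here is a minimal standalone sketch. It assumes CUDA 11.8 or newer (for `cudaLaunchKernelExC`) and an available GPU. The kernel `scale` and the helpers `launch_plain` and `run_demo` are hypothetical names introduced for illustration and are not part of cuVS or RAFT. Zero-initializing the config is a choice made in this sketch, not something the commit does. With no launch attributes set, the second launch is ordered after the first on the same stream, which is the behavior the commit message expects.

```cuda
#include <cuda_runtime.h>

#include <cstddef>
#include <vector>

// Hypothetical kernel for illustration only: multiplies each element in place.
__global__ void scale(float* data, float factor, int n)
{
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) { data[i] *= factor; }
}

// Launch through cudaLaunchKernelExC with no launch attributes, so the kernel
// follows ordinary stream ordering. Hypothetical helper, mirroring the shape
// of AlgorithmLauncher::call after this commit.
cudaError_t launch_plain(const void* func,
                         cudaStream_t stream,
                         dim3 grid,
                         dim3 block,
                         std::size_t shared_mem,
                         void** kernel_args)
{
  cudaLaunchConfig_t config = {};  // sketch choice: zero-initialize every field
  config.gridDim            = grid;
  config.blockDim           = block;
  config.stream             = stream;
  config.dynamicSmemBytes   = shared_mem;
  config.attrs              = nullptr;  // no programmatic stream serialization
  config.numAttrs           = 0;
  return cudaLaunchKernelExC(&config, func, kernel_args);
}

// Runs two dependent launches on one stream and checks the combined result.
// Returns true only if every CUDA call succeeds and each element equals 4.0f.
bool run_demo()
{
  const int n             = 1024;
  const std::size_t bytes = n * sizeof(float);
  std::vector<float> host(n, 1.0f);

  float* dev = nullptr;
  if (cudaMalloc(&dev, bytes) != cudaSuccess) { return false; }

  cudaStream_t stream;
  if (cudaStreamCreate(&stream) != cudaSuccess) {
    cudaFree(dev);
    return false;
  }

  float factor = 2.0f;
  int count    = n;
  void* args[] = {&dev, &factor, &count};
  dim3 block(256);
  dim3 grid((n + block.x - 1) / block.x);
  const void* func = reinterpret_cast<const void*>(&scale);

  bool ok = cudaMemcpyAsync(dev, host.data(), bytes, cudaMemcpyHostToDevice, stream) == cudaSuccess;
  // The second launch reads what the first one wrote, so it relies on stream ordering.
  ok = ok && launch_plain(func, stream, grid, block, 0, args) == cudaSuccess;
  ok = ok && launch_plain(func, stream, grid, block, 0, args) == cudaSuccess;
  ok = ok && cudaMemcpyAsync(host.data(), dev, bytes, cudaMemcpyDeviceToHost, stream) == cudaSuccess;
  ok = ok && cudaStreamSynchronize(stream) == cudaSuccess;

  cudaStreamDestroy(stream);
  cudaFree(dev);

  for (int i = 0; ok && i < n; ++i) {
    ok = (host[i] == 4.0f);  // 1.0 * 2.0 * 2.0
  }
  return ok;
}
```

Calling `run_demo()` from a `main` compiled with `nvcc` exercises both launches. The removed attribute would only be appropriate for kernels written for programmatic dependent launch, where the dependent kernel calls `cudaGridDependencySynchronize()` before reading the earlier kernel's output.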
