Skip to content

Commit ecbec13

Browse files
committed
well actually
Signed-off-by: Alexander Droste <alexander.droste@protonmail.com>
1 parent 76445a0 commit ecbec13

1 file changed

Lines changed: 5 additions & 4 deletions

File tree

vortex-cuda/src/dynamic_dispatch/plan_builder.rs

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -224,11 +224,12 @@ impl DispatchPlan {
224224
}
225225

226226
impl FusedPlan {
227-
/// Maximum shared memory per block in bytes (96 KB).
227+
/// Maximum shared memory per block in bytes (48 KB).
228228
///
229-
/// NVIDIA GPUs from Fermi (CC 2.x) through Blackwell (CC 10.0)
230-
/// use 96 KB as their default limit for shared memory per block.
231-
const MAX_SHARED_MEM_BYTES: u32 = 96 * 1024;
229+
/// 48 KB is the default per-block dynamic shared memory limit across
230+
/// all CUDA architectures. Higher limits (up to 227 KB on Hopper)
231+
/// require an explicit opt-in via `cuFuncSetAttribute`.
232+
const MAX_SHARED_MEM_BYTES: u32 = 48 * 1024;
232233

233234
/// Build a plan by walking the encoding tree from root to leaf.
234235
///

0 commit comments

Comments
 (0)