Skip to content

Commit c4d1343

Browse files
committed
Skip rate limiter when pc sampling is active
1 parent c71dfa4 commit c4d1343

1 file changed

Lines changed: 5 additions & 2 deletions

File tree

src/cupti.cpp

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -528,8 +528,11 @@ class CuptiProfiler : public proton::Singleton<CuptiProfiler> {
528528
CUPTI_RUNTIME_TRACE_CBID_cudaGraphLaunch_ptsz_v10000);
529529
}
530530

531-
// Rate limit probes using token bucket (skip for graph launches)
532-
if (!isGraphLaunch) {
531+
// Rate limit probes using token bucket. Skip rate limiting for graph
532+
// launches (they share one correlation ID across many kernels) and when
533+
// PC sampling is active (every kernel needs its correlation callback so
534+
// PC samples can be matched with CPU stacks on the agent side).
535+
if (!isGraphLaunch && !g_pcSamplingState.active) {
533536
if (!callbackLimiter.tryAcquire()) {
534537
DEBUG_PRINTF(
535538
"[PARCAGPU] Rate limited: skipping probe for correlationId=%u\n",

0 commit comments

Comments
 (0)