[BugFix] Add patch for local_cache_hit calculation in vllm 0.18.0 met… #941
Merged
qyh111 merged 1 commit into ModelEngine-Group:develop on Apr 29, 2026
qyh111 approved these changes Apr 29, 2026
Purpose
Fix a negative `local_cache_hit` calculation that can crash Prometheus counters.

Related vLLM issue: vllm-project/vllm#36755
When preemption occurs with async scheduling, there's a race condition where:
- `schedule(N+1)` can preempt a request and reset its state
- `update_from_output(N)` reads the already-mutated request state

This can result in `num_external_computed_tokens` exceeding `num_cached_tokens + recomputed`, causing `local_cache_hit` to become negative. Prometheus counters then crash with:

`ValueError: Counters can only be incremented by non-negative amounts.`
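For context, the crash itself is just `prometheus_client` rejecting a negative increment. A minimal repro of that failure mode, where the counter name and the token values are illustrative stand-ins (not taken from the patch) for a raced read:

```python
# Minimal repro of the crash path using prometheus_client, which vLLM's
# metrics are built on. The negative delta stands in for a local_cache_hit
# computed from request state that schedule(N+1) already mutated.
from prometheus_client import Counter

# Illustrative counter, not the actual vLLM metric name.
prompt_cache_hits = Counter(
    "local_cache_hit_tokens", "Prompt tokens served from the local cache"
)

num_cached_tokens = 100              # hypothetical values
recomputed = 0
num_external_computed_tokens = 120   # stale read exceeds the total

local_cache_hit = num_cached_tokens + recomputed - num_external_computed_tokens
# Raises: ValueError: Counters can only be incremented by non-negative amounts.
prompt_cache_hits.inc(local_cache_hit)
```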
Modifications
- Patched `vllm.v1.metrics.stats.PromptTokenStats.update_from_output` to wrap the `local_cache_hit` calculation with `max(0, ...)` (see the sketch after this list)
- Added `v0180/vllm/pc/metrics/stats.py` with the patched method
- Added `v0180/vllm/pc_patch.py` to register the patch
- Updated `apply_patch.py` to support vllm 0.18.0
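A minimal sketch of the clamp, not the exact vLLM 0.18.0 source: the attribute names on `output` mirror the quantities named in this PR description, while the method signature and surrounding class are assumptions.

```python
# Sketch of the patched method; field names other than those mentioned
# in the PR description are assumed.
class PromptTokenStats:
    def __init__(self) -> None:
        self.local_cache_hit = 0

    def update_from_output(self, output) -> None:
        # Under async scheduling, schedule(N+1) may already have preempted
        # the request and reset its state before update_from_output(N) runs,
        # so this difference can come out negative. Clamping to zero keeps
        # the Prometheus counter increment non-negative.
        local_cache_hit = max(
            0,
            output.num_cached_tokens
            + output.recomputed
            - output.num_external_computed_tokens,
        )
        self.local_cache_hit += local_cache_hit
```

The registration in `v0180/vllm/pc_patch.py` presumably follows the usual monkey-patch pattern; a hypothetical sketch, assuming only that the patched method lives in `v0180/vllm/pc/metrics/stats.py`:

```python
# Hypothetical registration sketch; the real pc_patch.py may differ.
import vllm.v1.metrics.stats as vllm_stats

from v0180.vllm.pc.metrics.stats import update_from_output  # patched method


def apply_metrics_patch() -> None:
    # Swap in the clamped implementation so every PromptTokenStats
    # instance uses it from this point on.
    vllm_stats.PromptTokenStats.update_from_output = update_from_output
```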