Skip to content

feat: calibrated token counter with online regression from LLM usage#1498

Open
rootfs wants to merge 2 commits into
vllm-project:mainfrom
rootfs:prompt-len-signal-v2
Open

feat: calibrated token counter with online regression from LLM usage#1498
rootfs wants to merge 2 commits into
vllm-project:mainfrom
rootfs:prompt-len-signal-v2

Conversation

@rootfs
Copy link
Copy Markdown
Collaborator

@rootfs rootfs commented Mar 11, 2026

The context length estimator is not accurate. This PR uses regression based estimate to be model and prompt content agnostic.

Signed-off-by: Huamin Chen <hchen@redhat.com>
@rootfs rootfs requested a review from Xunzhuo as a code owner March 11, 2026 18:57
@netlify
Copy link
Copy Markdown

netlify Bot commented Mar 11, 2026

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit 43fa4e0
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/69b900281e650400070722c9
😎 Deploy Preview https://deploy-preview-1498--vllm-semantic-router.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Mar 11, 2026

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 src

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

  • src/semantic-router/pkg/classification/calibrated_token_counter.go
  • src/semantic-router/pkg/classification/calibrated_token_counter_test.go
  • src/semantic-router/pkg/classification/classifier.go
  • src/semantic-router/pkg/extproc/processor_res_body_pipeline.go
  • src/semantic-router/pkg/extproc/processor_res_body_streaming.go

vLLM

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants