[Cherry-Pick]【TI-Consisent】Added Metric logits_stats to the ZMQ branch (#6979)#6978
Open
liuruyan wants to merge 9 commits into
Open
[Cherry-Pick]【TI-Consisent】Added Metric logits_stats to the ZMQ branch (#6979)#6978liuruyan wants to merge 9 commits into
liuruyan wants to merge 9 commits into
Conversation
|
Thanks for your contribution! |
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## release/2.5 #6978 +/- ##
==============================================
Coverage ? 68.40%
==============================================
Files ? 390
Lines ? 54120
Branches ? 8523
==============================================
Hits ? 37021
Misses ? 14408
Partials ? 2691
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
EmmonsCurse
approved these changes
Mar 27, 2026
Collaborator
EmmonsCurse
left a comment
There was a problem hiding this comment.
LGTM~ Skip coverage check as it mainly relies on end-to-end tests.
Contributor
Author
|
/re-run run_tests_with_coverage |
sunlei1024
approved these changes
Mar 31, 2026
zoooo0820
approved these changes
Apr 1, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Motivation
背景:出于对训推一致性丰富检测指标及长期CI/CE监控考量,对sample之后的logits添加logits_stats(min/max/mean/std),用来保证确定性及稳定性。
Modifications
数据结构及接口:由于logprob与logits_stat同样是输出的重要检测指标且均从logits计算而来,暂时实现先将logits_stat存入LogprobsTensors数据结构保存,并升级logprob传播链路上相关接口,支持同时透传logits_stats
FLAG:添加与enable_logprob同级别model_config:
self.compute_logits_stats = False,且在server启动时支持配置--compute-logits-statsUsage or Command
--compute-logits-stats,--enable-logproblogprobs=True,top_logprobs=0response = client.chat.completions.create( model="null", messages=[ {"role": "system", "content": "I'm a helpful AI assistant."}, {"role": "user", "content": "把李白的静夜思改写为现代诗"}, ], stream=True, # False max_tokens=100, logprobs=True, top_logprobs=0 )Accuracy Tests
本PR不涉及精度修改,稳定性测试结果如下:
各自多次实验所有Token logits指标均可对齐
流式及非流式所有Token logits指标均可对齐
流式request返回体:
Checklist
[FDConfig],[APIServer],[Engine],[Scheduler],[PD Disaggregation],[Executor],[Graph Optimization],[Speculative Decoding],[RL],[Models],[Quantization],[Loader],[OP],[KVCache],[DataProcessor],[BugFix],[Docs],[CI],[Optimization],[Feature],[Benchmark],[Others],[XPU],[HPU],[GCU],[DCU],[Iluvatar],[Metax]]pre-commitbefore commit.releasebranch, make sure the PR has been submitted to thedevelopbranch, then cherry-pick it to thereleasebranch with the[Cherry-Pick]PR tag.