feat: add ReVSI evaluation by eamonn-zh · Pull Request #1307 · EvolvingLMMs-Lab/lmms-eval

eamonn-zh · 2026-04-24T23:22:28Z

Summary

Add evaluation code for ReVSI benchmark

In scope

Add a new task revsi

Type of Change

kcz358

Hi, thank you for the contributions. For the different frames, I think this should be model parameters adjustment instead of creating extra tasks? So maybe we should only need the all frame yaml if that is the case. And the the aggregation, it would be better if you can add the metrics in the metrics list so it will be logged in the result.json instead just use logger to log the results. Thank you.

Co-authored-by: Copilot <copilot@github.com>

eamonn-zh · 2026-05-06T05:09:37Z

Hi @kcz358, thanks for the suggestions! We have added the metrics to the metrics list and removed the self-logging of results.

Regarding the different frame settings, ReVSI defines four frame-budget subsets. Each subset provides different ground-truth answers to ensure that questions remain answerable under the corresponding frame sampling budget and are not invalidated by missing visual evidence. This frame-adaptive evaluation protocol is an important feature and core contribution of ReVSI, so all four sub-tasks (all-frame, 64-frame, 32-frame and 16-frame) are necessary. Thank you!

kcz358

Thanks for the clarification. LGTM

eamonn-zh added 2 commits April 4, 2026 15:16

add the ReVSI benchmark

97d1972

add the ReVSI benchmark

a91af10

eamonn-zh changed the title ~~Add ReVSI Evaluation~~ feat: add ReVSI evaluation Apr 25, 2026

kcz358 reviewed May 6, 2026

View reviewed changes

feat: enhance REVSI metrics and aggregation functions

ea062f5

Co-authored-by: Copilot <copilot@github.com>

eamonn-zh requested a review from kcz358 May 6, 2026 05:11

kcz358 approved these changes May 6, 2026

View reviewed changes

kcz358 merged commit 78d6c72 into EvolvingLMMs-Lab:main May 6, 2026
1 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add ReVSI evaluation#1307

feat: add ReVSI evaluation#1307
kcz358 merged 3 commits intoEvolvingLMMs-Lab:mainfrom
eamonn-zh:main

eamonn-zh commented Apr 24, 2026

Uh oh!

kcz358 left a comment

Uh oh!

eamonn-zh commented May 6, 2026

Uh oh!

kcz358 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

eamonn-zh commented Apr 24, 2026

Summary

In scope

Type of Change

Uh oh!

kcz358 left a comment

Choose a reason for hiding this comment

Uh oh!

eamonn-zh commented May 6, 2026

Uh oh!

kcz358 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants