feat: add ReVSI evaluation #1307

Merged
kcz358 merged 3 commits into EvolvingLMMs-Lab:main from eamonn-zh:main
May 6, 2026

@eamonn-zh
Contributor

Summary

In scope

  • Add a new task revsi

Type of Change

  • Bug fix (non-breaking change)
  • New feature
  • New benchmark/task
  • New model integration
  • Breaking change
  • Documentation update
  • Refactoring (no functional changes)

@eamonn-zh changed the title from "Add ReVSI Evaluation" to "feat: add ReVSI evaluation" on Apr 25, 2026
Collaborator

@kcz358 kcz358 left a comment

Hi, thank you for the contribution. The different frame counts look like a model-parameter adjustment rather than something that needs extra tasks, so we may only need the all-frame YAML if that is the case. For the aggregation, it would be better to add the metrics to the metrics list so they are logged in result.json, instead of only using the logger to log the results. Thank you.
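For illustration, the aggregation suggestion could look roughly like the sketch below: a per-sample function returns a score keyed by metric name, and an aggregation function returns a number instead of logging it, so the harness can write it to result.json. The metric name `revsi_acc`, the document fields `answer` and `question_type`, and the macro-average over question types are all assumptions for this example, not the code in this PR.

```python
from collections import defaultdict
from typing import Dict, List


def process_results(doc: dict, results: List[str]) -> Dict[str, dict]:
    """Score one sample and return it keyed by a (hypothetical) metric name."""
    pred = results[0].strip().lower()
    gold = str(doc["answer"]).strip().lower()  # "answer" is an assumed field
    return {
        "revsi_acc": {
            "question_type": doc["question_type"],  # assumed field
            "correct": float(pred == gold),
        }
    }


def aggregate_accuracy(items: List[dict]) -> float:
    """Return a macro-average over question types instead of logging it.

    Returning the value lets the evaluation harness record it with the
    other metrics; the macro-average itself is an assumption.
    """
    per_type = defaultdict(list)
    for item in items:
        per_type[item["question_type"]].append(item["correct"])
    type_means = [sum(v) / len(v) for v in per_type.values()]
    return sum(type_means) / len(type_means) if type_means else 0.0
```

In a task YAML, a function like `aggregate_accuracy` would then be referenced from the metric list entry for the corresponding metric, rather than called from inside a logging helper.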

Co-authored-by: Copilot <copilot@github.com>
@eamonn-zh
Contributor Author

Hi @kcz358, thanks for the suggestions! We have added the metrics to the metrics list and removed the self-logging of results.

Regarding the different frame settings, ReVSI defines four frame-budget subsets. Each subset provides different ground-truth answers to ensure that questions remain answerable under the corresponding frame sampling budget and are not invalidated by missing visual evidence. This frame-adaptive evaluation protocol is an important feature and core contribution of ReVSI, so all four sub-tasks (all-frame, 64-frame, 32-frame and 16-frame) are necessary. Thank you!
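As a rough sketch of why the four subsets cannot be collapsed into a single model parameter: each frame budget selects both a set of frames and its own ground-truth answer. The subset names, the `answers` field, and the uniform sampling scheme below are assumptions made for illustration; the PR does not state how ReVSI samples frames or stores its labels.

```python
from typing import Dict, List, Optional

# Hypothetical subset names mapped to frame budgets (None means all frames).
FRAME_BUDGETS: Dict[str, Optional[int]] = {
    "all": None,
    "64": 64,
    "32": 32,
    "16": 16,
}


def sample_frame_indices(total_frames: int, budget: Optional[int]) -> List[int]:
    """Pick frame indices for a budget (uniform sampling is an assumption)."""
    if budget is None or budget >= total_frames:
        return list(range(total_frames))
    step = total_frames / budget
    # Take the middle frame of each of the `budget` equal-width segments.
    return [int(step * i + step / 2) for i in range(budget)]


def ground_truth_for(doc: dict, subset: str) -> str:
    """Look up the answer that is valid under the given frame budget.

    Assumes a hypothetical layout where each document stores one answer
    per subset, since the visible evidence changes with the budget.
    """
    return doc["answers"][subset]
```

With this layout, the same question can carry a different correct answer under the 16-frame budget than under the all-frame setting, which is the case a single shared label could not represent.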

@eamonn-zh eamonn-zh requested a review from kcz358 May 6, 2026 05:11
Collaborator

@kcz358 kcz358 left a comment

Thanks for the clarification. LGTM

@kcz358 kcz358 merged commit 78d6c72 into EvolvingLMMs-Lab:main May 6, 2026
1 of 3 checks passed

2 participants