fix: handle internvl_hf video-only inputs and enable frame sampling by akawincent · Pull Request #1279 · EvolvingLMMs-Lab/lmms-eval

akawincent · 2026-03-28T06:41:01Z

Summary

This PR fixes two internvl_hf video handling bugs in lmms_eval/models/chat/internvl_hf.py with three changes:

explicitly enables frame sampling when num_frames or fps is configured
normalizes empty image inputs to None for video-only examples
avoids building image_sizes from an empty image list
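
A minimal sketch of the two video-only guards listed above, not the actual diff. The helper names `normalize_visuals` and `build_image_sizes` are hypothetical, and the sketch assumes each image exposes a PIL-style `.size` attribute:

```python
from typing import Any, List, Optional, Tuple


def normalize_visuals(visuals: Optional[List[Any]]) -> Optional[List[Any]]:
    """Return None for a missing or empty image list, else the list unchanged.

    Hypothetical helper: a video-only example carries no images, so the
    caller can pass images=None instead of images=[].
    """
    return visuals if visuals else None


def build_image_sizes(visuals: Optional[List[Any]]) -> Optional[List[Tuple[int, int]]]:
    """Collect one size tuple per image, or None when there are no images.

    Hypothetical helper: assumes each image has a PIL-style `.size`
    attribute. Returning early avoids indexing into an empty list.
    """
    if not visuals:
        return None
    return [tuple(image.size) for image in visuals]
```

With these guards, a video-only sample yields `None` for both the image input and `image_sizes`, so no code path indexes into an empty list.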

Why

This PR is motivated by two related internvl_hf bugs reported upstream:

#1241, "[Bug] internvl_hf: IndexError when visuals is empty list (video-only inputs)", reports that video-only inputs can crash because generate_until() passes images=[] into the processor and later assumes visuals is non-empty when building image_sizes
#1242, "[Bug] Internvl_hf: num_frames not applied to videos", reports that num_frames is never applied, because InternVL video processing keeps do_sample_frames=False by default unless it is explicitly enabled

Together, these two issues make internvl_hf unreliable on video tasks:
video-only samples can raise an IndexError, and sampled-video runs can still process all frames and overflow the model context length.
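
To illustrate why a disabled sampling flag can overflow the context length, here is a rough sketch of frame selection. The function `sample_frame_indices` is hypothetical, and uniform spacing is an assumption made for illustration; it is not a description of how the InternVL video processor selects frames:

```python
from typing import List


def sample_frame_indices(total_frames: int, num_frames: int, do_sample_frames: bool) -> List[int]:
    """Pick which frame indices to keep (hypothetical illustration).

    When sampling is disabled, every frame is kept and num_frames is
    ignored. When enabled, num_frames evenly spaced indices are kept.
    """
    if not do_sample_frames or num_frames >= total_frames:
        return list(range(total_frames))
    step = total_frames / num_frames
    return [int(i * step) for i in range(num_frames)]


# A 900-frame clip with num_frames=32:
kept_without_sampling = sample_frame_indices(900, 32, do_sample_frames=False)
kept_with_sampling = sample_frame_indices(900, 32, do_sample_frames=True)
```

In this sketch the first call keeps all 900 frames and the second keeps 32, which is the gap described in #1242.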

Closes #1241
Closes #1242

Validation

pre-commit run --files lmms_eval/models/chat/internvl_hf.py
python -m py_compile lmms_eval/models/chat/internvl_hf.py

akawincent · 2026-04-09T07:03:55Z

Hi @Luodian!

This PR needs review and testing.
InternVL is a widely used model, so I think this issue should be resolved as soon as possible.

mwxely

Clean and well-scoped fix. Two minor points to consider:

Implicit behavioral change: Since num_frames defaults to 32 in __init__, setting do_sample_frames=True means frame sampling will now actually take effect by default. Previously num_frames=32 was silently ignored (since do_sample_frames was never set). This is arguably the correct behavior, but worth noting in the PR description as it changes the default video processing pipeline for all internvl_hf users on video tasks.
Validation question: Did you run an end-to-end video eval (e.g., a small subset of VideoMME or similar) to confirm the fix works beyond py_compile and pre-commit? A quick --limit 4 smoke test would be reassuring.

Otherwise LGTM — the visuals = None guard and image_sizes fix are correct and consistent with existing patterns in the file. cc @kcz358
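
The behavioral change raised in the first point can be sketched as follows. This is an illustration under assumptions, not the merged code: `build_video_kwargs` is a hypothetical helper, and only the names `num_frames`, `fps`, `do_sample_frames`, and the default of 32 come from this thread. How the real code handles `num_frames` and `fps` being set together is not covered here.

```python
from typing import Any, Dict, Optional


def build_video_kwargs(num_frames: Optional[int] = 32, fps: Optional[float] = None) -> Dict[str, Any]:
    """Assemble video-processing options (hypothetical helper).

    Sampling is switched on explicitly whenever num_frames or fps is
    configured, so the configured value is no longer silently ignored.
    """
    kwargs: Dict[str, Any] = {}
    if num_frames is not None or fps is not None:
        kwargs["do_sample_frames"] = True
    if num_frames is not None:
        kwargs["num_frames"] = num_frames
    if fps is not None:
        kwargs["fps"] = fps
    return kwargs


default_kwargs = build_video_kwargs()
```

Because `num_frames` defaults to 32, the default call already enables sampling in this sketch, which is why the change affects users who never set the option.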

akawincent · 2026-04-09T08:30:20Z

> Validation question: Did you run an end-to-end video eval (e.g., a small subset of VideoMME or similar) to confirm the fix works beyond py_compile and pre-commit? A quick --limit 4 smoke test would be reassuring.

@mwxely Hello,

Yes, I did. I ran it with --limit to check that the code works, but I did not run a full benchmark. A more rigorous test would be to vary num_frames and observe how the final metrics change.

akawincent added 2 commits March 28, 2026 14:48

fix: enable frame sampling in internvl_hf

fa366a8

fix: handle video-only internvl_hf inputs

24b7ea9

akawincent force-pushed the fix/1241_1242_internvl_hf branch from 1835a30 to 24b7ea9 on March 28, 2026 06:49

mwxely requested review from kcz358 and mwxely April 9, 2026 07:25

mwxely reviewed Apr 9, 2026

kcz358 approved these changes Apr 9, 2026

kcz358 merged commit 037f12b into EvolvingLMMs-Lab:main Apr 9, 2026
3 checks passed
