Skip to content

Add Kinetics700 and Kinetics600 zeroshot classification task#4537

Merged
isaac-chung merged 6 commits intoembeddings-benchmark:mainfrom
deep9539:kinetics700_zeroshot
Apr 30, 2026
Merged

Add Kinetics700 and Kinetics600 zeroshot classification task#4537
isaac-chung merged 6 commits intoembeddings-benchmark:mainfrom
deep9539:kinetics700_zeroshot

Conversation

@deep9539
Copy link
Copy Markdown
Contributor

@deep9539 deep9539 commented Apr 28, 2026

  • I have outlined why this dataset is filling an existing gap in mteb
  • I have tested that the dataset runs with the mteb package.
  • I have run the following models on the task (adding the results to the pr). These can be run using the mteb run -m {model_name} -t {task_name} command.
    • mteb/baseline-random encoder - Partially done, Remaining in progress
    • facebook/pe-av-small-16-frame or another small model (Takes too long, maybe a day)
  • I have checked that the performance is neither trivial (close to perfect scores) nor random.
  • I have considered the size of the dataset and reduced it if it is too big (e.g. 2048 examples for binary classification)
mteb/baseline-random encoder facebook/pe-av-small-16-frame
Kinetics600VAZeroShotClassification 0.00094
Kinetics600VZeroShotClassification 0.002089
Kinetics700VAZeroShotClassification 0.001609
Kinetics700VZeroShotClassification 0.001787

Kinetics600VAZeroShotClassification x baseline
Kinetics600VZeroShotClassification x baseline
Kinetics700VAZeroShotClassification x baseline
Kinetics700VAZeroShotClassification x baseline

@deep9539 deep9539 mentioned this pull request Apr 28, 2026
72 tasks

def get_candidate_labels(self) -> list[str]:
return [
name for name in self.dataset["test"].features[self.label_column_name].names
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I used "video of {label}" for another dataset. Perhaps it makes a difference?

Copy link
Copy Markdown
Contributor Author

@deep9539 deep9539 Apr 28, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Happy to add that. But, I think we can create an issue where we analyze the performance of the Zeroshot classification with and without "video of" prefix, wdyt?

Currently, this is fragmented and not standardized across dataset.

Copy link
Copy Markdown
Member

@Samoed Samoed Apr 29, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We use this in all zero shot tasks and this would a lot of unnecessary compute

Currently, this is fragmented and not standardized across dataset.

What do you mean? I can't find examples where we don't use it, except for your tasks #4538

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for the confusion, done.

Copy link
Copy Markdown
Collaborator

@x-tabdeveloping x-tabdeveloping left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks quite reasonable to me

@Samoed Samoed added new dataset Issues related to adding a new task or dataset video video extension labels Apr 28, 2026
@deep9539 deep9539 force-pushed the kinetics700_zeroshot branch from a3ef504 to c580d53 Compare April 28, 2026 22:50
@deep9539 deep9539 force-pushed the kinetics700_zeroshot branch from c580d53 to 3b01bfb Compare April 30, 2026 02:00
@deep9539
Copy link
Copy Markdown
Contributor Author

Can we merge this if it looks good?

Copy link
Copy Markdown
Contributor

@AdnanElAssadi56 AdnanElAssadi56 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you run make lint?

@deep9539
Copy link
Copy Markdown
Contributor Author

can you run make lint?

Done

Comment thread mteb/tasks/zeroshot_classification/eng/kinetics700.py Outdated
Comment thread mteb/tasks/zeroshot_classification/eng/kinetics600.py Outdated
Comment thread mteb/tasks/zeroshot_classification/eng/kinetics600.py Outdated
Comment thread mteb/tasks/zeroshot_classification/eng/kinetics700.py Outdated
Comment thread mteb/tasks/zeroshot_classification/eng/kinetics700.py Outdated
Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>
@isaac-chung isaac-chung merged commit a944e46 into embeddings-benchmark:main Apr 30, 2026
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

new dataset Issues related to adding a new task or dataset video video extension

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants