You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We release the attentive probing artifacts for our codec-based model across multiple video understanding benchmarks. For each dataset, we provide the codec-derived patch indices, training logs, model checkpoints, and final evaluation results.
326
-
327
-
> 📦 **Artifacts Repository**: All codec evaluation artifacts (codec indices, training logs, and checkpoints) are available on HuggingFace:
- The model weight path is a required positional argument. You can use `lmms-lab-encoder/onevision-encoder-large` to load directly from HuggingFace, or provide a local path to your model checkpoint.
389
-
- The evaluation scripts are configured for 8 GPUs by default. Adjust `CUDA_VISIBLE_DEVICES` in the shell script if you have a different GPU configuration.
390
-
391
337
**Codec-Specific Parameters:**
392
-
-`K_keep`: Number of patches to keep (e.g., 2048 for 2K patches, 4096 for 4K patches)
393
-
-`num_frames`: Total number of frames in the video sequence (typically 64 for codec evaluation)
394
-
-`frames_token_num`: Number of tokens per frame (e.g., 256 tokens)
395
-
-`cache_dir` (optional): Directory for cached codec patches. Use this to specify where codec-selected patches are stored/loaded when you want to persist or reuse them
396
-
397
-
**Using Pre-computed Codec Indices:**
398
-
399
-
To reproduce our exact results using the pre-computed codec indices:
# Note: Adjust --nproc_per_node based on your available GPUs
429
-
```
338
+
-`K_keep`: Number of patches to keep.
339
+
-`cache_dir` (optional): Directory for cached codec patches. Use this to specify where codec-selected patches are stored/loaded when you want to persist or reuse them.
0 commit comments