- For all pretraining and finetuning, we adopt sparse/uniform sampling.
- `#Frame` $=$ `#input_frame` $\times$ `#crop` $\times$ `#clip`
  - `#input_frame` is the number of frames fed to the model per inference.
  - `#crop` is the number of spatial crops (e.g., 3 for left/right/center).
  - `#clip` is the number of temporal clips (e.g., 4 means repeatedly sampling four clips with different start indices).
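
As a rough illustration of the `#Frame` notation, the sketch below parses a setting such as `8x3x4` and computes the total number of frames seen at test time. The sampler `sparse_clip_indices` is a hypothetical helper written for this example, not the repository's data loader: it assumes the video is split into `#input_frame` equal segments and that each clip takes one frame per segment at a different offset.

```python
from typing import List, Tuple


def parse_frame_setting(setting: str) -> Tuple[int, int, int]:
    """Split a '#Frame' string such as '8x3x4' into (input_frame, crop, clip)."""
    input_frame, crop, clip = (int(part) for part in setting.lower().split("x"))
    return input_frame, crop, clip


def total_frames(setting: str) -> int:
    """#Frame = #input_frame x #crop x #clip."""
    input_frame, crop, clip = parse_frame_setting(setting)
    return input_frame * crop * clip


def sparse_clip_indices(num_video_frames: int, input_frame: int, clip: int) -> List[List[int]]:
    """Hypothetical sparse/uniform sampler (an assumption, not the repo's code).

    The video is divided into `input_frame` equal segments; each clip picks one
    frame per segment, using a different offset inside the segment.
    """
    segment = num_video_frames / input_frame
    clips = []
    for k in range(clip):
        # Spread the clip offsets evenly across one segment.
        offset = segment * (k + 0.5) / clip
        indices = [
            min(num_video_frames - 1, int(i * segment + offset))
            for i in range(input_frame)
        ]
        clips.append(indices)
    return clips


if __name__ == "__main__":
    print(total_frames("8x3x4"))
    for indices in sparse_clip_indices(num_video_frames=64, input_frame=8, clip=4):
        print(indices)
```

Under these assumptions, each of the `#clip` index lists would be cropped `#crop` times, so one video yields `#crop` $\times$ `#clip` model inputs of `#input_frame` frames each.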

| Model | Setting | Checkpoint | Shell |
|---|---|---|---|
| | K-Mash-1.1M 300e | 🤗 HF link | run.sh |
| | K-Mash-2M 300e | TBD | run.sh |

| Model | Setting | Teacher | Checkpoint | Shell |
|---|---|---|---|---|
| | K-Mash-1.1M 100e | | 🤗 HF link | run.sh |
| | K-Mash-1.1M 100e | | 🤗 HF link | run.sh |
| | K-Mash-1.1M 100e | | 🤗 HF link | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT | 8x3x4 | 87.6 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 88.1 | TBD | run.sh |
| | K-Mash PT | 8x3x4 | 79.6 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 83.5 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 86.2 | 🤗 HF link | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT + K710 FT | 8x3x4 | 91.3 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 16x3x4 | 91.6 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 91.9 | TBD | run.sh |
| | K-Mash PT + K710 FT | 16x3x4 | 92.1 | TBD | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 85.4 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 88.4 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 90.4 | 🤗 HF link | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT + K710 FT | 8x3x4 | 91.4 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 16x3x4 | 91.6 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 91.7 | TBD | run.sh |
| | K-Mash PT + K710 FT | 16x3x4 | 91.9 | TBD | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 86.0 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 88.9 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 90.6 | 🤗 HF link | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT + K710 FT | 8x3x4 | 85.0 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 16x3x4 | 85.4 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 85.7 | TBD | run.sh |
| | K-Mash PT + K710 FT | 16x3x4 | 85.9 | TBD | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 75.7 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 80.5 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT | 8x3x4 | 83.5 | 🤗 HF link | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT + K710 FT + K400 FT | 8x3x4 | 50.8 | 🤗 HF link | run.sh |
| | K-Mash PT + K710 FT + K400 FT | 8x3x4 | 51.0 | TBD | run.sh |
| | K-Mash PT + K710 FT + K400 FT | 8x3x4 | 51.2 | TBD | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT | 8x3x4 | 68.5 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 69.7 | TBD | run.sh |

| Model | Setting | #Frame | Top-1 | Checkpoint | Shell |
|---|---|---|---|---|---|
| | K-Mash PT | 8x3x4 | 77.1 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 77.5 | TBD | run.sh |
| | K-Mash PT | 8x3x4 | 71.6 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 73.5 | 🤗 HF link | run.sh |
| | K-Mash PT | 8x3x4 | 76.4 | 🤗 HF link | run.sh |

| Model | Setting | #Frame | Top-1 | mAP | Checkpoint | Shell |
|---|---|---|---|---|---|---|
| | K-Mash PT + K710 FT + K400 FT | 8x3x4 | 95.9 | 98.2 | TBD | run.sh |

| Model | Setting | #Frame | Top-1 | mAP | Checkpoint | Shell |
|---|---|---|---|---|---|---|
| | K-Mash PT + K710 FT + K400 FT | 8x3x4 | 97.0 | 98.8 | TBD | run.sh |