File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -73,6 +73,16 @@ The visualization below demonstrates our complete video processing pipeline. The
73732 . ** Avoid direct supervision from existing models** - Indirect usage is preferred over direct distillation, which may limit scaling capabilities
74743 . ** Progressive training when resources are limited** - Start with low resolution/frame rate, then gradually fine-tune to higher settings (ref: CLIPA)
7575
76+ ### Training Loss
77+
78+ <p align =" center " >
79+ <picture >
80+ <source media="(prefers-color-scheme: dark)" srcset="https://raw.githubusercontent.com/anxiangsir/asset/main/OneVision/loss_github_dark.gif">
81+ <source media="(prefers-color-scheme: light)" srcset="https://raw.githubusercontent.com/anxiangsir/asset/main/OneVision/loss_github_light.gif">
82+ <img alt="Training Loss Visualization" src="https://raw.githubusercontent.com/anxiangsir/asset/main/OneVision/loss_github_light.gif" width="800" style="max-width: 100%;">
83+ </picture >
84+ </p >
85+
7686---
7787
7888### Attentive Probe Results
You can’t perform that action at this time.
0 commit comments