README.md: 10 additions & 4 deletions
@@ -104,15 +104,17 @@ Standard contrastive learning methods (e.g., CLIP) are fundamentally constrained
---
-## 🎯 Codec Style Patch Selection
+## 🎬 Codec Style Patch Selection
OneVision Encoder implements a codec-inspired patch selection mechanism that identifies and processes only the most informative patches from video frames. The approach draws on HEVC (High Efficiency Video Coding) and makes video understanding more efficient by focusing computation on temporally salient regions.
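
As a rough illustration of the idea only, and not the repository's implementation, the sketch below scores each patch by how much it changes between consecutive frames and keeps the top few. Plain frame differencing stands in here for the motion and residual signals a real codec such as HEVC would provide; the names `patch_saliency` and `select_patches`, the grayscale list-of-rows frame layout, and the `patch` and `keep` parameters are all hypothetical.

```python
from typing import List, Tuple

Frame = List[List[float]]  # assumed layout: grayscale frame as rows of pixel values


def patch_saliency(prev: Frame, curr: Frame, patch: int) -> List[Tuple[float, int, int]]:
    """Score each non-overlapping patch by its mean absolute change from `prev`."""
    height, width = len(curr), len(curr[0])
    scores = []
    for top in range(0, height - patch + 1, patch):
        for left in range(0, width - patch + 1, patch):
            total = 0.0
            for y in range(top, top + patch):
                for x in range(left, left + patch):
                    total += abs(curr[y][x] - prev[y][x])
            # Store (score, grid row, grid column) for this patch.
            scores.append((total / (patch * patch), top // patch, left // patch))
    return scores


def select_patches(frames: List[Frame], patch: int, keep: int) -> List[List[Tuple[int, int]]]:
    """For each frame after the first, return grid indices of the `keep` most-changed patches."""
    selected = []
    for prev, curr in zip(frames, frames[1:]):
        scores = patch_saliency(prev, curr, patch)
        # Highest score first; ties fall back to grid position so output is deterministic.
        ranked = sorted(scores, key=lambda s: (-s[0], s[1], s[2]))
        selected.append(sorted((row, col) for _, row, col in ranked[:keep]))
    return selected
```

In this sketch the first frame has no predecessor and is skipped; a codec-style pipeline would more plausibly keep all of its patches, much like an intra-coded frame, and apply selection only to later frames.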
### Implementation in `llava_next`
The codec style patch selection is implemented across several key components in the [`llava_next`](llava_next) directory: