Commit ea33350

Update model_card.md
1 parent ec6a0cc commit ea33350

1 file changed

Lines changed: 1 addition & 1 deletion

docs/model_card.md
@@ -25,11 +25,11 @@
 
 ## Key Features
 
+- **Unified Vision Foundation**: A single base model for consistent understanding of images, videos, and OCR.
 - **Codec-Style Patch Selection**: Instead of sampling sparse frames densely (all patches from few frames), OneVision Encoder samples dense frames sparsely (important patches from many frames).
 - **3D Rotary Position Embedding**: Uses a 4:6:6 split for temporal, height, and width dimensions to capture spatiotemporal relationships.
 - **Global Contrastive Learning**: Trained with a 2M concept bank for better-separated semantic clusters.
 - **Native Resolution Support**: Supports native resolution input without tiling or cropping.
-- **Flash Attention 2**: Efficient attention implementation for improved performance and memory efficiency.
 
 ## Unified Input Processing
 
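
The 4:6:6 split in the **3D Rotary Position Embedding** bullet can be illustrated with a rough sketch. The model card does not say how the ratio is applied, so this assumes it divides the rotary frequency pairs of each attention head across the temporal, height, and width axes, with a standard RoPE frequency schedule per axis. The function names, the `base` value, and the `head_dim` of 64 are hypothetical and not taken from the OneVision Encoder code.

```python
def split_rotary_dims(head_dim, ratio=(4, 6, 6)):
    """Split a head's channels across (temporal, height, width) by ratio.

    Assumption: the ratio applies to rotary frequency pairs, so each
    axis receives an even number of channels.
    """
    if head_dim % 2 != 0:
        raise ValueError("head_dim must be even for rotary embeddings")
    pairs = head_dim // 2
    total = sum(ratio)
    if pairs % total != 0:
        raise ValueError("rotary pairs must be divisible by the ratio sum")
    unit = pairs // total
    return tuple(2 * unit * r for r in ratio)


def axis_angles(pos, dim, base=10000.0):
    """Standard RoPE angles for one axis: pos * base^(-2i/dim)."""
    return [pos * base ** (-2.0 * i / dim) for i in range(dim // 2)]


def rope_angles_3d(t, h, w, head_dim, ratio=(4, 6, 6)):
    """Concatenate per-axis rotation angles for a patch at (t, h, w)."""
    dim_t, dim_h, dim_w = split_rotary_dims(head_dim, ratio)
    return axis_angles(t, dim_t) + axis_angles(h, dim_h) + axis_angles(w, dim_w)


print(split_rotary_dims(64))  # → (16, 24, 24)
```

Under these assumptions, a 64-channel head spends 16 channels on time and 24 each on height and width, so a patch's rotation depends on its frame index as well as its spatial location.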
