README.md: 55 additions & 0 deletions
@@ -321,6 +321,61 @@ Training configurations and hyperparameters will be documented soon. For now, pl
## 📊 Evaluation

### LLaVA-NeXT Evaluation

To evaluate the OneVision Encoder as a vision backbone for LLaVA-NeXT multimodal models, we use the lmms-eval framework with various vision-language benchmarks.

#### Setup

Navigate to the llava_next directory and follow the setup instructions:

<details>
<summary>Click to expand LLaVA-NeXT evaluation setup</summary>

```bash
cd llava_next

# Using Docker (recommended)
docker build -t ov_encoder_llava:26.01 .
docker run -it --gpus all --ipc host --net host --privileged \
    -v "$(pwd)":/workspace/OV-Encoder-Llava \
    -w /workspace/OV-Encoder-Llava \
    ov_encoder_llava:26.01 bash
```
</details>
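
Before building the image, it can help to confirm that the working directory matches what the setup commands expect. The following is a minimal sketch, not part of the repository: `has_dockerfile` and `preflight` are hypothetical helper names, and the sketch assumes the `llava_next` directory contains a `Dockerfile`, the default file that `docker build .` looks for.

```bash
# Hypothetical pre-flight helpers (illustrative names, not from the repository).

# Succeeds when the given directory (default: current) contains a Dockerfile.
has_dockerfile() {
  [ -f "${1:-.}/Dockerfile" ]
}

# Checks the build context and the docker CLI, reporting the first problem found.
preflight() {
  if ! has_dockerfile "${1:-.}"; then
    echo "missing: ${1:-.}/Dockerfile" >&2
    return 1
  fi
  if ! command -v docker >/dev/null 2>&1; then
    echo "missing: docker is not on PATH" >&2
    return 1
  fi
  echo "ok: ready to run docker build in ${1:-.}"
}
```

Under these assumptions, calling `preflight llava_next` from the repository root reports the first missing prerequisite. The sketch does not check GPU access inside the container, which the `--gpus all` flag additionally depends on.
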

#### Running Evaluation

For image benchmarks (ChartQA, DocVQA, AI2D, OCRBench, etc.):

<details>
<summary>Click to expand evaluation commands</summary>