Add visible_indices example output and Codec Input TODO section

Copilot · anxiangsir · Copilot · commit d6db0a57620a · 2025-12-24T17:16:57.000Z
Co-authored-by: anxiangsir <31175974+anxiangsir@users.noreply.github.com>
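
Note on the example values: the index ranges in the added comment can be checked without a GPU. The sketch below is a pure-Python stand-in, not the README code itself. It assumes `num_frames=16`, `target_frames=64`, and `frame_tokens=256` (inferred from the comment, where the last frame is Frame 15 at pos=63), and it uses integer floor division in place of `torch.linspace(...).long()`, which may differ for sampled positions that fall very close to an integer boundary. The helper name `build_visible_indices` is hypothetical.

```python
def build_visible_indices(num_frames, target_frames, frame_tokens):
    """Return one list of token indices per sampled frame (illustrative only)."""
    rows = []
    for i in range(num_frames):
        if num_frames == 1:
            pos = 0  # a single sample sits at the start of the range
        else:
            # Integer stand-in for torch.linspace(0, target_frames - 1, num_frames).long()
            pos = i * (target_frames - 1) // (num_frames - 1)
        # Each sampled frame owns a contiguous block of frame_tokens indices
        start = pos * frame_tokens
        rows.append(list(range(start, start + frame_tokens)))
    return rows


# Assumed values, inferred from the example comment in the diff
rows = build_visible_indices(num_frames=16, target_frames=64, frame_tokens=256)

# Print the frames that the comment lists explicitly
for i in (0, 1, 2, 15):
    print(f"Frame {i}: indices [{rows[i][0]}, ..., {rows[i][-1]}]")
```

Flattening `rows` into a single sequence gives the same layout as the `.reshape(1, -1)` call in the README snippet, minus the leading batch dimension.
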
diff --git a/README.md b/README.md
@@ -203,11 +203,21 @@ video = torch.randn(1, 3, num_frames, 224, 224).to("cuda")
 # Build visible_indices for temporal sampling
 frame_pos = torch.linspace(0, target_frames - 1, num_frames).long().cuda()
 visible_indices = (frame_pos.unsqueeze(-1) * frame_tokens + torch.arange(frame_tokens).cuda()).reshape(1, -1)
+# visible_indices example (num_frames=16, target_frames=64, 256 tokens per frame); each frame's block starts at pos * 256:
+#   Frame 0 (pos=0):  indices [0, 1, 2, ..., 255]
+#   Frame 1 (pos=4):  indices [1024, 1025, 1026, ..., 1279]
+#   Frame 2 (pos=8):  indices [2048, 2049, 2050, ..., 2303]
+#   ...
+#   Frame 15 (pos=63): indices [16128, 16129, ..., 16383]
 
 with torch.no_grad():
     outputs = model(video, visible_indices=visible_indices)
 ```
 
+### Codec Input
+
+> **TODO:** Add codec-style input documentation for temporal saliency-based patch selection.
+
 ---
 
 ## 🚀 Training