You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+17-25Lines changed: 17 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -64,11 +64,15 @@ Coupled with global contrastive learning over a 2M-scale concept memory bank, On
64
64
65
65
### Video Processing Pipeline
66
66
67
-
The visualization below illustrates four different video processing pipelines.
68
-
(1) **Original Video**: a continuous 64-frame sequence that preserves the complete temporal context.
69
-
(2) **Uniform Frame Sampling**: a conventional strategy that selects 4–8 evenly spaced frames; while simple and efficient, it is inherently lossy and fails to capture fine-grained inter-frame motion.
70
-
(3) **Temporal Saliency Detection**: a global analysis of all 64 frames to identify regions rich in temporal information, including motion patterns, appearance variations, and semantic events.
71
-
(4) **Codec-Style Patch Extraction**: selective extraction of the temporally salient patches in a zigzag order, achieving 75–98% compression while retaining critical temporal dynamics.
67
+
The visualization below illustrates four different video processing pipelines.
68
+
69
+
**1. Original Video**: a continuous 64-frame sequence that preserves the complete temporal context.
70
+
71
+
**2. Uniform Frame Sampling**: a conventional strategy that selects 4–8 evenly spaced frames; while simple and efficient, it is inherently lossy and fails to capture fine-grained inter-frame motion.
72
+
73
+
**3. Temporal Saliency Detection**: a global analysis of all 64 frames to identify regions rich in temporal information, including motion patterns, appearance variations, and semantic events.
74
+
75
+
**4. Codec-Style Patch Extraction**: selective extraction of the temporally salient patches in a zigzag order, achieving 75–98% compression while retaining critical temporal dynamics.
-`cache_dir`: Directory for cached codec patches. This is where the codec-selected patches will be stored/loaded.
331
-
-`K_keep`: Number of patches to keep. For example, 256 patches per frame × 8 frames = 2048 total patches. Adjust based on your frame count and desired compression ratio.
0 commit comments