You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1. Encode throughput: block TQ vs. current TQ at d=128, 768, 1024
545
550
2. Decode throughput: same dimensions
546
551
3. Quantized cosine similarity throughput: block vs. current
547
552
4. L2 norm readthrough latency: O(k) block norms vs. O(1) current
548
553
549
-
Stage 2 should benchmark:
550
-
5. PDX scan throughput vs. row-major scan at d=768, 1024
551
-
6. Full decompression from PDX layout (includes un-transpose overhead)
554
+
Stage 2 should benchmark: 5. PDX scan throughput vs. row-major scan at d=768, 1024 6. Full decompression from PDX layout (includes un-transpose overhead)
552
555
553
556
## Phasing
554
557
@@ -580,6 +583,7 @@ The current TurboQuant test suite validates specific behaviors that will change:
580
583
more children to manage.
581
584
582
585
New tests needed:
586
+
583
587
- SRHT quality at d=64: coordinate distribution vs. analytical Beta at 3, 4, 5 rounds
584
588
- Practical MSE comparison: d=64 blocks vs. d=768 single-rotation at same bit width
585
589
- Straggler block handling: dense rotation, separate centroids
@@ -590,15 +594,14 @@ New tests needed:
590
594
## References
591
595
592
596
[1] Zandieh, A., Daliri, M., Hadian, M. and Mirrokni, V. "TurboQuant: Online
593
-
Vector Quantization with Near-optimal Distortion Rate." arXiv:2504.19874,
594
-
April 2025.
597
+
Vector Quantization with Near-optimal Distortion Rate." arXiv:2504.19874,
598
+
April 2025.
595
599
596
600
[2] Ailon, N. and Chazelle, B. "The Fast Johnson-Lindenstrauss Transform and
597
-
Approximate Nearest Neighbors." SIAM Journal on Computing, 39(1):302-322,
598
-
2009.
601
+
Approximate Nearest Neighbors." SIAM Journal on Computing, 39(1):302-322, 2009.
599
602
600
603
[3] Tropp, J.A. "Improved Analysis of the Subsampled Randomized Hadamard
601
-
Transform." Advances in Adaptive Data Analysis, 3(1-2):115-126, 2011.
604
+
Transform." Advances in Adaptive Data Analysis, 3(1-2):115-126, 2011.
602
605
603
606
[4] Kuffo, L., Krippner, E. and Boncz, P. "PDX: A Data Layout for Vector
604
-
Similarity Search." Proceedings of SIGMOD '25. arXiv:2503.04422, March 2025.
607
+
Similarity Search." Proceedings of SIGMOD '25. arXiv:2503.04422, March 2025.
0 commit comments