Skip to content

Commit fc7619e

Browse files
kaix-nvclaude
andcommitted
Add WaterSIC KV-cache quantization plot example
Self-contained script that reproduces the KV-cache quantization plots comparing RTN, GPTQ, and WaterSIC methods across temperature, power-law, retrieval, sink, and conditioning sweeps. Imports WaterSIC from the ported zsic module and inlines baseline quantization utilities. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Kai Xu <kaix@nvidia.com>
1 parent 296ac08 commit fc7619e

File tree

1 file changed

+1176
-0
lines changed

1 file changed

+1176
-0
lines changed

0 commit comments

Comments
 (0)