Skip to content

Ring-buffer KV cache, chunked prefill, INT8 embedding, and cleanup

9108a5b
Select commit
Loading
Failed to load commit list.
Open

Add Gemma 4 31B-IT model, export, and quantization framework for ExecuTorch #19213

Ring-buffer KV cache, chunked prefill, INT8 embedding, and cleanup
9108a5b
Select commit
Loading
Failed to load commit list.

Select a check to view from the sidebar