Skip to content

Commit 46c784f

Browse files
authored
Update tokenizers submodule to the linear-time HF encode path (pytorch#20472)
Bump extension/llm/tokenizers, picking up the linear-time HFTokenizer encode work (merge_all O(n log n), ReplaceNormalizer::normalize O(N) single forward pass) plus a targeted encode-latency benchmark. This cuts long-prompt prefill tokenization time in the gemma4 / eagle3 runners; token ids and greedy output are unchanged, verified e2e on the gemma4-31B target (identical 18-token encode + decode after the bump).
1 parent 68bb668 commit 46c784f

1 file changed

Lines changed: 1 addition & 1 deletion