Update tokenizers submodule to the linear-time HF encode path

digantdesai · digantdesai · commit dfd3d2e4c006 · 2026-06-24T13:39:35.000-07:00
Bump extension/llm/tokenizers, picking up the linear-time HFTokenizer encode work (merge_all O(n log n), ReplaceNormalizer::normalize O(N) single forward pass) plus a targeted encode-latency benchmark. This cuts long-prompt prefill tokenization time in the gemma4 / eagle3 runners; token ids and greedy output are unchanged, verified e2e on the gemma4-31B target (identical 18-token encode + decode after the bump). ghstack-source-id: 525dc35 ghstack-comment-id: 4734208425 Pull-Request: #20349
diff --git a/extension/llm/tokenizers b/extension/llm/tokenizers
@@ -1 +1 @@
-Subproject commit 3f98e9903e4e9972e5371522d1b64bc7793c250b
+Subproject commit c17438e18855ffb31cf32ccf96c13c1483273df6