You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix ESM2 tokenizer export: patch tokenizer_config.json to use PreTrainedTokenizerFast
In transformers 5.x, AutoTokenizer serializes the class name as
"TokenizersBackend" which is not resolvable by AutoTokenizer.from_pretrained().
Patch the saved tokenizer_config.json after save_pretrained() to force
tokenizer_class="PreTrainedTokenizerFast" and remove non-standard fields.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: svc-bionemo <267129667+svc-bionemo@users.noreply.github.com>
0 commit comments