Commit 02bc46d
fix(vllm_inference): use local tokenizer for nvidia Hub reference model

The nvidia/esm2_t6_8M_UR50D Hub tokenizer_config.json references
TokenizersBackend which was removed in transformers 5.x, causing
AutoTokenizer.from_pretrained() to raise ValueError.

Load the reference model's tokenizer from the local esm_fast_tokenizer
directory (PreTrainedTokenizerFast) instead of from the Hub config.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Peter St. John <pstjohn@nvidia.com>
Signed-off-by: svc-bionemo <267129667+svc-bionemo@users.noreply.github.com>

1 parent 853a54b commit 02bc46d
1 file changed
Lines changed: 7 additions & 3 deletions
@@ -61,12 +61,14 @@ (old line 64 replaced by new line 64; old line 69 replaced by new lines 69-71)
@@ -133,7 +135,9 @@ (old line 136 replaced by new lines 138-140)
[diff line contents not preserved]
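As a rough illustration of the approach the commit message describes, the sketch below loads a tokenizer from a local directory with `PreTrainedTokenizerFast` rather than resolving it through the Hub's `tokenizer_config.json`. The function name `load_reference_tokenizer`, the constant `LOCAL_TOKENIZER_DIR`, and the relative path are assumptions made for illustration; the actual code in the changed file may be organized differently.

```python
from pathlib import Path

# Assumed location of the local tokenizer files; the real path in the
# repository may differ.
LOCAL_TOKENIZER_DIR = Path("esm_fast_tokenizer")


def load_reference_tokenizer(tokenizer_dir=LOCAL_TOKENIZER_DIR):
    """Load the reference model's tokenizer from a local directory.

    Hypothetical helper: it avoids AutoTokenizer.from_pretrained() on the
    Hub repo, whose tokenizer_config.json names a tokenizer class that the
    installed transformers version may not be able to resolve.
    """
    tokenizer_dir = Path(tokenizer_dir)
    if not tokenizer_dir.is_dir():
        raise FileNotFoundError(f"Tokenizer directory not found: {tokenizer_dir}")

    # Imported lazily so the path check above works without transformers.
    from transformers import PreTrainedTokenizerFast

    # Naming the class explicitly bypasses AutoTokenizer's lookup of the
    # tokenizer class recorded in the Hub config.
    return PreTrainedTokenizerFast.from_pretrained(str(tokenizer_dir))


# Example usage (requires transformers and the local tokenizer files):
# tokenizer = load_reference_tokenizer()
# encoded = tokenizer("MKTAYIAKQR")
```

Checking for the directory up front gives a clear error when the tokenizer files are missing, instead of a less specific failure from inside the library.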