Fix load_compress_model retry using bitwise ~ on bool instead of logical not #3857
Bug
`fastchat/model/compression.py:load_compress_model` retries tokenizer construction when the first call raises `TypeError` (the "`use_fast=True` is not supported for some models" case called out in the comment). The retry is meant to flip `use_fast` to the opposite value, but it uses the bitwise NOT:
```python
try:
    tokenizer = AutoTokenizer.from_pretrained(
        model_path, use_fast=use_fast, revision=revision, trust_remote_code=True
    )
except TypeError:
    tokenizer = AutoTokenizer.from_pretrained(
        model_path, use_fast=~use_fast, revision=revision, trust_remote_code=True
    )
```
On a Python `bool`, `~` performs integer bitwise NOT, not logical negation: `~True` evaluates to `-2` and `~False` to `-1`. Both are truthy integers. So when the fast path raised `TypeError` (typically because the model's slow tokenizer is the only one available), the fallback still passes a truthy `use_fast=-2` / `use_fast=-1`, which either triggers the same `TypeError` again or silently builds the fast tokenizer, defeating the retry.
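A quick interpreter check makes the failure mode concrete (plain Python, no dependencies):

```python
# bool is a subclass of int, so ~ applies two's-complement bitwise NOT
# to the underlying integer instead of negating the truth value.
print(~True)    # -2 (bitwise NOT of 1)
print(~False)   # -1 (bitwise NOT of 0)

# Both results are non-zero integers, hence truthy:
print(bool(~True), bool(~False))   # True True

# `not` is the logical negation that the retry actually needs:
print(not True, not False)         # False True
```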
Root cause
`~` is the bitwise-not operator; the logical negation of a bool in Python is `not`. The comment directly above the `try` spells out the intent ("`use_fast=True` is not supported for some models"), which is the classic "try fast, fall back to slow" pattern already used verbatim in `fastchat/model/model_adapter.py` — where the equivalent `TypeError` handler uses `use_fast=False`.
Why the fix is correct
Changing `~use_fast` to `not use_fast` makes the fallback actually flip `True → False` / `False → True`, so a slow-tokenizer-only model now loads on retry instead of re-hitting the same failure. Behavior on the success path of the first `from_pretrained` is unchanged; this only affects the one line inside the `except TypeError` branch.
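A self-contained sketch of the corrected retry logic, using a hypothetical `fake_from_pretrained` stub in place of `AutoTokenizer.from_pretrained` (the stub's behavior is an assumption for illustration, modeling a model whose fast tokenizer raises `TypeError`):

```python
def fake_from_pretrained(use_fast):
    """Stub standing in for AutoTokenizer.from_pretrained: this model
    only ships a slow tokenizer, so use_fast=True raises TypeError."""
    if use_fast is True:
        raise TypeError("use_fast=True is not supported for this model")
    if use_fast is not False:
        # The old ~use_fast bug would land here with use_fast=-2.
        raise TypeError(f"unexpected use_fast={use_fast!r}")
    return "slow-tokenizer"

use_fast = True
try:
    tok = fake_from_pretrained(use_fast=use_fast)
except TypeError:
    # `not` flips True -> False, so the retry loads the slow tokenizer.
    tok = fake_from_pretrained(use_fast=not use_fast)

print(tok)  # slow-tokenizer
```

With `~use_fast` the stub would raise `TypeError` again on the retry, reproducing the reported failure; `not use_fast` is what makes the fallback succeed.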