|
| 1 | +## v2.0.0b32 (2026-04-17) |
| 2 | + |
| 3 | + |
| 4 | +- Merge pull request #217 from OpenMOSS/dev |
| 5 | +- Release: refactor TransformerLensLanguageModel to inherit from HookedTransformer |
| 6 | +- fix(backend): cast encoding['input_ids'] to Tensor for basedpyright |
| 7 | +- fix(server): basedpyright/ruff errors from backend refactor |
| 8 | +  - Add `TokenizerOnlyLanguageModel` to `__all__` so ruff sees the export. |
| 9 | +  - Assert tokenizer is not None before chat-template paths in circuit |
| 10 | +    preview + generate (`model.tokenizer` is Optional on the new API). |
| 11 | +  - Cast `apply_chat_template` result to `str` (`tokenize=False` is known by |
| 12 | +    us but not inferable from the overload union). |
| 13 | +- fix(server): offload sync @distributed calls to thread in host-execution mode |
| 14 | +- Under num_workers=0, the @distributed wrapper ran the target function |
| 15 | +synchronously on the asyncio event loop. Long-running torch work (circuit |
| 16 | +attribution, model forwards) blocked the loop, so parallel frontend |
| 17 | +requests (progress polling, sub-resource loads) hung until the heavy call |
| 18 | +finished. Route sync functions through asyncio.to_thread so the loop |
| 19 | +stays responsive; async functions are awaited directly. |
| 20 | +- refactor(backend): TransformerLensLanguageModel via multiple inheritance |
| 21 | +- Switch from composition (`self.model = HookedTransformer(...)`) to |
| 22 | +multiple inheritance (`class TransformerLensLanguageModel(HookedTransformer, |
| 23 | +LanguageModel)`), exposing the full TL API directly without per-method proxies. |
| 24 | +  - Rename our `self.cfg` → `self.lm_cfg` to avoid clashing with |
| 25 | +    HookedTransformerConfig. |
| 26 | +  - Drop `use_flash_attn`, `load_ckpt`, `tokenizer_only` config fields. |
| 27 | +  - Replace the `tokenizer_only=True` flag with a dedicated |
| 28 | +    `TokenizerOnlyLanguageModel` backend (`backend="tokenizer_only"`). |
| 29 | +  - Add `from_hooked_transformer` classmethod to upgrade bare HookedTransformer |
| 30 | +    instances via zero-copy `__class__` swap. |
| 31 | +  - Update all call sites (`model.model.blocks` → `model.blocks`, etc.) across |
| 32 | +    circuits, initializer, analysis, server, and CLI. |
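The multiple-inheritance refactor and the zero-copy `__class__` swap described in this entry can be illustrated with a minimal sketch. It uses plain stand-in classes instead of the real `HookedTransformer` and `LanguageModel`; every name below (`BaseModel`, `LanguageModelMixin`, `UpgradedModel`, `from_base`) is hypothetical and not taken from the project. The actual `from_hooked_transformer` may differ in signature and in the state it initializes.

```python
class BaseModel:
    """Stand-in for HookedTransformer (hypothetical, for illustration only)."""

    def __init__(self, n_layers):
        self.cfg = {"n_layers": n_layers}  # the base class owns `cfg`
        self.blocks = [f"block{i}" for i in range(n_layers)]


class LanguageModelMixin:
    """Stand-in for the project's LanguageModel interface (hypothetical)."""

    lm_cfg = None  # kept separate from `cfg` to avoid a name clash

    def describe(self):
        return f"{len(self.blocks)} blocks, backend={self.lm_cfg['backend']}"


class UpgradedModel(BaseModel, LanguageModelMixin):
    """Multiple inheritance: base attributes are reachable without proxies."""

    @classmethod
    def from_base(cls, model, lm_cfg):
        # Re-label the existing object instead of copying its state.
        # Note that the subclass __init__ is NOT run, so any extra
        # attributes must be set explicitly here.
        model.__class__ = cls
        model.lm_cfg = lm_cfg
        return model


bare = BaseModel(n_layers=2)
upgraded = UpgradedModel.from_base(bare, {"backend": "demo"})
print(upgraded is bare, upgraded.describe())
```

The swap keeps the object identity and all existing attributes, which is why call sites can move from `model.model.blocks` to `model.blocks` without a wrapper layer.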
| 33 | + |
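The host-execution fix in this release (routing sync functions through `asyncio.to_thread` while awaiting async functions directly) can be sketched as follows. This is not the project's `@distributed` wrapper; `run_on_host`, `heavy_sync_work`, and `light_async_work` are hypothetical names, and `time.sleep` stands in for blocking torch work. It assumes Python 3.9 or later, where `asyncio.to_thread` is available.

```python
import asyncio
import inspect
import time


async def run_on_host(fn, *args, **kwargs):
    """Hypothetical dispatcher: await coroutines, offload sync calls to a thread."""
    if inspect.iscoroutinefunction(fn):
        return await fn(*args, **kwargs)
    # Running the sync function in a worker thread keeps the event loop free.
    return await asyncio.to_thread(fn, *args, **kwargs)


def heavy_sync_work(x):
    time.sleep(0.2)  # stand-in for a long, blocking model forward pass
    return x * 2


async def light_async_work(x):
    return x + 1


async def demo():
    ticks = 0

    async def poll():
        # Stand-in for frontend progress polling running on the same loop.
        nonlocal ticks
        while True:
            ticks += 1
            await asyncio.sleep(0.01)

    poller = asyncio.create_task(poll())
    doubled = await run_on_host(heavy_sync_work, 21)
    incremented = await run_on_host(light_async_work, 1)
    poller.cancel()
    try:
        await poller
    except asyncio.CancelledError:
        pass
    return doubled, incremented, ticks


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

If `heavy_sync_work` were called directly inside the coroutine, the poller would not advance until it returned; with the thread offload, `ticks` keeps increasing while the blocking call runs.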
1 | 34 | ## v2.0.0b31 (2026-04-17) |
2 | 35 |
|
3 | 36 |