feat: automatic checkpoints for models that need it #573

Merged
giladgd merged 24 commits into master from gilad/qwen3.5
Mar 15, 2026

Member

@giladgd giladgd commented Mar 6, 2026

Description of change

  • feat: automatic checkpoints for models that need it (such as Qwen 3.5 due to its hybrid architecture)
  • feat(QwenChatWrapper): Qwen 3.5 support
  • feat(inspect gpu command): detect and report missing prebuilt binary modules and custom npm registry
  • feat: initial disk cache dir option for future optimizations (disabled for now)
  • fix: Qwen 3.5 memory estimation
  • fix: grammar use with HarmonyChatWrapper
  • fix: add mistral think segment detection
  • fix: compress excessively long segments from the current response on context shift instead of throwing an error
  • fix: default thinking budget to 75% of the context size to prevent low-quality responses
  • fix: bugs
  • test: Qwen 3.5 checkpoint use
  • chore: update modules

Fixes #571
Fixes #575
Fixes #577
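
To make the thinking-budget item above concrete, here is a minimal sketch of the arithmetic, assuming the default is simply 75% of the context size rounded down. The helper name `defaultThoughtBudget`, the constant name, and the rounding behavior are all hypothetical illustrations, not the library's actual API or implementation.

```typescript
// Hypothetical constant: the 75% ratio comes from the change description above.
const DEFAULT_THOUGHT_BUDGET_RATIO = 0.75;

// Hypothetical helper (not part of the library): derives a default
// thinking-token budget from the context size, in tokens.
function defaultThoughtBudget(
    contextSize: number,
    ratio: number = DEFAULT_THOUGHT_BUDGET_RATIO
): number {
    if (!Number.isFinite(contextSize) || contextSize < 0)
        throw new RangeError("contextSize must be a non-negative finite number");

    // Rounding down is an assumption; the real rounding rule is not stated.
    return Math.floor(contextSize * ratio);
}

console.log(defaultThoughtBudget(4096)); // 3072
console.log(defaultThoughtBudget(8192)); // 6144
```

Under this assumption, a 4096-token context leaves 1024 tokens for everything other than thinking, which is presumably the headroom the fix is meant to guarantee.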

Pull-Request Checklist

  • Code is up-to-date with the master branch
  • npm run format to apply eslint formatting
  • npm run test passes with this change
  • This pull request links relevant issues as Fixes #0000
  • There are new or updated unit tests validating the change
  • Documentation has been updated to reflect this change
  • The new commits and pull request title follow conventions explained in pull request guidelines (PRs that do not follow this convention will not be merged)

@giladgd giladgd requested a review from ido-pluto March 6, 2026 06:06
@giladgd giladgd self-assigned this Mar 6, 2026
@giladgd giladgd changed the title from "fix: Qwen 3.5 suuport" to "fix: Qwen 3.5 support" Mar 8, 2026
@giladgd giladgd changed the title from "fix: Qwen 3.5 support" to "fix: initial Qwen 3.5 support" Mar 8, 2026
@giladgd giladgd changed the title from "fix: initial Qwen 3.5 support" to "feat: automatic checkpoints for models that need it" Mar 12, 2026
Contributor

@ido-pluto ido-pluto left a comment

LGTM

@giladgd giladgd enabled auto-merge (squash) March 12, 2026 11:34
@giladgd giladgd disabled auto-merge March 13, 2026 21:09
@giladgd giladgd enabled auto-merge (squash) March 13, 2026 21:09
@giladgd giladgd disabled auto-merge March 14, 2026 21:07
@giladgd giladgd enabled auto-merge (squash) March 14, 2026 21:07
@giladgd giladgd merged commit c641959 into master Mar 15, 2026
19 checks passed
@giladgd giladgd deleted the gilad/qwen3.5 branch March 15, 2026 01:00
@github-actions

🎉 This PR is included in version 3.18.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

2 participants