feat(aihubmix): add Qwen3.7 Max, Gemini 3.5 Flash, Grok 4.3 models and improve Anthropic support #3179
Merged
crazywoola merged 7 commits on May 25, 2026
Code Review
This pull request adds support for several new models, including Qwen 3.7 Max, Gemini 3.5 Flash, and Grok 4.3, and updates the Anthropic model implementation. Feedback identifies a potential bug in the Anthropic integration where a high default max_tokens value could cause API errors. For the Gemini 3.5 Flash configuration, the reviewer noted a missing response_format parameter needed for structured outputs and a discrepancy between the defined default value and the help text for the thinking_level parameter.
- Remove hardcoded max_tokens=128000 fallback in anthropic.py
- Set claude-haiku-4-5 max_tokens default/max to 64000
- Set claude-opus-4-5 max_tokens default/max to 64000
- Set claude-opus-4-7 max_tokens default/max to 128000
- Change gemini-3.5-flash thinking level default to Medium
crazywoola approved these changes on May 25, 2026
ous50 pushed a commit to ous50/dify-official-plugins that referenced this pull request on May 25, 2026
…d improve Anthropic support (langgenius#3179)
* feat(aihubmix): add Gemini 3.5 Flash and Grok 4.3 model support
* feat(aihubmix): add default max_tokens parameter for Anthropic models
* feat(aihubmix): add Qwen3.7 Max model support
* chore(aihubmix): bump version to 0.0.29
* fix(aihubmix): update max_tokens defaults and remove hardcoded fallback
  - Remove hardcoded max_tokens=128000 fallback in anthropic.py
  - Set claude-haiku-4-5 max_tokens default/max to 64000
  - Set claude-opus-4-5 max_tokens default/max to 64000
  - Set claude-opus-4-7 max_tokens default/max to 128000
  - Change gemini-3.5-flash thinking level default to Medium
* refactor(aihubmix): use exact model name matching for max_tokens defaults
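The last two commits describe per-model max_tokens defaults resolved by exact model name instead of a single hardcoded fallback. A minimal sketch of that idea follows. It is not the code from anthropic.py, which is not shown here: the names MAX_TOKENS_DEFAULTS and resolve_max_tokens are hypothetical, the three values are the ones listed in the commit message, and returning None for an unlisted model is an assumption about how the caller would want to handle that case.

```python
from typing import Optional

# Hypothetical lookup table. The values come from the commit message above;
# the table itself is an illustration, not the plugin's actual data structure.
MAX_TOKENS_DEFAULTS: dict[str, int] = {
    "claude-haiku-4-5": 64000,
    "claude-opus-4-5": 64000,
    "claude-opus-4-7": 128000,
}


def resolve_max_tokens(model: str, requested: Optional[int] = None) -> Optional[int]:
    """Return the max_tokens value to send for `model`.

    An explicit user value always wins. Otherwise the default is looked up
    by exact model name, so a name that merely starts with a known model
    (for example "claude-opus-4-7-preview") does not inherit its default.
    Returns None when nothing applies (assumption: the caller decides what
    to do then, instead of silently falling back to 128000).
    """
    if requested is not None:
        return requested
    return MAX_TOKENS_DEFAULTS.get(model)


if __name__ == "__main__":
    print(resolve_max_tokens("claude-haiku-4-5"))
    print(resolve_max_tokens("claude-opus-4-7", requested=2048))
    print(resolve_max_tokens("claude-opus-4-7-preview"))
```

Exact matching trades convenience for predictability: a new model variant gets no default until it is added to the table, which avoids sending a limit the API may reject for that model.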
Summary
New Models
enable_thinking), tool call / stream-tool-call / multi-tool-call, agent-thought, ~1M context window, with pricing configured.
Anthropic Models Enhancement
max_tokens parameter for Anthropic models to prevent request failures caused by a missing value.
Change Type
Screenshots / Videos
LLM Plugin Checklist
Areas affected by this change (check all that apply)
Version
version in manifest.yaml (not the one under meta)
dify_plugin>=0.3.0,<0.6.0 is declared in pyproject.toml and locked in uv.lock (or kept in requirements.txt for legacy plugins without uv.lock); see SDK docs
Testing