
feat(aihubmix): add Qwen3.7 Max, Gemini 3.5 Flash, Grok 4.3 models and improve Anthropic support #3179

Merged
crazywoola merged 7 commits into langgenius:main from orangeclk:feature/aihubmix-add-models on May 25, 2026

@orangeclk orangeclk commented May 22, 2026

Summary

New Models

  • Qwen3.7 Max — Supports deep thinking (enable_thinking), tool call / stream-tool-call / multi-tool-call, agent-thought, ~1M context window, with pricing configured.
  • Gemini 3.5 Flash
  • Grok 4.3

Anthropic Models Enhancement

  • Added a default max_tokens parameter for Anthropic models to prevent request failures caused by a missing value.
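
The Anthropic Messages API requires max_tokens on every request, so a request that omits it is rejected. A minimal sketch of filling in a default before the request is sent follows; the function name and the `default` argument are hypothetical and are not taken from the plugin's anthropic.py.

```python
from typing import Any


def with_default_max_tokens(params: dict[str, Any], default: int) -> dict[str, Any]:
    """Return a copy of params with max_tokens set when it is absent or None.

    Hypothetical helper for illustration; not the plugin's actual code.
    """
    merged = dict(params)  # copy so the caller's dict is not mutated
    if merged.get("max_tokens") is None:
        merged["max_tokens"] = default
    return merged
```

A value the user sets explicitly is left untouched; only a missing or None value is replaced.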

Change Type

  • Documentation / non-plugin change
  • Non-LLM plugin (tools, extensions, datasource, etc.)
  • LLM plugin

Screenshots / Videos

Before / After screenshots (images not reproduced here)

LLM Plugin Checklist

Areas affected by this change (check all that apply)
  • Message flow (system messages, user ↔ assistant turn-taking)
  • Tool interaction flow (multi-round usage, Agent App and Agent Node)
  • Multimodal input (images, PDFs, audio, video, etc.)
  • Multimodal output (images, audio, video, etc.)
  • Structured output (JSON, XML, etc.)
  • Token consumption metrics
  • Other LLM functionality (reasoning, grounding, prompt caching, etc.)
  • New models / model parameter fixes

Version

  • Bumped top-level version in manifest.yaml (not the one under meta)
  • dify_plugin>=0.3.0,<0.6.0 is declared in pyproject.toml and locked in uv.lock (or kept in requirements.txt for legacy plugins without uv.lock) — SDK docs

Testing

  • Local deployment — Dify version: 1.14.2
  • SaaS (cloud.dify.ai)

@gemini-code-assist gemini-code-assist Bot left a comment

Code Review

This pull request adds support for several new models, including Qwen 3.7 Max, Gemini 3.5 Flash, and Grok 4.3, and updates the Anthropic model implementation. Feedback identifies a potential bug in the Anthropic integration where a high default max_tokens value could cause API errors. For the Gemini 3.5 Flash configuration, the reviewer noted a missing response_format parameter needed for structured outputs and a discrepancy between the defined default value and the help text for the thinking_level parameter.
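
The thinking_level mismatch described above is the kind of inconsistency a small check can catch. The sketch below assumes a parameter rule is loaded as a dict with `default`, `options`, and `help` keys; those key names and the sample rule are illustrative assumptions, not the contents of gemini-3.5-flash.yaml.

```python
from typing import Any


def default_mismatches(rule: dict[str, Any]) -> list[str]:
    """List inconsistencies between a rule's default, its options, and its help text."""
    problems: list[str] = []
    default = rule.get("default")
    if default is None:
        return problems
    options = rule.get("options") or []
    help_text = str(rule.get("help", "")).lower()
    if options and default not in options:
        problems.append(f"default {default!r} is not one of the options")
    # Only flag the help text when it talks about a default at all.
    if "default" in help_text and str(default).lower() not in help_text:
        problems.append(f"help text mentions a default but not {default!r}")
    return problems


# Hypothetical rule: the values below are made up for the example.
sample_rule = {
    "name": "thinking_level",
    "default": "Medium",
    "options": ["Low", "Medium", "High"],
    "help": "Controls reasoning depth. Defaults to High.",
}
issues = default_mismatches(sample_rule)
```

The help-text check is a plain substring match, so it is a heuristic and can miss differently worded descriptions.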

Comment thread models/aihubmix/models/llm/anthropic.py Outdated
Comment thread models/aihubmix/models/llm/gemini-3.5-flash.yaml
Comment thread models/aihubmix/models/llm/gemini-3.5-flash.yaml Outdated
@orangeclk orangeclk changed the title from "Feature/aihubmix Add Qwen 3.7 Max, Grok 4.3 and Gemini 3.5 Flash" to "feat(aihubmix): add Qwen3.7 Max, Gemini 3.5 Flash, Grok 4.3 models and improve Anthropic support" May 22, 2026
@orangeclk orangeclk temporarily deployed to models/aihubmix May 22, 2026 03:43 — with GitHub Actions Inactive
- Remove hardcoded max_tokens=128000 fallback in anthropic.py
- Set claude-haiku-4-5 max_tokens default/max to 64000
- Set claude-opus-4-5 max_tokens default/max to 64000
- Set claude-opus-4-7 max_tokens default/max to 128000
- Change gemini-3.5-flash thinking level default to Medium
@orangeclk orangeclk temporarily deployed to models/aihubmix May 22, 2026 06:13 — with GitHub Actions Inactive
@orangeclk orangeclk marked this pull request as ready for review May 22, 2026 06:35
@dosubot dosubot Bot added the size:S (This PR changes 10-29 lines, ignoring generated files.) and enhancement (New feature or request) labels May 22, 2026
@orangeclk orangeclk temporarily deployed to models/aihubmix May 22, 2026 06:37 — with GitHub Actions Inactive
@dosubot dosubot Bot added the lgtm (This PR has been approved by a maintainer) label May 25, 2026
@crazywoola crazywoola merged commit 5de2a3a into langgenius:main May 25, 2026
3 checks passed
@orangeclk orangeclk deleted the feature/aihubmix-add-models branch May 25, 2026 01:44
ous50 pushed a commit to ous50/dify-official-plugins that referenced this pull request May 25, 2026
…d improve Anthropic support (langgenius#3179)

* feat(aihubmix): add Gemini 3.5 Flash and Grok 4.3 model support

* feat(aihubmix): add default max_tokens parameter for Anthropic models

* feat(aihubmix): add Qwen3.7 Max model support

* chore(aihubmix): bump version to 0.0.29

* fix(aihubmix): update max_tokens defaults and remove hardcoded fallback

- Remove hardcoded max_tokens=128000 fallback in anthropic.py
- Set claude-haiku-4-5 max_tokens default/max to 64000
- Set claude-opus-4-5 max_tokens default/max to 64000
- Set claude-opus-4-7 max_tokens default/max to 128000
- Change gemini-3.5-flash thinking level default to Medium

* refactor(aihubmix): use exact model name matching for max_tokens defaults
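
As a rough illustration of the last two commits, the per-model defaults can be kept in a table keyed by the exact model name, with no hardcoded fallback for unknown names. The values come from the commit message above; the table and function names are assumptions, not the code in anthropic.py.

```python
from typing import Optional

# Values from the commit message; the dict structure itself is assumed.
MAX_TOKENS_DEFAULTS: dict[str, int] = {
    "claude-haiku-4-5": 64000,
    "claude-opus-4-5": 64000,
    "claude-opus-4-7": 128000,
}


def default_max_tokens(model: str) -> Optional[int]:
    """Return the default for an exact model name, or None when the name is unknown."""
    return MAX_TOKENS_DEFAULTS.get(model)
```

Exact matching means a name that merely contains a known name, such as a hypothetical claude-opus-4-5-preview, does not silently inherit a limit that may not apply to it; the caller sees None and can decide what to do.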
