feat(openai/azure): enable temperature when reasoning_effort=none for… by henry-fung · Pull Request #2829 · langgenius/dify-official-plugins

henry-fung · 2026-04-04T02:17:45Z

Summary

Starting from GPT-5.2 (confirmed also for GPT-5.1), when reasoning_effort is
set to "none", temperature and top_p become valid parameters. Passing them
with any other reasoning effort value causes a 400 API error. This PR implements
the conditional logic for gpt-5.1, gpt-5.2, and gpt-5.4 in both the OpenAI
and Azure OpenAI providers.

OpenAI provider

gpt-5.1.yaml, gpt-5.2.yaml, gpt-5.4.yaml: add temperature to
parameter_rules
llm.py (_build_responses_api_params): strip temperature/top_p/logprobs
when reasoning is active (non-"none")
llm.py (_chat_generate): same stripping for the standard Chat Completions path

Azure OpenAI provider

constants.py: add temperature ParameterRule to gpt-5.1, gpt-5.2,
gpt-5.4 entries
llm.py (_chat_generate_with_responses): restructure reasoning vs
temperature/top_p as mutually exclusive — reasoning active → set
reasoning: {effort} only; reasoning="none" or absent → pass
temperature/top_p, skip reasoning param

Related Issues or Context

Implements the parameter constraints documented by OpenAI:

"The following parameters are only supported when using GPT-5.4 with reasoning
effort set to none: temperature, top_p, logprobs. Requests to GPT-5.4 or GPT-5.2
with any other reasoning effort setting … will raise an error."

Verified via direct API testing that GPT-5.1 follows the same rules.

This PR contains Changes to Non-Plugin

Documentation
Other

This PR contains Changes to Non-LLM Models Plugin

I have Run Comprehensive Tests Relevant to My Changes

This PR contains Changes to LLM Models Plugin

My Changes Affect Message Flow Handling (System Messages and User→Assistant Turn-Taking)
My Changes Affect Tool Interaction Flow (Multi-Round Usage and Output Handling, for both Agent App and Agent Node)
My Changes Affect Multimodal Input Handling (Images, PDFs, Audio, Video, etc.)
My Changes Affect Multimodal Output Generation (Images, Audio, Video, etc.)
My Changes Affect Structured Output Format (JSON, XML, etc.)
My Changes Affect Token Consumption Metrics
My Changes Affect Other LLM Functionalities (Reasoning Process, Grounding, Prompt Caching, etc.)
Other Changes (Add New Models, Fix Model Parameters etc.)

gpt-5.1/5.2/5.4 — reasoning_effort vs temperature behavior:

reasoning_effort	temperature sent to API	Result
`none` (default)	✅ included	Success
`low` / `medium` / `high`	❌ stripped	Would cause 400 error

Version Control

I have Bumped Up the Version in Manifest.yaml
- models/openai/manifest.yaml: 0.3.4 → 0.3.5
- models/azure_openai/manifest.yaml: 0.0.49 → 0.0.50

Dify Plugin SDK Version

I have Ensured dify_plugin>=0.3.0,<0.6.0 is in requirements.txt

Environment Verification

Local Deployment Environment

Dify Version is: , I have Tested My Changes on Local Deployment Dify

SaaS Environment

I have Tested My Changes on cloud.dify.ai

… gpt-5.1/5.2/5.4 Starting from GPT-5.2 (and confirmed also for GPT-5.1), when reasoning_effort is set to "none", temperature and top_p become valid parameters. With any other reasoning effort value, passing these parameters causes a 400 API error. Changes: - OpenAI provider: add temperature parameter to gpt-5.1/5.2/5.4 YAML definitions - OpenAI llm.py (_build_responses_api_params): strip temperature/top_p/logprobs when reasoning_effort is active (non-"none") - OpenAI llm.py (_chat_generate): same stripping for standard Chat Completions path - Azure OpenAI constants.py: add temperature ParameterRule to gpt-5.1/5.2/5.4 entries - Azure OpenAI llm.py (_chat_generate_with_responses): restructure reasoning vs temperature/top_p as mutually exclusive — reasoning active → set reasoning param only; reasoning=none or absent → pass temperature/top_p, skip reasoning param Tested against Azure OpenAI (dscgpt.openai.azure.com) with all three deployments: - reasoning=none + temperature=0.7 → success - reasoning=low + temperature=0.7 → expected 400 error Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request introduces support for the reasoning_effort parameter in Azure OpenAI and OpenAI models, specifically targeting GPT-5 variants. It includes logic to conditionally include or strip parameters like temperature, top_p, and logprobs based on whether reasoning is active, as these are mutually exclusive in the underlying APIs. Review feedback highlights the need for consistent top_p parameter rules across model definitions and suggests ensuring logprobs is consistently stripped in the standard chat generation path when reasoning is enabled.

… text-embedding-3-* models Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Add top_p to gpt-5.1 parameter rules (openai yaml and azure constants) - Strip logprobs when reasoning_effort is active in _chat_generate path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

dosubot Bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Apr 4, 2026

henry-fung temporarily deployed to models/openai April 4, 2026 02:18 — with GitHub Actions Inactive

henry-fung temporarily deployed to models/azure_openai April 4, 2026 02:18 — with GitHub Actions Inactive

dosubot Bot added the enhancement New feature or request label Apr 4, 2026

gemini-code-assist Bot reviewed Apr 4, 2026

View reviewed changes

Comment thread models/azure_openai/models/constants.py

Comment thread models/openai/models/llm/gpt-5.1.yaml

Comment thread models/openai/models/llm/llm.py

henry-fung and others added 2 commits May 10, 2026 12:59

Merge branch 'main' into feat/gpt5x-temperature-reasoning-effort-none

e4e73d8

feat(embedding): add dimensions parameter for OpenAI and Azure OpenAI…

dce04f6

… text-embedding-3-* models Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

henry-fung had a problem deploying to models/azure_openai May 10, 2026 05:33 — with GitHub Actions Failure

henry-fung had a problem deploying to models/openai May 10, 2026 05:33 — with GitHub Actions Failure

henry-fung and others added 2 commits May 10, 2026 13:35

fix: apply review suggestions for gpt-5.1 top_p and logprobs stripping

80cc23b

- Add top_p to gpt-5.1 parameter rules (openai yaml and azure constants) - Strip logprobs when reasoning_effort is active in _chat_generate path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore(azure_openai): bump version to 0.0.56

6ec75e1

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

dosubot Bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels May 10, 2026

chore(openai): bump version to 0.3.9

6acdfa9

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

henry-fung had a problem deploying to models/azure_openai May 10, 2026 05:39 — with GitHub Actions Failure

henry-fung had a problem deploying to models/openai May 10, 2026 05:39 — with GitHub Actions Failure

fix(azure_openai): fix duplicate name keyword in gpt-5.1 ParameterRule

cca228e

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

henry-fung temporarily deployed to models/openai May 10, 2026 05:46 — with GitHub Actions Inactive

henry-fung had a problem deploying to models/azure_openai May 10, 2026 05:46 — with GitHub Actions Failure

fix(azure_openai): fix duplicate name keyword in gpt-5.4 ParameterRule

2d543ad

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

henry-fung temporarily deployed to models/openai May 10, 2026 05:50 — with GitHub Actions Inactive

henry-fung temporarily deployed to models/azure_openai May 10, 2026 05:50 — with GitHub Actions Inactive

Merge branch 'main' into feat/gpt5x-temperature-reasoning-effort-none

eeb27aa

henry-fung temporarily deployed to models/azure_openai May 16, 2026 09:35 — with GitHub Actions Inactive

henry-fung had a problem deploying to models/openai May 16, 2026 09:35 — with GitHub Actions Failure

chore(openai): bump version to 0.4.1

58ec6dd

henry-fung temporarily deployed to models/openai May 16, 2026 09:43 — with GitHub Actions Inactive

henry-fung temporarily deployed to models/azure_openai May 16, 2026 09:43 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(openai/azure): enable temperature when reasoning_effort=none for…#2829

feat(openai/azure): enable temperature when reasoning_effort=none for…#2829
henry-fung wants to merge 10 commits into
langgenius:mainfrom
henry-fung:feat/gpt5x-temperature-reasoning-effort-none

henry-fung commented Apr 4, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

henry-fung commented Apr 4, 2026

Summary

Related Issues or Context

This PR contains Changes to Non-Plugin

This PR contains Changes to Non-LLM Models Plugin

This PR contains Changes to LLM Models Plugin

Version Control

Dify Plugin SDK Version

Environment Verification

Local Deployment Environment

SaaS Environment

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant