add qwen35 rl config and fix mix bug #1640
@claude review
Claude:
## Summary
This PR adds a Qwen3.5-VL 35B GRPO mixed-data RL config, a new
## Issues
Critical
Nit
## Bug Fix Review
The
## Verdict
REQUEST_CHANGES: the
@claude review
Claude: Re-review of updated PR (3 commits).
## Summary
This PR adds a Qwen3.5-VL 35B GRPO mixed-data RL config, a new
## Issues
Warning
Nit
## Bug Fix ✓
## Previous Issue Resolved ✓
## Verdict
COMMENT: The previous critical issue is resolved. The remaining items are warnings and nits. The
@claude please approve it
Claude: Approved the PR as requested. All previously flagged critical issues (template_name default mismatch) have been resolved in the latest commits. The replay_buffer bug fix is correct, and the remaining items are minor nits only.
* add qwen35 rl config and fix mix bug
* update
* update
* fix
* fix
No description provided.