Skip to content

Pull requests: PaddlePaddle/PaddleFormers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Test paddleformers bot
#4385 opened Apr 29, 2026 by zjjlivein Collaborator Loading…
2 tasks
Refactor zero_cost_checkpoint to use color-group-based 2D param colle…
#4380 opened Apr 28, 2026 by xxyux Contributor Loading…
2 tasks
update muon slice func
#4378 opened Apr 28, 2026 by xxyux Contributor Loading…
2 tasks
Feature/add model openelm contributor
#4372 opened Apr 28, 2026 by learncat163 Loading…
[Qwen3Moe] Add pad_token_id fallback
#4371 opened Apr 28, 2026 by SigureMo Member Loading…
[CI]update runs-on
#4370 opened Apr 28, 2026 by Liujie0926 Collaborator Loading…
2 tasks
udpate init_optimizer
#4367 opened Apr 27, 2026 by xxyux Contributor Loading…
2 tasks
[fix] new epoch acc loss logic fix
#4365 opened Apr 27, 2026 by wacxr123 Contributor Loading…
Add granite
#4364 opened Apr 27, 2026 by Minestar6 Loading…
[Qwen3MoE] Fix gradient alignment between PaddlePaddle and PyTorch/HF
#4358 opened Apr 25, 2026 by a31413510 Collaborator Loading…
Add auto-subabtch config
#4355 opened Apr 24, 2026 by Difers Contributor Loading…
2 tasks
[release/1.1] Add high_precision_rope cfg contributor
#4354 opened Apr 24, 2026 by risemeup1111 Loading…
2 tasks
add olmo2 model
#4353 opened Apr 24, 2026 by yicycyc Loading…
fix aoa shared_head contributor
#4352 opened Apr 24, 2026 by Lcysabcu Collaborator Loading…
2 tasks
[Deps] pin paddlecodec to >=0.1, <0.2 for Paddle 3.3 compatibility
#4349 opened Apr 24, 2026 by SigureMo Member Loading…
1 of 2 tasks
add allure for ci/ce
#4338 opened Apr 22, 2026 by Liujie0926 Collaborator Loading…
2 tasks
[CI] add_qwen3vl_moe
#4334 opened Apr 21, 2026 by zjjlivein Collaborator Loading…
2 tasks
Set the default value of distributed_dataloader to true.
#4331 opened Apr 21, 2026 by Jonathans575 Collaborator Loading…
2 tasks
[Model] Add DeepSeek-OCR-2 PaddlePaddle implementation with full SFT and LoRA support
#4324 opened Apr 20, 2026 by forBlank Collaborator Loading…
2 tasks done
[GLM4MoE] Set attention_softmax_in_fp32 and bf16 defaults in GLMMoEMo…
#4314 opened Apr 17, 2026 by zhanghonggeng Contributor Loading…
2 tasks
Fix: when using a streaming data pipeline, dataloader_num_workers mus…
#4286 opened Apr 15, 2026 by Jonathans575 Collaborator Loading…
2 tasks
disable qwen2
#4277 opened Apr 14, 2026 by zjjlivein Collaborator Loading…
2 tasks
Bump pytest from 8.1.1 to 9.0.3 in /tests contributor dependencies Pull requests that update a dependency file python Pull requests that update python code
#4275 opened Apr 13, 2026 by dependabot Bot Loading…
ProTip! Follow long discussions with comments:>50.