
Pull requests: PaddlePaddle/FastDeploy

Pull requests list

[CI] Skip CI for non-runtime directories and add unittest Claude skill
#7870 opened May 20, 2026 by EmmonsCurse Collaborator
2 of 5 tasks
Ck
#7869 opened May 20, 2026 by lizexu123 Collaborator
5 tasks
[RL] Reset buffer size of slot_mapping
#7868 opened May 20, 2026 by gongshaotian Collaborator
5 tasks done
[RL] Reset buffer size of slot_mapping
#7866 opened May 20, 2026 by gongshaotian Collaborator
5 tasks done
[Metric] Support custom metric labels
#7865 opened May 20, 2026 by liyonghua0910 Collaborator
2 of 5 tasks
[XPU] Fix mtp cudagraph
#7864 opened May 20, 2026 by cmcamdy Collaborator
2 of 5 tasks
[KVCache] Add free_cpu_block_num gauge metric
#7856 opened May 19, 2026 by liyonghua0910 Collaborator
2 of 5 tasks
[Cherry-Pick][KVCache] Support request-level prefix cache disable (#7854)
#7855 opened May 19, 2026 by kevincheng2 Collaborator
4 of 5 tasks
[KVCache] Support request-level prefix cache disable
#7854 opened May 19, 2026 by kevincheng2 Collaborator
3 of 5 tasks
[DataProcessor] Refactor and unify text/multimodal processor pipeline
#7853 opened May 19, 2026 by luukunn Collaborator
3 of 5 tasks
Support Triton MLA Attention Backend
#7852 opened May 19, 2026 by chang-wenbin Collaborator
5 tasks
[Speculative Decoding]【Hackathon 10th Spring No.54】hybrid_mtp_ngram end-to-end validation (labels: contributor, External developers)
#7849 opened May 19, 2026 by NKNaN Contributor
5 tasks done
[Cherry-Pick][Feature][Log]console metrics log for pd disaggregation #7843
#7845 opened May 18, 2026 by CSWYF3634076 Collaborator
5 tasks done
[Feature] Add server-level token limits and prompt truncation control
#7842 opened May 18, 2026 by luukunn Collaborator
3 of 5 tasks
[BugFix] Fix attention mask for multimodal models
#7841 opened May 18, 2026 by TBD1 Collaborator
2 of 5 tasks
[PD] PD send cache via storage & Refine swap_cache_layout op
#7839 opened May 17, 2026 by juncaipeng Collaborator
1 of 5 tasks
support MLA overlap-schedule
#7836 opened May 15, 2026 by chang-wenbin Collaborator
5 tasks
Add inner benchmark metrics component
#7831 opened May 15, 2026 by Deleter-D Collaborator
5 tasks
[Cherry-Pick][Loader] Add values natural order check to layers grouped validation
#7822 opened May 14, 2026 by bukejiyu Collaborator
1 of 5 tasks
[Others] update flash mask version
#7819 opened May 14, 2026 by BingooYang Contributor
5 tasks done
[Feature] GPU Model Runner V1
#7810 opened May 13, 2026 by ming1753 Collaborator Draft
5 tasks
[bugfix] free blocks even if AS write failed
#7807 opened May 13, 2026 by zccjjj Contributor
5 tasks
Triton mla
#7804 opened May 13, 2026 by Linboyan-trc
[Others] Benchmark compare skill (labels: contributor, External developers)
#7803 opened May 13, 2026 by Linboyan-trc