Skip to content

Commit a1ddf49

Browse files
authored
Link fix for p2p (#334)
* date and link * update
1 parent cdbce0e commit a1ddf49

1 file changed

Lines changed: 2 additions & 2 deletions

File tree

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
---
22
title: "Updating 1T parameters in seconds — P2P weight transfer in Large Scale Distributed RL"
33
author: "Jiadong Guo, Xin Ji, Letian Ruan, Teng Ma, Chenyang Zhao, Yueming Yuan, Zhichen Zeng"
4-
date: "April 22, 2026"
5-
previewImg: /images/blog/p2p-update/p2p_overview.png
4+
date: "April 29, 2026"
5+
previewImg: /images/blog/p2p-update/p2p-overview.png
66
---
77

88
We introduced a **RDMA-based, Peer to Peer weight update** mechanism for RL workloads in SGLang as a supplement to traditional NCCL broadcast methods, compatible with all major open source models. By utilizing a source-side **CPU engine replica** and **P2P RDMA transfers** via Mooncake TransferEngine, we speed up weight transfer times for 1T-parameter Kimi-K2 7 times (53 seconds -> 7.2 seconds), at the cost of one additional inference engine replica (32G) per training rank on CPU memory. These optimizations minimize network redundancy and allow inference servers to resume rollout significantly faster.

0 commit comments

Comments
 (0)