Updating 1T parameters in seconds — P2P weight transfer in Large-Scale Distributed RL

Jiadong Guo, Xin Ji, Letian Ruan, Teng Ma, Chenyang Zhao, Yueming Yuan, Zhichen Zeng, April 29, 2026 SGLang RL Infra Network

A study from the SGLang-RL team on peer-to-peer weight transfer that updates trillion-parameter models in seconds during large-scale distributed RL training.

Read the full post on LMSYS.Org.