Skip to yearly menu bar Skip to main content


Oral

DreamDDP: Accelerating Low-Bandwidth Geo-Distributed LLM Training with Layer-wise Partial Synchronization

Zhenheng Tang · Zichen TANG · Junlin Huang · Xinglin Pan · Rudan Yan · Yuxin Wang · · Shaohuai Shi · Xiaowen Chu ·

Abstract

Chat is not available.