Oral
Thu Apr 08 03:20 PM -- 03:40 PM (PDT)
Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters
[Paper PDF]
Oral
Thu Apr 08 03:40 PM -- 04:00 PM (PDT)
Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery
[Paper PDF]
Oral
Thu Apr 08 04:00 PM -- 04:20 PM (PDT)
Wavelet: Efficient DNN Training with Tick-Tock Scheduling
[Paper PDF]
Oral
Thu Apr 08 04:20 PM -- 04:40 PM (PDT)
Pipelined Backpropagation at Scale: Training Large Models without Batches
[Paper PDF]