Modern neural networks are increasingly bottlenecked by the limited capacity of on-device GPU memory. Prior work explores dropping activations as a strategy for scaling to larger neural networks within a fixed memory budget. However, these heuristics assume a uniform cost per layer and consider only simple linear-chain architectures, which limits their usability. In this paper, we formalize the problem of trading off computation time against memory requirements in DNN training as the tensor rematerialization optimization problem. We develop a new system that solves the problem optimally in a reasonable time (under an hour) using off-the-shelf MILP solvers. The resulting schedules then accelerate millions of training iterations. Our optimization pass in TensorFlow 2.0 automatically yields real training speedups of up to 4.8x over prior work and enables up to a 5x increase in input size for real-world large networks.
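To make the compute/memory trade-off concrete, the following is a minimal sketch in a deliberately simplified toy model. It is not the paper's MILP formulation. The function name `best_schedule` and the cost model are assumptions introduced for illustration: each retained activation occupies its memory cost, each dropped activation is recomputed exactly once at its compute cost, and the search is a brute-force enumeration rather than a solver call. It does show why non-uniform per-layer costs matter: the best activations to keep are those that are expensive to recompute but cheap to store.

```python
from itertools import product


def best_schedule(compute_cost, memory_cost, budget):
    """Pick which activations to retain so recomputation is minimal.

    Toy model (an assumption, not the paper's formulation):
      - a retained activation i occupies memory_cost[i]
      - a dropped activation i is recomputed once at compute_cost[i]
    Returns (extra_compute, keep_mask), or None if no choice fits the budget.
    """
    n = len(compute_cost)
    best = None
    # Enumerate every retain/drop assignment; feasible only for small n.
    for keep in product([False, True], repeat=n):
        memory = sum(m for m, k in zip(memory_cost, keep) if k)
        if memory > budget:
            continue  # violates the memory budget
        extra = sum(c for c, k in zip(compute_cost, keep) if not k)
        if best is None or extra < best[0]:
            best = (extra, keep)
    return best


if __name__ == "__main__":
    # Hypothetical per-layer costs, chosen only for illustration.
    print(best_schedule([4, 1, 3, 2], [2, 2, 1, 3], budget=4))
```

The enumeration grows as 2^n in the number of layers, so it is only usable on tiny examples. That growth is one reason a practical system would hand the problem to an optimization solver instead.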
Author Information
Paras Jain (UC Berkeley)
Ajay Jain (UC Berkeley)
Aniruddha Nrusimha (UC Berkeley)
Amir Gholami (UC Berkeley)
Pieter Abbeel (UC Berkeley)
Joseph Gonzalez (UC Berkeley)
Kurt Keutzer (EECS, UC Berkeley)
Ion Stoica (UC Berkeley)
Related Events (a corresponding poster, oral, or spotlight)
- 2020 Poster: Breaking the Memory Wall with Optimal Tensor Rematerialization
  Tue. Mar 3rd 12:30 -- 03:00 AM Room Ballroom A #9
More from the Same Authors
- 2023 Poster: On Optimizing the Communication of Model Parallelism
  Yonghao Zhuang · · Lianmin Zheng · Zhuohan Li · Eric Xing · Qirong Ho · Joseph Gonzalez · Ion Stoica · Hao Zhang
- 2021 Poster: sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
  Guanhua Wang · Zhuang Liu · Brandon Hsieh · Siyuan Zhuang · Joseph Gonzalez · Trevor Darrell · Ion Stoica
- 2021 Oral: sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
  Guanhua Wang · Zhuang Liu · Brandon Hsieh · Siyuan Zhuang · Joseph Gonzalez · Trevor Darrell · Ion Stoica
- 2021 Remarks: Opening Remarks
  Alex Dimakis · Ion Stoica · Alexander Smola
- 2020 Oral: AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
  Ameer Haj-Ali · Qijing (Jenny) Huang · John Xiang · William Moses · Krste Asanovic · John Wawrzynek · Ion Stoica
- 2020 Poster: AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
  Ameer Haj-Ali · Qijing (Jenny) Huang · John Xiang · William Moses · Krste Asanovic · John Wawrzynek · Ion Stoica