[9:10]
Boveda: Building an On-Chip Deep Learning Memory Hierarchy Brick by Brick
[9:30]
Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models
[9:50]
A Distributed Graph-Theoretic Framework for Automatic Parallelization in Multi-core Systems
[10:10]
Accelerating SLIDE Deep Learning on Modern CPUs: Vectorization, Quantizations, Memory Optimizations, and More
[10:30]
Scaling Polyhedral Neural Network Verification on GPUs