Oral
|
Wed 15:27 |
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance Jiarong Xing · Leyuan Wang · Shang Zhang · Jack Chen · Ang Chen · Yibo Zhu |
|
Oral
|
Tue 13:36 |
SLA-Driven ML INFERENCE FRAMEWORK FOR CLOUDS WITH HETEROGENEOUS ACCELERATORS Junguk Cho · Diman Zad Tootaghaj · Lianjie Cao · Puneet Sharma |
|
Oral
|
Wed 15:09 |
Learning Compressed Embeddings for On-Device Inference Niketan Pansare · Jay Katukuri · Aditya Arora · Frank Cipollone · Riyaaz Shaik · Noyan Tokgozoglu · Chandru Venkataraman |
|
Oral
|
Wed 14:51 |
HALOS: Hashing Large Output Space for Cheap Inference Zichang Liu · Zhaozhuo Xu · Alan Ji · Junyan Zhang · Jonathan Li · Beidi Chen · Anshumali Shrivastava |
|
Oral
|
Mon 9:57 |
GPU Semiring Primitives for Sparse Neighborhood Methods Corey Nolet · Divye Gala · Edward Raff · Joe Eaton · Brad Rees · Tim Oates |
|
Oral
|
Tue 14:15 |
Collapsible Linear Blocks for Super-Efficient Super Resolution Kartikeya Bhardwaj · Milos Milosavljevic · Liam O'Neil · Dibakar Gope · Ramon Matas · Alex Chalfin · Alex Chalfin · Naveen Suda · Naveen Suda · Lingchuan Meng · Lingchuan Meng · Danny Loh · Danny Loh |
|
Oral
|
Wed 14:15 |
ULPPACK: Fast Sub-8-bit Matrix Multiply on Commodity SIMD Hardware Jaeyeon Won · Jeyeon Si · Sam Son · Tae Jun Ham · Jae W. Lee |
|
Oral
|
Wed 13:00 |
Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective Hengrui Zhang · Zhongming Yu · Guohao Dai · Guyue Huang · Yufei Ding · Yuan Xie · Yu Wang |
|
Oral
|
Wed 9:57 |
Gyro Dropout: Maximizing Ensemble Effect in Neural Network Training JUNYEOL LEE · HYUNGJUN OH · Jiwon Seo |
|
Oral
|
Mon 16:18 |
TorchSparse: Efficient Point Cloud Inference Engine Haotian Tang · Zhijian Liu · Xiuyu Li · Yujun Lin · Song Han |
|
Oral
|
Tue 14:33 |
Towards the Co-design of Neural Networks and Accelerators Yanqi Zhou · Xuanyi Dong · Tianjian Meng · Mingxing Tan · Berkin Akin · Daiyi Peng · Amir Yazdanbakhsh · Da Huang · Ravi Narayanaswami · James Laudon |
|
Oral Session
|
Tue 14:15 |
Hardware Efficient ML |