Search All 2022 Events
 

Results

<<   <   Page 1 of 2   >   >>
Oral
Wed 15:27 Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
Jiarong Xing · Leyuan Wang · Shang Zhang · Jack Chen · Ang Chen · Yibo Zhu
Oral
Tue 13:36 SLA-Driven ML INFERENCE FRAMEWORK FOR CLOUDS WITH HETEROGENEOUS ACCELERATORS
Junguk Cho · Diman Zad Tootaghaj · Lianjie Cao · Puneet Sharma
Oral
Wed 15:09 Learning Compressed Embeddings for On-Device Inference
Niketan Pansare · Jay Katukuri · Aditya Arora · Frank Cipollone · Riyaaz Shaik · Noyan Tokgozoglu · Chandru Venkataraman
Oral
Wed 14:51 HALOS: Hashing Large Output Space for Cheap Inference
Zichang Liu · Zhaozhuo Xu · Alan Ji · Junyan Zhang · Jonathan Li · Beidi Chen · Anshumali Shrivastava
Oral
Mon 9:57 GPU Semiring Primitives for Sparse Neighborhood Methods
Corey Nolet · Divye Gala · Edward Raff · Joe Eaton · Brad Rees · Tim Oates
Oral
Tue 14:15 Collapsible Linear Blocks for Super-Efficient Super Resolution
Kartikeya Bhardwaj · Milos Milosavljevic · Liam O'Neil · Dibakar Gope · Ramon Matas · Alex Chalfin · Alex Chalfin · Naveen Suda · Naveen Suda · Lingchuan Meng · Lingchuan Meng · Danny Loh · Danny Loh
Oral
Wed 14:15 ULPPACK: Fast Sub-8-bit Matrix Multiply on Commodity SIMD Hardware
Jaeyeon Won · Jeyeon Si · Sam Son · Tae Jun Ham · Jae W. Lee
Oral
Wed 13:00 Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective
Hengrui Zhang · Zhongming Yu · Guohao Dai · Guyue Huang · Yufei Ding · Yuan Xie · Yu Wang
Oral
Wed 9:57 Gyro Dropout: Maximizing Ensemble Effect in Neural Network Training
JUNYEOL LEE · HYUNGJUN OH · Jiwon Seo
Oral
Mon 16:18 TorchSparse: Efficient Point Cloud Inference Engine
Haotian Tang · Zhijian Liu · Xiuyu Li · Yujun Lin · Song Han
Oral
Tue 14:33 Towards the Co-design of Neural Networks and Accelerators
Yanqi Zhou · Xuanyi Dong · Tianjian Meng · Mingxing Tan · Berkin Akin · Daiyi Peng · Amir Yazdanbakhsh · Da Huang · Ravi Narayanaswami · James Laudon
Oral Session
Tue 14:15 Hardware Efficient ML