(5 events)
Oral
Wed Apr 07 03:20 PM -- 03:40 PM (PDT)
Nimble: Efficiently Compiling Dynamic Neural Networks for Model Inference
Haichen Shen · Jared Roesch · Zhi Chen · Wei Chen · Yong Wu · Mu Li · Vin Sharma · Zachary Tatlock · Yida Wang
Oral
Wed Apr 07 03:40 PM -- 04:00 PM (PDT)
MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions
Wenqi Jiang · Zhenhao He · Shuai Zhang · Thomas B. Preußer · Kai Zeng · Liang Feng · Jiansong Zhang · Tongxuan Liu · Yong Li · Jingren Zhou · Ce Zhang · Gustavo Alonso
Oral
Wed Apr 07 04:00 PM -- 04:20 PM (PDT)
VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
Steve Dai · Rangha Venkatesan · Mark Ren · Brian Zimmer · William Dally · Brucek Khailany
Oral
Wed Apr 07 04:20 PM -- 04:40 PM (PDT)
Accelerate Inference of CNNs for Video Analysis While Preserving Exactness Exploiting Activation Sparsity
Toshiaki Wakatsuki · Sekitoshi Kanai · Yasuhiro Fujiwara
Oral
Wed Apr 07 04:40 PM -- 05:00 PM (PDT)
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
Guanhua Wang · Zhuang Liu · Brandon Hsieh · Siyuan Zhuang · Joseph Gonzalez · Trevor Darrell · Ion Stoica