Toggle Poster Visibility
Oral
Wed Apr 07 03:20 PM -- 03:40 PM (PDT)
Nimble: Efficiently Compiling Dynamic Neural Networks for Model Inference
[
Paper PDF]
Oral
Wed Apr 07 03:40 PM -- 04:00 PM (PDT)
MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions
[
Paper PDF]
Oral
Wed Apr 07 04:00 PM -- 04:20 PM (PDT)
VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
[
Paper PDF]
Oral
Wed Apr 07 04:20 PM -- 04:40 PM (PDT)
Accelerate Inference of CNNs for Video Analysis While Preserving Exactness Exploiting Activation Sparsity
[
Paper PDF]
Oral
Wed Apr 07 04:40 PM -- 05:00 PM (PDT)
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
[
Paper PDF]