

Invited 1 in Workshop: 2nd On-Device Intelligence Workshop

Efficient ML on the Edge with Apache TVM (Thierry Moreau, OctoML)

Thierry Moreau


Abstract:

Apache TVM is a complete deep learning compilation framework -- it automatically generates fast binary code for any model, on any device, by exploring a large search space of potential optimizations. TVM itself uses machine learning to guide its code synthesis process, saving months of engineering time. The code generated by TVM can be many times faster than hand-optimized libraries -- in some cases exceeding a speedup of 30x over hand-tuned code.

In this talk, I will give an overview of Apache TVM and how we are using it at OctoML to enable model deployment on mobile and IoT devices. I’ll highlight our recent efforts on micro-TVM, TVM’s solution for deploying ML on microcontrollers.

Biography:

Dr. Thierry Moreau is a co-founder of OctoML Inc., a Seattle-based startup that applies state-of-the-art ML-based automation to put fast and efficient ML into production in the datacenter and on the edge. Thierry has been a key contributor to Apache TVM, the open source machine learning compiler that started at the University of Washington, where Thierry earned his Ph.D. Today he works closely with top semiconductor companies to grow the range of hardware devices that TVM targets.