Timezone: »
Executing machine learning workloads locally on resource constrained microcontrollers (MCUs) promises to drastically expand the application space of IoT. However, so-called TinyML presents severe technical challenges, as deep neural network inference demands a large compute and memory budget. To address this challenge, neural architecture search (NAS) promises to help design accurate ML models that meet the tight MCU memory, latency, and energy constraints. A key component of NAS algorithms is their latency/energy model, i.e., the mapping from a given neural network architecture to its inference latency/energy on an MCU. In this paper, we observe an intriguing property of NAS search spaces for MCU model design: on average, model latency varies linearly with model operation (op) count under a uniform prior over models in the search space. Exploiting this insight, we employ differentiable NAS (DNAS) to search for models with low memory usage and low op count, where op count is treated as a viable proxy to latency. Experimental results validate our methodology, yielding our MicroNet models, which we deploy on MCUs using Tensorflow Lite Micro, a standard open-source neural network (NN) inference runtime widely used in the TinyML community. MicroNets demonstrate state-of-the-art results for all three TinyMLperf industry-standard benchmark tasks: visual wake words, audio keyword spotting, and anomaly detection. Models and training scripts can be found at https://github.com/ARM-software/ML-zoo.
Author Information
Colby Banbury (Harvard)
Chuteng Zhou (Arm Research)
Igor Fedorov (Arm Research)
Ramon Matas (arm)
Urmish Thakker (SambaNova Systems)
Dibakar Gope (Arm)
Vijay Janapa Reddi (Harvard University)
Matthew Mattina (Arm ML Research Lab)
Paul Whatmough (Arm ML Research Lab)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Oral: MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity Microcontrollers »
Tue. Apr 6th 05:10 -- 05:30 PM Room
More from the Same Authors
-
2022 Poster: MLPerf Mobile Inference Benchmark: An Industry-Standard Open-Source Machine Learning Benchmark for On-Device AI »
Vijay Janapa Reddi · David Kanter · Peter Mattson · Jared Duke · Thai Nguyen · Ramesh Chukka · Ken Shiring · Koan-Sin Tan · Mark Charlebois · William Chou · Mostafa El-Khamy · Jungwook Hong · Tom St John · Cindy Trinh · Michael Buch · Mark Mazumder · Relja Markovic · Thomas Atta · Fatih Cakir · Masoud Charkhabi · Xiaodong Chen · Cheng-Ming Chiang · Dave Dexter · Terry Heo · Guenther Schmuelling · Maryam Shabani · Dylan Zika -
2022 Poster: Collapsible Linear Blocks for Super-Efficient Super Resolution »
Kartikeya Bhardwaj · Milos Milosavljevic · Liam O'Neil · Dibakar Gope · Ramon Matas · Alex Chalfin · Naveen Suda · Lingchuan Meng · Danny Loh -
2023 Workshop: Workshop on Systems for Next-Gen AI Paradigms »
Jason Yik · Brian Anderson · Charlotte Frenkel · Vijay Janapa Reddi -
2023 Workshop: The 3rd On-Device Intelligence Workshop »
Vijay Janapa Reddi · Paul Whatmough · Vikas Chandra · Pete Warden · Brian Plancher · Colby Banbury · Matthew Stewart -
2023 Poster: Edge Impulse: An MLOps Platform for Tiny Machine Learning »
colby banbury · Vijay Janapa Reddi · Alexander Elium · · David Tischler · Daniel Situnayake · Carl Ward · Louis Moreau · Jenny Plunkett · Matthew Kelcey · Mathijs Baaijens · Alessandro Grande · Dmitry Maslov · Arthur Beavis · Jan Jongboom · Jessica Quaye -
2023 Poster: XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse »
Hyoukjun Kwon · Krishnakumar Nair · Jamin Seo · Jason Yik · Debabrata Mohapatra · Dongyuan Zhan · JINOOK SONG · Peter Capak · Peizhao Zhang · Peter Vajda · Colby Banbury · Mark Mazumder · Liangzhen Lai · Ashish Sirasao · Tushar Krishna · Harshit Khaitan · Vikas Chandra · Vijay Janapa Reddi -
2023 : Closing Remarks »
Vijay Janapa Reddi -
2022 Oral: Collapsible Linear Blocks for Super-Efficient Super Resolution »
Kartikeya Bhardwaj · Milos Milosavljevic · Liam O'Neil · Dibakar Gope · Ramon Matas · Alex Chalfin · Alex Chalfin · Naveen Suda · Naveen Suda · Lingchuan Meng · Lingchuan Meng · Danny Loh · Danny Loh -
2022 Oral: MLPerf Mobile Inference Benchmark: An Industry-Standard Open-Source Machine Learning Benchmark for On-Device AI »
Vijay Janapa Reddi · David Kanter · Peter Mattson · Jared Duke · Thai Nguyen · Ramesh Chukka · Ken Shiring · Koan-Sin Tan · Mark Charlebois · William Chou · Mostafa El-Khamy · Jungwook Hong · Tom St John · Cindy Trinh · Michael Buch · Mark Mazumder · Relja Markovic · Thomas Atta · Fatih Cakir · Masoud Charkhabi · Xiaodong Chen · Cheng-Ming Chiang · Dave Dexter · Terry Heo · Guenther Schmuelling · Maryam Shabani · Dylan Zika -
2021 : The Future of ML is Tiny and Bright: Challenges and Opportunities »
Vijay Janapa Reddi -
2021 Workshop: 2nd On-Device Intelligence Workshop »
Paul Whatmough · Vijay Janapa Reddi · Chuteng Zhou · Igor Federov · Matthew Mattina · Pete Warden · Ganesh Venkatesh · Vikas Chandra -
2021 Poster: Doping: A technique for Extreme Compression of LSTM Models using Sparse Structured Additive Matrices »
Urmish Thakker · Paul Whatmough · ZHIGANG LIU · Matthew Mattina · Jesse Beu -
2021 Oral: Doping: A technique for Extreme Compression of LSTM Models using Sparse Structured Additive Matrices »
Urmish Thakker · Paul Whatmough · ZHIGANG LIU · Matthew Mattina · Jesse Beu -
2021 Poster: RL-Scope: Cross-stack Profiling for Deep Reinforcement Learning Workloads »
James Gleeson · Srivatsan Krishnan · Moshe Gabel · Vijay Janapa Reddi · Eyal de Lara · Gennady Pekhimenko -
2021 Oral: RL-Scope: Cross-stack Profiling for Deep Reinforcement Learning Workloads »
James Gleeson · Srivatsan Krishnan · Moshe Gabel · Vijay Janapa Reddi · Eyal de Lara · Gennady Pekhimenko -
2021 Poster: TensorFlow Lite Micro: Embedded Machine Learning for TinyML Systems »
Robert David · Jared Duke · Advait Jain · Vijay Janapa Reddi · Nat Jeffries · Jian Li · Nick Kreeger · Ian Nappier · Meghna Natraj · Tiezhen Wang · Pete Warden · Rocky Rhodes · Rocky Rhodes -
2021 Oral: TensorFlow Lite Micro: Embedded Machine Learning for TinyML Systems »
Robert David · Jared Duke · Advait Jain · Vijay Janapa Reddi · Nat Jeffries · Jian Li · Nick Kreeger · Ian Nappier · Meghna Natraj · Tiezhen Wang · Pete Warden · Rocky Rhodes · Rocky Rhodes -
2020 Oral: MLPerf Training Benchmark »
Peter Mattson · Christine Cheng · Gregory Diamos · Cody Coleman · Paulius Micikevicius · David Patterson · Hanlin Tang · Gu-Yeon Wei · Peter Bailis · Victor Bittorf · David Brooks · Dehao Chen · Debo Dutta · Udit Gupta · Kim Hazelwood · Andy Hock · Xinyuan Huang · Daniel Kang · David Kanter · Naveen Kumar · Jeffery Liao · Deepak Narayanan · Tayo Oguntebi · Gennady Pekhimenko · Lillian Pentecost · Vijay Janapa Reddi · Taylor Robie · Tom St John · Carole-Jean Wu · Lingjie Xu · Cliff Young · Matei Zaharia -
2020 Oral: Searching for Winograd-aware Quantized Networks »
Javier Fernandez-Marques · Paul Whatmough · Andrew Mundy · Matthew Mattina -
2020 Poster: Searching for Winograd-aware Quantized Networks »
Javier Fernandez-Marques · Paul Whatmough · Andrew Mundy · Matthew Mattina -
2020 Poster: MLPerf Training Benchmark »
Peter Mattson · Christine Cheng · Gregory Diamos · Cody Coleman · Paulius Micikevicius · David Patterson · Hanlin Tang · Gu-Yeon Wei · Peter Bailis · Victor Bittorf · David Brooks · Dehao Chen · Debo Dutta · Udit Gupta · Kim Hazelwood · Andy Hock · Xinyuan Huang · Daniel Kang · David Kanter · Naveen Kumar · Jeffery Liao · Deepak Narayanan · Tayo Oguntebi · Gennady Pekhimenko · Lillian Pentecost · Vijay Janapa Reddi · Taylor Robie · Tom St John · Carole-Jean Wu · Lingjie Xu · Cliff Young · Matei Zaharia -
2020 Demonstration: Air Learning: An End To End Learning Gym For Aerial Robots »
Srivatsan Krishnan · Colby Banbury · Bardienus Duisterhof · Aleksandra Faust · Vijay Janapa Reddi