Widening Access to Applied Machine Learning with TinyML

Broadening access to both computational and educational resources is critical to diffusing machine-learning (ML) innovation. However, today, most ML resources and experts are siloed in a few countries and organizations. In this paper, we describe our pedagogical approach to increasing access to applied ML through a massive open online course (MOOC) on Tiny Machine Learning (TinyML). We suggest that TinyML, ML on resource-constrained embedded devices, is an attractive means to widen access because TinyML both leverages low-cost and globally accessible hardware, and encourages the development of complete, self-contained applications, from data collection to deployment. To this end, a collaboration between academia (Harvard University) and industry (Google) produced a four-part MOOC that provides application-oriented instruction on how to develop solutions using TinyML. The series is openly available on the edX MOOC platform, has no prerequisites beyond basic programming, and is designed for learners from a global variety of backgrounds. It introduces pupils to real-world applications, ML algorithms, data-set engineering, and the ethical considerations of these technologies via hands-on programming and deployment of TinyML applications in both the cloud and their own microcontrollers. To facilitate continued learning, community building, and collaboration beyond the courses, we launched a standalone website, a forum, a chat, and an optional course-project competition. We also released the course materials publicly, hoping they will inspire the next generation of ML practitioners and educators and further broaden access to cutting-edge ML technologies.

[1]  Inioluwa Deborah Raji,et al.  Model Cards for Model Reporting , 2018, FAT.

[2]  Aakanksha Chowdhery,et al.  Visual Wake Words Dataset , 2019, ArXiv.

[3]  Pete Warden,et al.  Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition , 2018, ArXiv.

[4]  Faruk Kazi,et al.  Neural Network Based Early Warning System for an Emerging Blackout in Smart Grid Power Networks , 2014, ISI.

[5]  Kara M. Dawson,et al.  Does visual attention to the instructor in online video affect learning and learner perceptions? An eye-tracking analysis , 2020, Comput. Educ..

[6]  Chih-Ming Chen,et al.  Effects of Different Video Lecture Types on Sustained Attention, Emotion, Cognitive Load, and Learning Performance , 2015, IIAI-AAI.

[7]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[8]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[10]  Matthias Söllner,et al.  AI-Based Digital Assistants , 2019, Bus. Inf. Syst. Eng..

[11]  Christoph Fink,et al.  Machine learning for tracking illegal wildlife trade on social media , 2018, Nature Ecology & Evolution.

[12]  Sotiris Karabetsos,et al.  A Review of Machine Learning and IoT in Smart Transportation , 2019, Future Internet.

[13]  Peter Mayer,et al.  An investigation of phishing awareness and education over time: When and how to best remind users , 2020, SOUPS @ USENIX Security Symposium.

[14]  Paul Belleflamme,et al.  An Economic Appraisal of MOOC Platforms: Business Models and Impacts on Higher Education , 2014, CESifo Economic Studies.

[15]  Yundong Zhang,et al.  Hello Edge: Keyword Spotting on Microcontrollers , 2017, ArXiv.

[16]  Dan Jurafsky,et al.  Racial disparities in automated speech recognition , 2020, Proceedings of the National Academy of Sciences.

[17]  Logan Fiorella,et al.  Five ways to increase the effectiveness of instructional video , 2020 .

[18]  Antonio Liotta,et al.  Exploiting machine learning for intelligent room lighting applications , 2012, 2012 6th IEEE International Conference Intelligent Systems.

[19]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[20]  Jianxiong Xiao,et al.  DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21]  Alexander Gruenstein,et al.  A Cascade Architecture for Keyword Spotting on Mobile Devices , 2017, ArXiv.

[22]  Peter D. Welch,et al.  The Fast Fourier Transform and Its Applications , 1969 .

[23]  J J Kabara,et al.  Spiral curriculum. , 1972, Journal of medical education.

[24]  Dongyoung Kim,et al.  Temporal Convolution for Real-time Keyword Spotting on Mobile Devices , 2019, INTERSPEECH.

[25]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[26]  Pierre Baldi,et al.  Autoencoders, Unsupervised Learning, and Deep Architectures , 2011, ICML Unsupervised and Transfer Learning.

[27]  Rebecca Vivian,et al.  Addressing the challenges of a new digital technologies curriculum: MOOCs as a scalable solution for teacher professional development , 2014 .

[28]  M. Lakkala,et al.  The impact of project-based learning curriculum on first-year retention, study experiences, and knowledge work competence , 2020, Research Papers in Education.

[29]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[30]  Maximilian Lam,et al.  Benchmarking TinyML Systems: Challenges and Direction , 2020, ArXiv.

[31]  Philip J. Guo,et al.  How video production affects student engagement: an empirical study of MOOC videos , 2014, L@S.

[32]  Hadeel S. Alenezi,et al.  Utilizing crowdsourcing and machine learning in education: Literature review , 2020, Education and Information Technologies.

[33]  Yoram Neumann,et al.  The Robust Learning Model With A Spiral Curriculum: Implications For For TThe Educational EffectivenessOfOf Online Master Degree Programs , 2017 .

[34]  C. C. Singh MOOCs for Teacher Professional Development : Reflections , and Suggested Actions , 2018 .

[35]  B. Schirmer,et al.  Online Instruction in Higher Education: Promising, Research-based, and Evidence-based Practices , 2020, Journal of Education and e-Learning Research.

[36]  Xiangjun Zeng,et al.  Gearbox oil temperature anomaly detection for wind turbine based on sparse Bayesian probability estimation , 2020 .

[37]  Yossi Matias,et al.  Personalizing ASR for Dysarthric and Accented Speech with Limited Data , 2019, INTERSPEECH.

[38]  Lalu Banoth,et al.  A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection , 2017 .

[39]  Alasdair McDonald,et al.  Combining SCADA and vibration data into a single anomaly detection model to predict wind turbine component failure , 2020 .

[40]  Vijaya B. Kolachalama,et al.  Machine learning and medical education , 2018, npj Digital Medicine.

[41]  Tom M. Mitchell,et al.  Experience with a learning personal assistant , 1994, CACM.

[42]  Wenyuan Xu,et al.  DolphinAttack: Inaudible Voice Commands , 2017, CCS.

[43]  Kyle Taylor,et al.  Smartphone ownership is growing rapidly around the world, but not always equally , 2019 .

[44]  M. Sinclair,et al.  Project-based learning. , 1998, NT learning curve.

[45]  Alberto Rodriguez,et al.  TossingBot: Learning to Throw Arbitrary Objects With Residual Physics , 2019, IEEE Transactions on Robotics.

[46]  A. M. White The Process of Education , 1994 .

[47]  Henriette Tolstrup Holmegaard,et al.  Motivational patterns in STEM education: a self-determination perspective on first year courses , 2019 .

[48]  Ankur Teredesai,et al.  Interpretable Machine Learning in Healthcare , 2018, 2018 IEEE International Conference on Healthcare Informatics (ICHI).

[49]  P. Jamieson Arduino for Teaching Embedded Systems . Are Computer Scientists and Engineering Educators Missing the Boat ? , 2011 .

[50]  Roberto Morabito,et al.  A TinyMLaaS Ecosystem for Machine Learning in IoT: Overview and Research Challenges , 2021, 2021 International Symposium on VLSI Design, Automation and Test (VLSI-DAT).

[51]  Dazhi Yang Instructional strategies and course design for teaching statistics online: perspectives from online students , 2017, International journal of STEM education.

[52]  Joseph A. Paradiso,et al.  Deep Learning for Wildlife Conservation and Restoration Efforts , 2019 .

[53]  V. Reddi,et al.  TensorFlow Lite Micro: Embedded Machine Learning on TinyML Systems , 2020, MLSys.

[54]  Martin,et al.  The Designer's Guide to the Cortex-M Processor Family: A Tutorial Approach , 2013 .

[55]  Y. Kawahara,et al.  Telemetry-mining: a machine learning approach to anomaly detection and fault diagnosis for space systems , 2006, 2nd IEEE International Conference on Space Mission Challenges for Information Technology (SMC-IT'06).

[56]  Privacy-Preserving Inference on the Edge: Mitigating a New Threat Model , 2020 .

[57]  Benjamin R. Cowan,et al.  See What I’m Saying? Comparing Intelligent Personal Assistant Use for Native and Non-Native Language Speakers , 2020, MobileHCI.

[58]  Claire Wladis,et al.  An investigation of course-level factors as predictors of online STEM course outcomes , 2014, Comput. Educ..

[59]  Alessandro D’Ausilio,et al.  Using Arduino microcontroller boards to measure response latencies , 2013, Behavior research methods.

[60]  Richard Mayer,et al.  Multimedia Learning , 2001, Visible Learning Guide to Student Achievement.

[61]  Ioannis Iossifidis,et al.  Autonomous driving: A comparison of machine learning techniques by means of the prediction of lane change behavior , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[62]  Hesham A. Rakha,et al.  Applying Machine Learning Techniques to Transportation Mode Recognition Using Mobile Phone Sensor Data , 2015, IEEE Transactions on Intelligent Transportation Systems.

[63]  Erhan Guven,et al.  A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection , 2016, IEEE Communications Surveys & Tutorials.

[64]  V. Tiwari MFCC and its applications in speaker recognition , 2010 .

[65]  Yaxin Bi,et al.  KNN Model-Based Approach in Classification , 2003, OTM.

[66]  H. Frank Cervone,et al.  Applied digital library project management: Using Pugh matrix analysis in complex decision-making situations , 2009, OCLC Syst. Serv..

[67]  Kathleen C. Fraser,et al.  Linguistic Features Identify Alzheimer's Disease in Narrative Speech. , 2015, Journal of Alzheimer's disease : JAD.

[68]  Martin Wattenberg,et al.  TensorFlow.js: Machine Learning for the Web and Beyond , 2019, MLSys.

[69]  Grant Potter,et al.  Machine Learning for Kids , 2018 .

[70]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.