Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices

Edge devices and robots have access to an abundance of raw data that needs to be processed on the edge. Deep neural networks (DNNs) can help these devices understand and learn from this complex data; however, executing DNNs while achieving high performance is a challenge for edge devices. This is because of the high computational demands of DNN execution in real-time. This paper describes and implements a method to enable edge devices to execute DNNs collaboratively. This is possible and useful because in many environments, several on-edge devices are already integrated in their surroundings, but are usually idle and can provide additional computing power to a distributed system. We implement this method on two iRobots, each of which has been equipped with a Raspberry Pi 3. Then, we characterize the execution performance, communication latency, energy consumption, and thermal behavior of our system while it is executing AlexNet.

[1]  Michael S. Ryoo,et al.  Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices , 2018, ArXiv.

[2]  Michael S. Ryoo,et al.  Real-Time Image Recognition Using Collaborative IoT Devices , 2018, ReQuEST@ASPLOS.

[3]  Michael S. Ryoo,et al.  Late Breaking Results: Robustly Executing DNNs in IoT Systems Using Coded Distributed Computing ∗ , 2019, 2019 56th ACM/IEEE Design Automation Conference (DAC).

[4]  Andrew Zisserman,et al.  Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[5]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[6]  Jiwen Lu,et al.  Runtime Neural Pruning , 2017, NIPS.

[7]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[8]  Ramyad Hadidi,et al.  An Edge-Centric Scalable Intelligent Framework To Collaboratively Execute DNN , 2019 .

[9]  Roland Siegwart,et al.  From perception to decision: A data-driven approach to end-to-end motion planning for autonomous ground robots , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[10]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[11]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jürgen Schmidhuber,et al.  A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots , 2016, IEEE Robotics and Automation Letters.

[13]  In Lee,et al.  The Internet of Things (IoT): Applications, investments, and challenges for enterprises , 2015 .

[14]  Yiran Chen,et al.  Learning Structured Sparsity in Deep Neural Networks , 2016, NIPS.

[15]  Sarmad Ullah Khan,et al.  Future Internet: The Internet of Things Architecture, Possible Applications and Key Challenges , 2012, 2012 10th International Conference on Frontiers of Information Technology.

[16]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[17]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[18]  Michael S. Ryoo,et al.  Collaborative Execution of Deep Neural Networks on Internet of Things Devices , 2019, ArXiv.

[19]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[20]  Michael S. Ryoo,et al.  Distributed Perception by Collaborative Robots , 2018, IEEE Robotics and Automation Letters.

[21]  Huimin Lu,et al.  Motor Anomaly Detection for Unmanned Aerial Vehicles Using Reinforcement Learning , 2018, IEEE Internet of Things Journal.

[22]  Lida Xu,et al.  The internet of things: a survey , 2014, Information Systems Frontiers.

[23]  Yoshua Bengio,et al.  Training deep neural networks with low precision multiplications , 2014 .

[24]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[25]  Ashutosh Kumar Singh,et al.  Machine Learning for High-Throughput Stress Phenotyping in Plants. , 2016, Trends in plant science.

[26]  Sudhakar Yalamanchili,et al.  LODESTAR: Creating Locally-Dense CNNs for Efficient Inference on Systolic Arrays* , 2019, 2019 56th ACM/IEEE Design Automation Conference (DAC).

[27]  Францкевич Кирилл Эдуардович,et al.  ИССЛЕДОВАНИЕ КЛАСТЕРНОЙ СИСТЕМЫ НА ОСНОВЕ ОДНОПЛАТНЫХ КОМПЬЮТЕРОВ RASPBERRY PI 3B , 2019 .

[28]  Erdogan Dogdu,et al.  Context-Aware Computing, Learning, and Big Data in Internet of Things: A Survey , 2018, IEEE Internet of Things Journal.

[29]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[30]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[31]  Scott A. Mahlke,et al.  Scalpel: Customizing DNN pruning to the underlying hardware parallelism , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).

[32]  Ming Yang,et al.  Compressing Deep Convolutional Networks using Vector Quantization , 2014, ArXiv.

[33]  Xin Wang,et al.  Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks , 2017, NIPS.

[34]  Vincent Vanhoucke,et al.  Improving the speed of neural networks on CPUs , 2011 .

[35]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.