Partitioning of CNN Models for Execution on Fog Devices

Fog Computing has in recent times captured the imagination of industrial and research organizations working on various aspects of connected livelihood and governance of smart cities. Improvements in deep neural networks imply extensive use of such models for analytics and inferencing on large volume of data, including sensor observations, images, speech. A growing need for such inferencing to be run on devices closer to the data sources, i.e. devices which reside at the edge of the network, popularly known as fog devices exists, in order to reduce the upstream network traffic. However, being computationally constrained in nature, executing complex deep inferencing models on such devices has been proved difficult. This has led to several new approaches to partition/distribute the computation and/or data over multiple fog devices. In this paper we propose a novel depth-wise input partitioning scheme for CNN models and experimentally prove that it achieves better performance compared to row/column or grid based schemes.

[1]  Yogesh L. Simmhan,et al.  ECHO: An Adaptive Orchestration Platform for Hybrid Dataflows across Cloud and Edge , 2017, ICSOC.

[2]  Hua-Jun Hong,et al.  Distributed analytics in fog computing platforms using tensorflow and kubernetes , 2017, 2017 19th Asia-Pacific Network Operations and Management Symposium (APNOMS).

[3]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[4]  Walid Saad,et al.  Deep Learning for Reliable Mobile Edge Analytics in Intelligent Transportation Systems: An Overview , 2017, IEEE Vehicular Technology Magazine.

[5]  Burak Kantarci,et al.  Big Sensed Data Meets Deep Learning for Smarter Health Care in Smart Cities , 2017, J. Sens. Actuator Networks.

[6]  Massoud Pedram,et al.  FFT-based deep learning deployment in embedded systems , 2017, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[7]  Shahzad Khan,et al.  Artificial intelligence framework for smart city microgrids: State of the art, challenges, and opportunities , 2018, 2018 Third International Conference on Fog and Mobile Edge Computing (FMEC).

[8]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Mahadev Satyanarayanan,et al.  The Emergence of Edge Computing , 2017, Computer.

[10]  Björn W. Schuller,et al.  Convolutional RNN: An enhanced model for extracting features from sequential data , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[11]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Rashid Mehmood,et al.  UbeHealth: A Personalized Ubiquitous Cloud and Edge-Enabled Networked Healthcare System for Smart Cities , 2018, IEEE Access.

[13]  Samy Bengio,et al.  The Handbook of Brain Theory and Neural Networks , 2002 .

[14]  Ansuman Banerjee,et al.  A Framework for Speculative Scheduling and Device Selection for Task Execution on a Mobile Cloud , 2014, ARMS-CC@PODC.

[15]  Ranjan Dasgupta,et al.  Fast Boot User Experience Using Adaptive Storage Partitioning , 2009, 2009 Computation World: Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns.

[16]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[17]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[18]  Nicholas D. Lane,et al.  DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices , 2016, 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[19]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[20]  Yiran Chen,et al.  MoDNN: Local distributed mobile computing system for Deep Neural Network , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[21]  Feng Qian,et al.  Enabling Cooperative Inference of Deep Learning on Wearables and Smartphones , 2017, ArXiv.

[22]  IqbalMuddesar,et al.  Multi-access edge computing , 2017 .

[23]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[24]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[25]  Jerry Zeyu Gao,et al.  An edge-based smart mobile service system for illegal dumping detection and monitoring in San Jose , 2017, 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[26]  Mounir Hamdi,et al.  Parallel Image Processing Applications on a Network of Workstations , 1995, Parallel Comput..

[27]  Xu Chen,et al.  Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy , 2018, MECOMM@SIGCOMM.

[28]  Maxime Pelcat,et al.  Accelerating CNN inference on FPGAs: A Survey , 2018, ArXiv.

[29]  Michael I. Jordan,et al.  The Handbook of Brain Theory and Neural Networks , 2002 .