A Locally Distributed Mobile Computing Framework for DNN based Android Applications

In recent years, with the development of deep neural network (DNN), more and more applications (e.g., image classification, target recognition and audio processing) are supported by it. However, the disadvantage of its own large model makes it difficult to apply on resource-constrained devices such as mobile devices. In order to solve this problem, the existing research and technology mainly focus on the DNN model compression and the segmentation migration of the model. The former is generally at the expense of reducing accuracy, and the segmentation of the model has no unified migration tool for the DNN model of different applications. In this work, we propose a universal neural network layer segmentation tool, which enables the trained DNN model to be migrated, and migrates the segmentation layer to the nodes in the current network in accordance with the dynamic optimal allocation algorithm proposed in this paper. The experimental results show that the tool can adapt to various neural networks with different structures and perform optimal allocation of layers through algorithm. When the number of working nodes increases from 1 to 5, this method can speed up DNN 2-2.5 times, and shows a good acceleration effect.

[1]  Osonde Osoba,et al.  Noise-enhanced convolutional neural networks , 2016, Neural Networks.

[2]  Trevor N. Mudge,et al.  Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge , 2017, ASPLOS.

[3]  Yixin Chen,et al.  Compressing Neural Networks with the Hashing Trick , 2015, ICML.

[4]  Quan Chen,et al.  DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[5]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[6]  Ronald G. Dreslinski,et al.  A hybrid approach to offloading mobile image classification , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Alec Wolman,et al.  MCDNN: An Approximation-Based Execution Framework for Deep Stream Processing Under Resource Constraints , 2016, MobiSys.

[9]  Dawei Li,et al.  DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices , 2017, AAAI.

[10]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[11]  Alexander Gruenstein,et al.  Accurate and compact large vocabulary speech recognition on mobile devices , 2013, INTERSPEECH.

[12]  Alexander J. Smola,et al.  Scaling Distributed Machine Learning with the Parameter Server , 2014, OSDI.

[13]  Yiran Chen,et al.  MoDNN: Local distributed mobile computing system for Deep Neural Network , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[14]  Stefano Tubaro,et al.  Deep Convolutional Neural Networks for pedestrian detection , 2015, Signal Process. Image Commun..

[15]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.