Flattened Convolutional Neural Networks for Feedforward Acceleration

We present flattened convolutional neural networks designed for fast feedforward execution. The redundancy of the parameters, especially the weights of the convolutional filters, has been studied extensively, and various heuristics have been proposed to construct a low-rank basis of the filters after training. In this work, we instead train flattened networks that consist of consecutive sequences of one-dimensional filters across all directions in 3D space and obtain performance comparable to that of conventional convolutional networks. We tested the flattened model on several datasets and found that flattened layers can effectively substitute for 3D filters without loss of accuracy. Owing to the significant reduction in the number of learned parameters, the flattened convolution pipelines provide around a two-times speed-up during the feedforward pass compared to the baseline model. Furthermore, the proposed method requires no manual tuning or post-processing once the model is trained.
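To make the idea concrete, below is a minimal PyTorch sketch of a flattened layer that replaces a full C x Y x X filter with a chain of one-dimensional convolutions over the lateral (channel), vertical, and horizontal directions. This is not the authors' Torch7 implementation; the class name FlattenedConv2d, the square k x k assumption, and the depthwise grouping of the spatial 1D filters are assumptions made for illustration.

```python
import torch
import torch.nn as nn


class FlattenedConv2d(nn.Module):
    """Hypothetical sketch: a 3D (C x Y x X) filter approximated by three 1D convolutions."""

    def __init__(self, in_channels, out_channels, kernel_size, padding=0):
        super().__init__()
        k = kernel_size
        # Lateral direction: 1x1 convolution mixing the input channels.
        self.lateral = nn.Conv2d(in_channels, out_channels, kernel_size=1, bias=False)
        # Vertical direction: k x 1 convolution applied per feature map (depthwise).
        self.vertical = nn.Conv2d(out_channels, out_channels, kernel_size=(k, 1),
                                  padding=(padding, 0), groups=out_channels, bias=False)
        # Horizontal direction: 1 x k convolution applied per feature map (depthwise).
        self.horizontal = nn.Conv2d(out_channels, out_channels, kernel_size=(1, k),
                                    padding=(0, padding), groups=out_channels, bias=True)

    def forward(self, x):
        # Chain the three 1D convolutions in place of one dense 3D convolution.
        return self.horizontal(self.vertical(self.lateral(x)))


if __name__ == "__main__":
    # Example: a flattened substitute for a 64 -> 96, 5x5 convolutional layer.
    layer = FlattenedConv2d(in_channels=64, out_channels=96, kernel_size=5, padding=2)
    out = layer(torch.randn(1, 64, 32, 32))
    print(out.shape)  # torch.Size([1, 96, 32, 32])
```

The parameter saving is visible directly: a dense k x k layer needs roughly C_in * C_out * k^2 weights, whereas the flattened chain above needs about C_in * C_out + 2 * C_out * k, which is where the feedforward speed-up comes from.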
