论文信息 - Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network (CNN) using a minimal model (Minimal CNN). The proposed minimal CNN is presented using a layering approach. This approach provides a concise and accessible understanding of the main mathematical operations of a CNN. Hence, it benefits beginners and non-mathematical prolific learners to acquire foundational knowledge that informs a principled understanding of CNNs without having an intimidating experience. A handwritten digit recognition using MNIST handwritten digit dataset is used to experiment the performance of the proposed minimal CNN with other neural networks.

Matthew Y. W. Teow | M. Teow

[1] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[2] P. Lennie. Receptive fields , 2003, Current Biology.

[3] Dong Yu,et al. Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..

[4] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[6] D. Hubel,et al. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[7] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[8] Yoshua Bengio. Machines Who Learn. , 2016, Scientific American.

[9] Takayuki Ito,et al. Neocognitron: A neural network model for a mechanism of visual pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[10] Yann LeCun,et al. Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[11] W. Walter. A Machine that Learns , 1951 .

[12] Jean Ponce,et al. A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.

[13] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[16] C. Martin. 2015 , 2015, Les 25 ans de l’OMC: Une rétrospective en photos.