A discriminative cascade CNN model for offline handwritten digit recognition

This paper presents a high-performance two-stage cascade CNN model. The main idea behind the cascade CNN model is complementary classification objectives between Stage I and Stage II. Discriminative learning is introduced to train Stage II by feeding back poorly recognized training samples. Experiments have been conducted on the competitive MNIST handwritten digit database. The cascade model achieved the best state-of-the-art performance with an error rate of 0.18%.

[1]  Ching Y. Suen,et al.  A trainable feature extractor for handwritten digit recognition , 2007, Pattern Recognit..

[2]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[3]  Xin Li,et al.  An MQDF-CNN Hybrid Model for Offline Handwritten Chinese Character Recognition , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[4]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[5]  Ching Y. Suen,et al.  A novel hybrid CNN-SVM classifier for recognizing handwritten digits , 2012, Pattern Recognit..

[6]  Sargur N. Srihari,et al.  Offline Chinese handwriting recognition: an assessment of current technology , 2007, Frontiers of Computer Science in China.

[7]  Ka-Chung Leung,et al.  Recognition of Handwritten Chinese Characters by Combining Regularization, Fisher's Discriminant and Distorted Sample Generation , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[8]  Yichuan Tang,et al.  Deep Learning using Linear Support Vector Machines , 2013, 1306.0239.

[9]  Luca Maria Gambardella,et al.  Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition , 2010, ArXiv.

[10]  Bernhard Schölkopf,et al.  Training Invariant Support Vector Machines , 2002, Machine Learning.

[11]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[12]  Luca Maria Gambardella,et al.  Deep, Big, Simple Neural Nets for Handwritten Digit Recognition , 2010, Neural Computation.

[13]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[14]  Zhaolei Zhang,et al.  A Deep Non-linear Feature Mapping for Large-Margin kNN Classification , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[15]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Yann LeCun,et al.  Regularization of Neural Networks using DropConnect , 2013, ICML.

[17]  Jürgen Schmidhuber,et al.  A committee of neural networks for traffic sign classification , 2011, The 2011 International Joint Conference on Neural Networks.

[18]  Luca Maria Gambardella,et al.  Convolutional Neural Network Committees for Handwritten Character Classification , 2011, 2011 International Conference on Document Analysis and Recognition.

[19]  Luca Maria Gambardella,et al.  Convolutional Neural Support Vector Machines: Hybrid Visual Pattern Classifiers for Multi-robot Systems , 2012, 2012 11th International Conference on Machine Learning and Applications.