Recognizing Handwritten Characters with Local Descriptors and Bags of Visual Words

In this paper we propose applying several feature extraction methods that have previously performed well in object recognition to the recognition of handwritten characters. These methods are the histogram of oriented gradients (HOG), a bag of visual words built from pixel intensity information (BOW), and a bag of visual words built from extracted HOG features (HOG-BOW). We compare them with three well-known techniques: principal component analysis, the discrete cosine transform, and the direct use of pixel intensities. The extracted features are classified by three types of support vector machines: a linear SVM, an SVM with the RBF kernel, and a linear SVM using L2-regularization. We evaluated the six feature descriptors and three SVM classifiers on three handwritten character datasets: Bangla, Odia and MNIST. The results show that the HOG-BOW, BOW and HOG methods significantly outperform the other methods. The HOG-BOW method performs best with the L2-regularized SVM and obtains very high recognition accuracies on all three datasets.
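To make the HOG idea concrete, the following is a minimal sketch of a simplified HOG-style descriptor in plain Python. It is an illustration under stated assumptions, not the configuration used in the paper: the function name `hog_descriptor`, the cell size, the number of orientation bins, and the single global L2 normalization (instead of overlapping block normalization and bin interpolation) are all choices made here for brevity.

```python
import math


def hog_descriptor(image, cell_size=4, n_bins=9):
    """Simplified HOG: per-cell histograms of unsigned gradient orientation.

    `image` is a list of rows of grey values. `cell_size` and `n_bins`
    are illustrative defaults, not values taken from the paper.
    """
    height, width = len(image), len(image[0])
    cells_y, cells_x = height // cell_size, width // cell_size
    hist = [[[0.0] * n_bins for _ in range(cells_x)] for _ in range(cells_y)]
    bin_width = 180.0 / n_bins

    for y in range(cells_y * cell_size):
        for x in range(cells_x * cell_size):
            # Central differences, clamped at the image border.
            gx = image[y][min(x + 1, width - 1)] - image[y][max(x - 1, 0)]
            gy = image[min(y + 1, height - 1)][x] - image[max(y - 1, 0)][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0.0:
                continue
            # Unsigned orientation in [0, 180) degrees.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            b = int(angle / bin_width) % n_bins
            # Each pixel votes for one bin, weighted by gradient magnitude.
            hist[y // cell_size][x // cell_size][b] += magnitude

    # Concatenate all cell histograms and apply one global L2 normalization
    # (a simplification; standard HOG normalizes over overlapping blocks).
    features = [v for row in hist for cell in row for v in cell]
    norm = math.sqrt(sum(v * v for v in features)) or 1.0
    return [v / norm for v in features]


# Toy 8x8 image with a vertical edge: left half dark, right half bright.
edge_image = [[0, 0, 0, 0, 1, 1, 1, 1] for _ in range(8)]
edge_features = hog_descriptor(edge_image)
```

For the toy edge image, every gradient is horizontal, so all of the histogram mass falls into the first orientation bin of each of the four cells. In a HOG-BOW style pipeline, descriptors like this would be computed on local patches, quantized against a learned codebook, and the resulting histogram passed to an SVM; those steps are omitted here.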
