Feature Extraction based on DCT for Handwritten Digit Recognition

Feature extraction is a crucial and challenging step in many pattern recognition problems and especially in handwritten digit recognition applications. However, the extraction of the most informative features with highly discriminatory ability to improve the classification accuracy and reduce complexity remains one of the most important problems for this task. This work investigates the effectiveness of four feature extraction approaches based on Discrete Cosine Transform (DCT) to capture discriminative features for handwritten Digits recognition. These approaches are: DCT upper left corner (ULC) coefficients, DCT zigzag coefficients, block based DCT ULC coefficients and block based DCT zigzag coefficients. The coefficients of each DCT variant are used as input data for Support Vector Machine (SVM) classifier to evaluate their performances. The objective of this work is to identify the optimal feature extraction approach that speed up the learning algorithms while maximizing the classification accuracy. We used the well known MNIST database in two variants: raw data and preprocessed data that dismisses the non-informative region of the observations in the dataset. The results have been analyzed and compared in terms of classification accuracy and reduction rate and the findings have demonstrated that the block based DCT zigzag feature extraction yields a superior performance than its counterparts.

[1]  Sun Zhen,et al.  Recognition Method for Handwritten Digits Based on Improved Chain Code Histogram Feature , 2013, ICMT 2013.

[2]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[3]  Cheng-Lin Liu,et al.  Gabor feature extraction for character recognition: comparison with gradient feature , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[4]  Hiroshi Sako,et al.  Handwritten digit recognition: investigation of normalization and feature extraction techniques , 2004, Pattern Recognit..

[5]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[6]  Patrick J. Grother,et al.  NIST Special Database 19 Handprinted Forms and Characters Database , 1995 .

[7]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[8]  Dexin Zhang,et al.  DCT-Based Iris Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  E.D. Di Claudio,et al.  Car plate recognition by neural networks and image processing , 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187).

[10]  Phalguni Gupta,et al.  An Efficient Palmprint Based Recognition System Using 1D-DCT Features , 2012, ICIC.

[11]  Jinchang Ren,et al.  Word-based handwritten Arabic scripts recognition using DCT features and neural network classifier , 2008, 2008 5th International Multi-Conference on Systems, Signals and Devices.

[12]  Min-Jen Tsai,et al.  DCT and DWT-based image watermarking by using subsampling , 2004, 24th International Conference on Distributed Computing Systems Workshops, 2004. Proceedings..

[13]  Ali Aghagolzadeh,et al.  Feature extraction using discrete cosine transform and discrimination power analysis with a face recognition technology , 2010, Pattern Recognit..

[14]  Esam Abdel-Raheem,et al.  Online Arabic/Persian character recognition using neural network classifier and DCT features , 2011, 2011 IEEE 54th International Midwest Symposium on Circuits and Systems (MWSCAS).

[15]  Cheng-Lin Liu,et al.  Handwritten digit recognition: benchmarking of state-of-the-art techniques , 2003, Pattern Recognit..

[16]  Jeet Kumar,et al.  An Efficient Pose Invariant Face Recognition System , 2011, SocProS.

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  Yun Lei,et al.  Effective use of DCTS for contextualizing features for speaker recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Harris Drucker,et al.  Comparison of learning algorithms for handwritten digit recognition , 1995 .

[20]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[21]  Bao-Gang Hu,et al.  Robust feature extraction via information theoretic learning , 2009, ICML '09.

[22]  H.M.R. Abou-zeid,et al.  Computer recognition of unconstrained handwritten numerals , 2003, 2003 46th Midwest Symposium on Circuits and Systems.

[23]  K. Fliegel,et al.  Image Quality Evaluation in Security Imaging Systems , 2007, IEEE Aerospace and Electronic Systems Magazine.

[24]  L. Deng,et al.  The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web] , 2012, IEEE Signal Processing Magazine.

[25]  Uday B. Desai,et al.  Face recognition using a DCT-HMM approach , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[26]  Chong-Wah Ngo,et al.  Video text detection and segmentation for optical character recognition , 2005, Multimedia Systems.

[27]  Zabih Ghassemlooy,et al.  Handwritten Arabic Character Recognition: Which Feature Extraction Method? , 2011 .

[28]  Belaid Bouikhalene,et al.  Extraction Method of Handwritten Digit Recognition Tested on the MNIST Database , 2013 .