A Review on Feature Extraction and Feature Selection for Handwritten Character Recognition

The development of handwriting character recognition (HCR) is an interesting area in pattern recognition. HCR system consists of a number of stages which are preprocessing, feature extraction, classification and followed by the actual recognition. It is generally agreed that one of the main factors influencing performance in HCR is the selection of an appropriate set of features for representing input samples. This paper provides a review of these advances. In a HCR, the set of features plays as main issues, as procedure in choosing the relevant feature that yields minimum classification error. To overcome these issues and maximize classification performance, many techniques have been proposed for reducing the dimensionality of the feature space in which data have to be processed. These techniques, generally denoted as feature reduction, may be divided in two main categories, called feature extraction and feature selection. A large number of research papers and reports have already been published on this topic. In this paper we provide an overview of some of the methods and approach of feature extraction and selection. Throughout this paper, we apply the investigation and analyzation of feature extraction and selection approaches in order to obtain the current trend. Throughout this paper also, the review of metaheuristic harmony search algorithm (HSA) has provide.

[1]  Mohammad Saniee Abadeh,et al.  Image steganalysis using a bee colony based feature selection algorithm , 2014, Eng. Appl. Artif. Intell..

[2]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[3]  Habibollah Haron,et al.  Recognition of Isolated Handwritten Latin Characters using One Continuous Route of Freeman Chain Code Representation and Feedforward Neural Network Classifier , 2010 .

[4]  Atul Sajjanhar,et al.  Polar Transformation System for Offline Handwritten Character Recognition , 2011, SNPD.

[5]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Benton H. Calhoun,et al.  Body Area Sensor Networks: Challenges and Opportunities , 2009, Computer.

[7]  Haradhan Chel,et al.  Scaled Conjugate Gradient Algorithm in Neural Network Based Approach for Handwritten Text Recognition , 2011, CSE 2011.

[8]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[9]  Ramón M. Rodríguez-Dagnino,et al.  Efficiency of chain codes to represent binary objects , 2007, Pattern Recognit..

[10]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[11]  Lei Li,et al.  Handwritten character recognition via direction string and nearest neighbor matching , 2012 .

[12]  Santanu Chaudhury,et al.  Devnagari numeral recognition by combining decision of multiple connectionist classifiers , 2002 .

[13]  Romesh Ranawana,et al.  Use of fuzzy feature descriptions to recognize handwritten alphanumeric characters , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[14]  M. Nasipuri,et al.  Region selection in handwritten character recognition using Artificial Bee Colony Optimization , 2012, 2012 Third International Conference on Emerging Applications of Information Technology.

[15]  Jianmin Jiang,et al.  Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking , 2011, Pattern Recognit. Lett..

[16]  Binu P. Chacko,et al.  Handwritten character recognition using wavelet energy and extreme learning machine , 2012, Int. J. Mach. Learn. Cybern..

[17]  Yang Yang,et al.  English Character Recognition Based on Feature Combination , 2011 .

[18]  Akira Suzuki,et al.  Feature Selection for Character Recognition Using Genetic Algorithm , 2009, 2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC).

[19]  D. S. Guru,et al.  Feature selection and indexing of online signatures , 2012, 2012 12th International Conference on Hybrid Intelligent Systems (HIS).

[20]  Siu Cheung Hui,et al.  Computational methods for Traditional Chinese Medicine: A survey , 2007, Comput. Methods Programs Biomed..

[21]  H. Isahara,et al.  Language, Script, and Encoding Identification with String Kernel Classifiers , 2006 .

[22]  Fatos T. Yarman-Vural,et al.  A heuristic algorithm for optical character recognition of Arabic script , 1997, Signal Process..

[23]  Jean Paul Frédéric Serra Morphological filtering: An overview , 1994, Signal Process..

[24]  P.S. Deshpande,et al.  Recognition of hand written devnagari characters with percentage component regular expression matching and classification tree , 2007, TENCON 2007 - 2007 IEEE Region 10 Conference.

[25]  Claudio De Stefano,et al.  A GA-based feature selection approach with an application to handwritten character recognition , 2014, Pattern Recognit. Lett..

[26]  Nafiz Arica,et al.  An overview of character recognition focused on off-line handwriting , 2001, IEEE Trans. Syst. Man Cybern. Syst..

[27]  Julius T. Tou,et al.  Recognition of Handwritten Characters by Topological Feature Extraction and Multilevel Categorization , 1972, IEEE Transactions on Computers.

[28]  M. N. Ayyaz,et al.  Handwritten Character Recognition Using Multiclass SVM Classification with Hybrid Feature Extraction , 2016 .

[29]  Kuo-Chin Fan,et al.  Document image preprocessing based on optimal Boolean filters , 2000, Signal Process..

[30]  Prachi Mukherji,et al.  Shape Feature and Fuzzy Logic Based Offline Devnagari Handwritten Optical Character Recognition , 2010 .

[31]  Yves Lecourtier,et al.  Combining structural and statistical features for the recognition of handwritten characters , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[32]  D. Andina,et al.  Feature selection using Sequential Forward Selection and classification applying Artificial Metaplasticity Neural Network , 2010, IECON 2010 - 36th Annual Conference on IEEE Industrial Electronics Society.

[33]  Tetsushi Wakabayashi,et al.  Comparative Study of Devnagari Handwritten Character Recognition Using Different Feature and Classifiers , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[34]  Borut Zalik,et al.  An efficient chain code with Huffman coding , 2005, Pattern Recognit..

[35]  Roberto Brunelli,et al.  MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 2001 .

[36]  Nadir Farah,et al.  Classifiers Combination for Arabic Words Recognition: Application to Handwritten Algerian City Names , 2012, ICISP.

[37]  M. Sarfraz Computer-Aided Intelligent Recognition Techniques and Applications , 2005 .

[38]  Mahantapas Kundu,et al.  A genetic algorithm based region sampling for selection of local features in handwritten digit recognition application , 2012, Appl. Soft Comput..

[39]  Christopher Kermorvant,et al.  Features for HMM-Based Arabic Handwritten Word Recognition Systems , 2012 .

[40]  Geoff Dougherty Feature Extraction and Selection , 2013 .

[41]  Bidyut Baran Chaudhuri,et al.  Offline recognition of handwritten Bangla characters: an efficient two-stage approach , 2012, Pattern Analysis and Applications.

[42]  C. Y. Suen,et al.  Optimal local weighted averaging methods in contour smoothing , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Paul C. K. Kwok,et al.  A thinning algorithm by contour generation , 1988, CACM.

[44]  R. Ravindra Kumar,et al.  Malayalam Offline Handwritten Recognition Using Probabilistic Simplified Fuzzy ARTMAP , 2012, ISI.

[45]  K. P. Primekumar,et al.  On-line Malayalam handwritten character recognition using HMM and SVM , 2013, 2013 International Conference on Signal Processing , Image Processing & Pattern Recognition.

[46]  G. S. Reddy,et al.  Combined online and offline assamese handwritten numeral recognizer , 2012, 2012 National Conference on Communications (NCC).

[47]  Yan Solihin,et al.  Integral Ratio: A New Class of Global Thresholding Techniques for Handwriting Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  Zhong-Zhi Shi,et al.  Research of pattern feature extraction and selection , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[49]  János Kormos,et al.  Recognition of chain-coded patches with statistical methods , 2003 .

[50]  Gebrail Bekdas,et al.  Harmony Search Algorithm Approach for Optimum Design of Post-Tensioned Axially Symmetric Cylindrical Reinforced Concrete Walls , 2015, J. Optim. Theory Appl..

[51]  Josef Kittler,et al.  Fast branch & bound algorithms for optimal feature selection , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Mineichi Kudo,et al.  Comparison of algorithms that select features for pattern classifiers , 2000, Pattern Recognit..

[53]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[54]  Utpal Roy,et al.  Discriminative HMM training with GA for handwritten word recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[55]  Gour C. Karmakar,et al.  Bezier Curve-Based Character Descriptor Considering Shape Information , 2007, 6th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2007).

[56]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[57]  Azriel Rosenfeld,et al.  The Chain Pyramid: Hierarchical Contour Processing , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  Subhadip Basu,et al.  An Axiomatic Fuzzy Set Theory Based Feature Selection Methodology for Handwritten Numeral Recognition , 2014 .

[59]  Herbert Freeman,et al.  Computer Processing of Line-Drawing Images , 1974, CSUR.

[60]  Luigi P. Cordella,et al.  Combining Single Class Features for Improving Performance of a Two Stage Classifier , 2010, 2010 20th International Conference on Pattern Recognition.

[61]  Jung-Hsien Chiang,et al.  Neural and Fuzzy Methods in Handwriting Recognition , 1997, Computer.

[62]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[63]  Bernd Jähne,et al.  Digital Image Processing: Concepts, Algorithms, and Scientific Applications , 1991 .

[64]  Habibollah Haron,et al.  The Heuristic extraction algorithms for freeman chain code of handwritten character , 2011 .

[65]  Sabri A. Mahmoud,et al.  Printed Arabic Text Recognition , 2012 .

[66]  Mahantapas Kundu,et al.  Recognition of Non-Compound Handwritten Devnagari Characters using a Combination of MLP and Minimum Edit Distance , 2010, ArXiv.

[67]  Habibollah Haron,et al.  Metaheuristics Methods (GA and ACO) for Minimizing the Length of Freeman Chain Code from Handwritten Isolated Characters , 2010 .

[68]  Dharam Veer Sharma,et al.  Comparison of Feature Extraction Methods for Recognition of Isolated Handwritten Characters in Gurmukhi Script , 2011, ICIS 2011.

[69]  Xiaobo Li,et al.  Boundary detection using mathematical morphology , 1995, Pattern Recognit. Lett..

[70]  David Austin,et al.  Visual Object Recognition using Template Matching , 2004 .

[71]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[72]  P. Vanaja Ranjan,et al.  Zone based Feature Extraction Algorithm for Handwritten Numeral Recognition of Kannada Script , 2009, 2009 IEEE International Advance Computing Conference.

[73]  Ganapatsingh G. Rajput,et al.  Handwritten Kannada Vowel Character Recognition Using Crack Codes and Fourier Descriptors , 2011, MIWAI.

[74]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[75]  Carlos Cabrelli,et al.  Automatic Representation of Binary Images , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[76]  Leonid I. Perlovsky,et al.  Conundrum of Combinatorial Complexity , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[77]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[78]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[79]  Mallikarjun Hangarge,et al.  Global and Local Features Based Handwritten Text Words and Numerals Script Identification , 2007, International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007).

[80]  Milan Sonka,et al.  Image processing analysis and machine vision [2nd ed.] , 1999 .

[81]  Mahantapas Kundu,et al.  Combining Multiple Feature Extraction Techniques for Handwritten Devnagari Character Recognition , 2008, 2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems.

[82]  Jia-Guu Leu,et al.  Edge sharpening through ramp width reduction , 2000, Image Vis. Comput..

[83]  S. Impedovo,et al.  Optical Character Recognition - a Survey , 1991, Int. J. Pattern Recognit. Artif. Intell..

[84]  Habibollah Haron,et al.  The evolution and trend of chain code scheme , 2008 .

[85]  William E. Higgins,et al.  Shape Representation: Comparison between the Morphological Skeleton and Morphological Shape Decomposition , 1994, ICIP.

[86]  Fatime Mahamat Fadoul Development of mapping and visualizing algorithm of vertex chain code from thinned binary image , 2008 .

[87]  Chenyang Lu,et al.  Reliable clinical monitoring using wireless sensor networks: experiences in a step-down hospital unit , 2010, SenSys '10.

[88]  Ching Y. Suen,et al.  Computer recognition of unconstrained handwritten numerals , 1992, Proc. IEEE.

[89]  Claudio De Stefano,et al.  A feature selection algorithm for class discrimination improvement , 2007, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[90]  Gaurav Harit,et al.  Devising interactive access techniques for Indian language document images , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[91]  Amit Choudhary,et al.  Unconstrained Handwritten Digit OCR Using Projection Profile and Neural Network Approach , 2012 .

[92]  Stavros J. Perantonis,et al.  Handwritten character recognition through two-stage foreground sub-sampling , 2010, Pattern Recognit..

[93]  Ching Y. Suen,et al.  Analysis of Class Separation and Combination of Class-Dependent Features for Handwriting Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[94]  Jack Sklansky,et al.  A note on genetic algorithms for large-scale feature selection , 1989, Pattern Recognit. Lett..

[95]  Sebastiano Impedovo,et al.  Fundamentals in Handwriting Recognition , 1994, NATO ASI Series.

[96]  Reza Azmi,et al.  A hybrid GA and SA algorithms for feature selection in recognition of hand-printed Farsi characters , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[97]  Kidiyo Kpalma,et al.  An Overview of Advances of Pattern Recognition Systems in Computer Vision , 2007 .

[98]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[99]  Giovanni Ramponi,et al.  Adaptive unsharp masking for contrast enhancement , 1997, Proceedings of International Conference on Image Processing.

[100]  Fabio Roli,et al.  A note on core research issues for statistical pattern recognition , 2002, Pattern Recognit. Lett..

[101]  Mohamed Othman,et al.  Chain Coding and Pre Processing Stages of Handwritten Character Image File , 2010 .

[102]  Abdelmajid Ben Hamadou,et al.  Multi-stream Markov Models for Arabic Handwriting Recognition , 2012 .

[103]  Chong-Ho Choi,et al.  Input feature selection for classification problems , 2002, IEEE Trans. Neural Networks.