A genetic algorithm based clustering approach for improving off-line handwritten digit classification

This paper introduces a new clustering technique for improving off-line handwritten digit recognition. Clustering design is treated as an optimization problem in which the objective function to be minimized is the cost function associated with classification, which is performed here by a k-nearest neighbor (k-NN) classifier based on the Sokal and Michener dissimilarity measure. A genetic algorithm is used to determine the cluster centers that reduce classification time without a great loss in accuracy. An effective strategy for generating the initial population of the genetic algorithm is also presented. Experimental tests carried out on the MNIST database show the effectiveness of the method.
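As a rough illustration of the classification step, the sketch below implements a k-NN classifier over binary feature vectors using the Sokal and Michener dissimilarity, taken here as one minus the simple matching coefficient (the fraction of positions where two vectors disagree). It also shows a misclassification-rate cost of the kind a genetic algorithm could minimize when searching for cluster centers. The function names, the binary feature encoding, and the use of error rate as the cost are assumptions made for this example; the abstract does not specify them, and the genetic algorithm itself is not shown.

```python
from collections import Counter


def sokal_michener_dissimilarity(x, y):
    """Return 1 - simple matching coefficient for two binary vectors.

    This equals the fraction of positions at which x and y differ.
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    mismatches = sum(1 for a, b in zip(x, y) if a != b)
    return mismatches / len(x)


def knn_classify(query, prototypes, labels, k=1):
    """Label `query` by majority vote among its k nearest prototypes."""
    ranked = sorted(
        range(len(prototypes)),
        key=lambda i: sokal_michener_dissimilarity(query, prototypes[i]),
    )
    votes = Counter(labels[i] for i in ranked[:k])
    # On a tied vote, the label of the nearer prototype is returned,
    # because Counter keeps first-seen order among equal counts.
    return votes.most_common(1)[0][0]


def classification_cost(prototypes, proto_labels, samples, sample_labels, k=1):
    """Misclassification rate of k-NN on `samples` (assumed cost function)."""
    errors = sum(
        knn_classify(s, prototypes, proto_labels, k) != t
        for s, t in zip(samples, sample_labels)
    )
    return errors / len(samples)


if __name__ == "__main__":
    # Toy 4-bit "images": two cluster centers, one per class.
    centers = [[0, 0, 0, 0], [1, 1, 1, 1]]
    center_labels = [0, 1]
    samples = [[0, 0, 0, 1], [1, 1, 1, 0]]
    sample_labels = [0, 1]
    print(sokal_michener_dissimilarity([1, 0, 1, 1], [1, 1, 1, 0]))
    print(classification_cost(centers, center_labels, samples, sample_labels))
```

In this setting, a smaller set of cluster centers means fewer dissimilarity evaluations per query, which is where the reduction in classification time would come from.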
