A Hybrid Method for Multiclass Classification and Its Application to Handwritten Character Recognition

The support vector machine (SVM) is an effective pattern classification method. However, solving N(N-1)/2 binary classifications in the training phase makes it too costly to use SVM in applications with a high number N of class types. In this paper, we propose a new prototype classification method that can be combined with SVM for pattern recognition. This hybrid method has the following merits. First, the learning algorithm for constructing prototypes determines both the number and the location of the prototypes. This algorithm not only terminates within a finite number of iterations, but also assures that each training sample matches in class type with the nearest prototype. Second, SVM can be used to process the top-rank candidates obtained by the prototype classification method, which saves time in both the training and testing processes. We apply our method to recognizing handwritten numerals and handwritten Chinese/Hiragana characters. Experiment results show that the hybrid method saves a great deal of training and testing time in large scale tasks and achieves comparable accuracy to that achieved by using SVM solely. Our results also show that the hybrid method performs better than the nearest neighbour method.

[1]  John Platt,et al.  Large Margin DAG's for Multiclass Classification , 1999 .

[2]  Gérard Dreyfus,et al.  Single-layer learning revisited: a stepwise procedure for building and training a neural network , 1989, NATO Neurocomputing.

[3]  Seong-Whan Lee,et al.  Nonlinear shape normalization methods for the recognition of large-set handwritten characters , 1994, Pattern Recognit..

[4]  Chun-Jen Chen,et al.  A linear-time component-labeling algorithm using contour tracing technique , 2004, Comput. Vis. Image Underst..

[5]  Michio Umeda Advances in Recognition Methods for Handwritten Kanji Characters (Special issue on Character Recognition and Document Understanding) , 1996 .

[6]  A. Levine,et al.  The nearest-neighbor rule for small samples drawn from uniform distributions (Corresp.) , 1973, IEEE Trans. Inf. Theory.

[7]  Jin Hyung Kim,et al.  High accuracy handwritten Chinese character recognition by improved feature matching method , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[8]  Hiromitsu Yamada,et al.  A nonlinear normalization method for handprinted kanji character recognition - line density equalization , 1990, Pattern Recognit..

[9]  Yianni Attikiouzel,et al.  An improved algorithm for border following of binary images , 1989 .

[10]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  E. Backer,et al.  Computer-assisted reasoning in cluster analysis , 1995 .

[12]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[13]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[14]  John F. O'Callaghan,et al.  An Alternative Definition for "Neighborhood of a Point" , 1975, IEEE Transactions on Computers.

[15]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[16]  Isabelle Guyon,et al.  Comparison of classifier methods: a case study in handwritten digit recognition , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[17]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[18]  Samy Bengio,et al.  Torch: a modular machine learning software library , 2002 .

[19]  H.M.R. Abou-zeid,et al.  Computer recognition of unconstrained handwritten numerals , 2003, 2003 46th Midwest Symposium on Circuits and Systems.

[20]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.