Sample-separation-margin based minimum classification error training of pattern classifiers with quadratic discriminant functions

In this paper, we present a new approach to minimum classification error (MCE) training of pattern classifiers with quadratic discriminant functions. First, a so-called sample separation margin (SSM) is defined for each training sample and then used to define the misclassification measure in the MCE formulation. The computation of the SSM can be cast as a nonlinear constrained optimization problem and solved efficiently. Experimental results on a large-scale isolated online handwritten Chinese character recognition task demonstrate that SSM-based MCE training not only decreases the empirical classification error but also pushes the training samples away from the decision boundaries, thereby achieving good generalization. Compared with conventional MCE training, an additional 7% to 18% relative error rate reduction is observed in our experiments.
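For orientation, the conventional MCE baseline referred to above can be sketched as follows. This is a minimal illustration under stated assumptions, not the SSM formulation of this paper: the discriminant uses a diagonal covariance, the misclassification measure takes a hard maximum over rival classes (conventional MCE often uses a smoothed version), and the names `quadratic_discriminant`, `misclassification_measure`, `mce_loss`, and the toy `classes` data are hypothetical.

```python
import math

def quadratic_discriminant(x, mean, var):
    # Assumed form: Gaussian log-likelihood with diagonal covariance,
    # up to constants. It is quadratic in x.
    return -sum((xi - mi) ** 2 / vi + math.log(vi)
                for xi, mi, vi in zip(x, mean, var))

def misclassification_measure(x, label, classes):
    # d > 0 means some rival class scores higher than the true class.
    scores = {c: quadratic_discriminant(x, m, v)
              for c, (m, v) in classes.items()}
    best_rival = max(s for c, s in scores.items() if c != label)
    return best_rival - scores[label]

def mce_loss(d, alpha=1.0):
    # Sigmoid smoothing of the 0/1 error; alpha is an assumed slope parameter.
    return 1.0 / (1.0 + math.exp(-alpha * d))

# Toy two-class problem (illustrative data only): class -> (mean, variances).
classes = {"a": ([0.0, 0.0], [1.0, 1.0]),
           "b": ([3.0, 0.0], [1.0, 1.0])}

d_correct = misclassification_measure([0.5, 0.0], "a", classes)  # near class a
d_wrong = misclassification_measure([2.5, 0.0], "a", classes)    # near class b
print(d_correct, mce_loss(d_correct))
print(d_wrong, mce_loss(d_wrong))
```

In this baseline, the measure `d` is a difference of discriminant scores. The approach described in the abstract instead defines the misclassification measure through the SSM, which the code above does not attempt to compute.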
