Efficient matrixized classification learning with separated solution process

The matrix-pattern-oriented Ho–Kashyap classifier (MatMHKS), using two-sided weight vectors to constrain the matrixized samples, can deal with not only the vectorized sample but also the matrixized sample. For vectorized sample, by converting the vectorized mode into matrixized mode, MatMHKS relieves the curse of dimensionality and extends the expressive modes of sample. Although MatMHKS has been demonstrated to be effective in the classification performance, it consumes a lot of time to alternately update two weight vectors in each iteration. Moreover, MatMHKS is not suitable in dealing with imbalanced problems. Finally, there does not exist effective analysis of generalization risk for matrixized classifiers. To this end, this paper proposes an efficient matrixized Ho–Kashyap classifier (EMatMHKS), which separately updates the two-sided weight vectors to avoid repeatedly calculating the inverse matrix in MatMHKS, thus significantly improving the training speed. Moreover, by introducing a weight matrix, both balanced and imbalanced situations can be tackled. Finally, PAC-Bayes bound is used to reflect the error upper bound of matrixized and vectorized classifiers. Both balanced and imbalanced data sets are used to validate the effectiveness and the efficiency of the proposed EMatMHKS in the experiment.

[1]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[2]  Philip H. Ramsey Nonparametric Statistical Methods , 1974, Technometrics.

[3]  StaianoAntonino,et al.  Intrinsic dimension estimation , 2016 .

[4]  Matthias W. Seeger,et al.  PAC-Bayesian Generalisation Error Bounds for Gaussian Process Classification , 2003, J. Mach. Learn. Res..

[5]  John Shawe-Taylor,et al.  PAC Bayes and Margins , 2003 .

[6]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Jacek M. Łȩski,et al.  Ho--Kashyap classifier with generalization control , 2003 .

[8]  Heloisa A. Camargo,et al.  Imbalanced datasets in the generation of fuzzy classification systems - an investigation using a multiobjective evolutionary algorithm based on decomposition , 2016, 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[9]  Nong Sang,et al.  Regularized max-min linear discriminant analysis , 2017, Pattern Recognit..

[10]  Jacek M. Leski,et al.  Ho-Kashyap classifier with generalization control , 2003, Pattern Recognit. Lett..

[11]  Tian Yongjun and Chen Songcan,et al.  Matrix-Pattern-Oriented Ho-Kashyap Classifier with Regularization Learning , 2005 .

[12]  Sayan Mukherjee,et al.  Learning theory: stability is sufficient for generalization and necessary and sufficient for consistency of empirical risk minimization , 2006, Adv. Comput. Math..

[13]  Songcan Chen,et al.  Matrix-pattern-oriented Ho-Kashyap classifier with regularization learning , 2007, Pattern Recognit..

[14]  Wei Chen,et al.  A framework for data-driven analysis of materials under uncertainty: Countering the curse of dimensionality , 2017 .

[15]  Maciej Zieba,et al.  Service-Oriented Medical System for Supporting Decisions With Missing and Imbalanced Data , 2014, IEEE Journal of Biomedical and Health Informatics.

[16]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[17]  Daniel S. Yeung,et al.  Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems , 2015, IEEE Transactions on Cybernetics.

[18]  Maoguo Gong,et al.  Optimization methods for regularization-based ill-posed problems: a survey and a multi-objective framework , 2017, Frontiers of Computer Science.

[19]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[20]  Lijun Xie,et al.  A regularized ensemble framework of deep learning for cancer detection from multi-class, imbalanced training data , 2018, Pattern Recognit..

[21]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[22]  Yuming Zhou,et al.  A novel ensemble method for classifying imbalanced data , 2015, Pattern Recognit..

[23]  R. Iman,et al.  Approximations of the critical region of the fbietkan statistic , 1980 .

[24]  Eric Horvitz,et al.  Considering Cost Asymmetry in Learning Classifiers , 2006, J. Mach. Learn. Res..

[25]  Zhe Wang,et al.  Cascade interpolation learning with double subspaces and confidence disturbance for imbalanced problems , 2019, Neural Networks.

[26]  Yunqian Ma,et al.  Imbalanced Learning: Foundations, Algorithms, and Applications , 2013 .

[27]  Ronald L. Rivest,et al.  Introduction to Algorithms, third edition , 2009 .

[28]  J. Langford Tutorial on Practical Prediction Theory for Classification , 2005, J. Mach. Learn. Res..

[29]  John Shawe-Taylor,et al.  Tighter PAC-Bayes Bounds , 2006, NIPS.

[30]  Antonino Staiano,et al.  Intrinsic dimension estimation: Advances and open problems , 2016, Inf. Sci..

[31]  V. Koltchinskii,et al.  Rademacher Processes and Bounding the Risk of Function Learning , 2004, math/0405338.

[32]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[33]  Evgueni A. Haroutunian,et al.  Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[34]  Daoqiang Zhang,et al.  Pattern Representation in Feature Extraction and Classifier Design: Matrix Versus Vector , 2008, IEEE Transactions on Neural Networks.

[35]  Zhe Wang,et al.  Geometric Structural Ensemble Learning for Imbalanced Problems , 2020, IEEE Transactions on Cybernetics.

[36]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[37]  François Laviolette,et al.  PAC-Bayesian learning of linear classifiers , 2009, ICML '09.

[38]  Zhi-Hua Zhou,et al.  Exploratory Undersampling for Class-Imbalance Learning , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[39]  Q. Henry Wu,et al.  Association Rule Mining-Based Dissolved Gas Analysis for Fault Diagnosis of Power Transformers , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).