A constrained least squares regression model

Abstract Least squares regression (LSR) is a widely used regression technique for multicategory classification. Conventional LSR model assumes that during the learning phase, the labeled samples can be exactly transformed into a discrete label matrix, which is too strict to learn a regression matrix for fitting the labels. To overcome this drawback, lots of LSR’s variants utilize the soft target label, which contains the continuous values, to replace this discrete label to improve the learning performance. Since the regression matrix can be learnt from these soft target labels, it is reasonable to assume that the samples in the same class have similar soft target labels. Nevertheless, most of existing LSR-based models don’t adequately consider this similarity assumption. In this paper, we propose a constrained least squares regression (CLSR) model for multicategory classification. The main motivation of CLSR is to force the samples in the same class to obtain the similar soft target labels. To effectively optimize CLSR, we propose a novel alternating algorithm, which can converge to the globally optimal solution. Extensive experiments results on face and digit data sets confirm the effectiveness of our proposed model.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  Chunhong Pan,et al.  Groupwise Retargeted Least-Squares Regression , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[3]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[5]  Vojislav Kecman,et al.  Multi-target support vector regression via correlation regressor chains , 2017, Inf. Sci..

[6]  Qian Du,et al.  Scene classification using local and global features with collaborative representation fusion , 2016, Inf. Sci..

[7]  Dacheng Tao,et al.  Local Rademacher Complexity for Multi-Label Learning , 2014, IEEE Transactions on Image Processing.

[8]  Matthew B. Blaschko,et al.  Discovering predictors of mental health service utilization with k-support regularized logistic regression , 2016, Inf. Sci..

[9]  Jindong Shen,et al.  A twin-hyperspheres support vector machine with automatic variable weights for data classification , 2017, Inf. Sci..

[10]  Chih-Jen Lin,et al.  Coordinate Descent Method for Large-scale L2-loss Linear Support Vector Machines , 2008, J. Mach. Learn. Res..

[11]  Chuangyin Dang,et al.  Shape constrained risk-neutral density estimation by support vector regression , 2016, Inf. Sci..

[12]  Feiping Nie,et al.  Discriminative Least Squares Regression for Multiclass Classification and Feature Selection , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[13]  Lawrence Carin,et al.  Sparse multinomial logistic regression: fast algorithms and generalization bounds , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Shiming Xiang,et al.  Retargeted Least Squares Regression Algorithm , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[16]  Shichao Zhang,et al.  Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Dacheng Tao,et al.  Multi-View Learning With Incomplete Views , 2015, IEEE Transactions on Image Processing.

[18]  Ling Shao,et al.  Discriminative Elastic-Net Regularized Linear Regression , 2017, IEEE Transactions on Image Processing.

[19]  Feiping Nie,et al.  Scalable Multi-View Semi-Supervised Classification via Adaptive Regression , 2017, IEEE Transactions on Image Processing.

[20]  Feiping Nie,et al.  2D Feature Selection by Sparse Matrix Regression , 2017, IEEE Transactions on Image Processing.

[21]  Tieniu Tan,et al.  Feature Selection Based on Structured Sparsity: A Comprehensive Study , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[22]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[23]  Xuelong Li,et al.  Regularized Label Relaxation Linear Regression , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[24]  Xiaofei He,et al.  Multi-Target Regression via Robust Low-Rank Learning , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Zhenyue Zhang,et al.  Uniform Projection for Multi-View Learning , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Dianhui Wang,et al.  High dimensional data regression using Lasso model and neural networks with random weights , 2016, Inf. Sci..

[27]  Mohammed Bennamoun,et al.  Linear Regression for Face Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Andy Harter,et al.  Parameterisation of a stochastic model for human face identification , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[29]  Svetha Venkatesh,et al.  Face Recognition Using Kernel Ridge Regression , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Xiantong Zhen,et al.  Multitarget Sparse Latent Regression , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[31]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[32]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[34]  Dacheng Tao,et al.  Robust Extreme Multi-label Learning , 2016, KDD.

[35]  Tommy W. S. Chow,et al.  Robust non-negative sparse graph for semi-supervised multi-label learning with missing labels , 2018, Inf. Sci..

[36]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..