Extensive semi-quantitative regression

In this paper, we propose and solve a new machine learning problem called extensive semi-quantitative regression, in which the information about some target values is incomplete: we know only their lower and/or upper bounds rather than their exact values. To use this information efficiently, we introduce a local graph that captures the geometric structure of the samples with exact target values and of those with target bounds, and we construct a graph-based support vector regressor called ESQ-SVR. The efficiency of ESQ-SVR is supported by the results of preliminary experiments conducted on both artificial and real-world datasets.

Highlights
We propose a new problem called extensive semi-quantitative regression.
A graph-based support vector regressor is used to solve this problem.
Our approach can capture the geometric structure of the samples.
It also bounds the target values and the value ranges.
The efficiency of the approach is supported by the results of experiments.
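To make the problem setting concrete, the following is a minimal Python sketch of how training targets with exact values and targets known only through bounds might be represented and scored. It is an illustration under stated assumptions, not the ESQ-SVR formulation from the paper: the `Target` container, the function names, and the choice of an epsilon-insensitive loss for exact targets plus a linear penalty for bound violations are all hypothetical, and the local graph term is omitted entirely.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Target:
    """Hypothetical container: an exact value, or lower and/or upper bounds."""
    exact: Optional[float] = None
    lower: float = -math.inf   # -inf means "no lower bound known"
    upper: float = math.inf    # +inf means "no upper bound known"


def sample_loss(pred: float, target: Target, eps: float = 0.1) -> float:
    """Loss for one prediction (illustrative assumption, not the paper's loss)."""
    if target.exact is not None:
        # Exact target: standard epsilon-insensitive loss used in SVR.
        return max(0.0, abs(pred - target.exact) - eps)
    # Bounded target: penalize only the amount by which a known bound is violated.
    return max(0.0, target.lower - pred) + max(0.0, pred - target.upper)


def total_loss(preds: Sequence[float], targets: Sequence[Target],
               eps: float = 0.1) -> float:
    """Sum of per-sample losses over a dataset."""
    return sum(sample_loss(p, t, eps) for p, t in zip(preds, targets))


if __name__ == "__main__":
    targets = [
        Target(exact=1.0),             # exact value known
        Target(lower=3.0),             # only a lower bound known
        Target(lower=1.0, upper=4.0),  # both bounds known
    ]
    preds = [1.05, 2.0, 5.0]
    print(total_loss(preds, targets))
```

In this sketch a prediction inside the known bounds incurs no loss, so a bounded sample constrains the regressor without forcing it toward any particular value.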
