Linearized binary regression

Probit regression was first proposed by Bliss in 1934 to study mortality rates of insects. Since then, an extensive body of work has analyzed and used probit or related binary regression methods (such as logistic regression) in numerous applications and fields. This paper provides a fresh angle to such well-established binary regression methods. Concretely, we demonstrate that linearizing the probit model in combination with linear estimators performs on par with state-of-the-art nonlinear regression methods, such as posterior mean or maximum aposteriori estimation, for a broad range of real-world regression problems. We derive exact, closed-form, and nonasymptotic expressions for the mean-squared error of our linearized estimators, which clearly separates them from nonlinear regression methods that are typically difficult to analyze. We showcase the efficacy of our methods and results for a number of synthetic and real-world datasets, which demonstrates that linearized binary regression finds potential use in a variety of inference, estimation, signal processing, and machine learning applications that deal with binary-valued observations or measurements.

[1]  T. Amemiya QUALITATIVE RESPONSE MODELS: A SURVEY , 1981 .

[2]  Lawrence Carin,et al.  Sparse multinomial logistic regression: fast algorithms and generalization bounds , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Charles X. Ling,et al.  Using AUC and accuracy in evaluating learning algorithms , 2005, IEEE Transactions on Knowledge and Data Engineering.

[4]  Ewout van den Berg,et al.  1-Bit Matrix Completion , 2012, ArXiv.

[5]  T. Hastie,et al.  Classification of gene microarrays by penalized logistic regression. , 2004, Biostatistics.

[6]  C. Stein Estimation of the Mean of a Multivariate Normal Distribution , 1981 .

[7]  D. Cox The Regression Analysis of Binary Sequences , 1958 .

[8]  H. Vincent Poor,et al.  An Introduction to Signal Detection and Estimation , 1994, Springer Texts in Electrical Engineering.

[9]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[10]  Francis R. Bach,et al.  Self-concordant analysis for logistic regression , 2009, ArXiv.

[11]  Peter D. Hoff,et al.  A First Course in Bayesian Statistical Methods , 2009 .

[12]  C. I. Bliss THE CALCULATION OF THE DOSAGE-MORTALITY CURVE , 1935 .

[13]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[14]  F. Bunea Honest variable selection in linear and logistic regression models via $\ell_1$ and $\ell_1+\ell_2$ penalization , 2008, 0808.4051.

[15]  J. Lafferty,et al.  High-dimensional Ising model selection using ℓ1-regularized logistic regression , 2010, 1010.0311.

[16]  C. I. Bliss,et al.  THE METHOD OF PROBITS. , 1934, Science.

[17]  Jun Zhou,et al.  Hyperspectral Image Classification Based on Structured Sparse Logistic Regression and Three-Dimensional Wavelet Texture Features , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[19]  Gavin C. Cawley,et al.  Gene Selection in Cancer Classification using Sparse Logistic Regression with Bayesian Regularisation , 2006 .

[20]  D. Brillinger The identification of a particular nonlinear time series system , 1977 .

[21]  Christos Thrampoulidis,et al.  LASSO with Non-linear Measurements is Equivalent to One With Linear Measurements , 2015, NIPS.

[22]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[23]  Yaniv Plan,et al.  Robust 1-bit Compressed Sensing and Sparse Logistic Regression: A Convex Programming Approach , 2012, IEEE Transactions on Information Theory.

[24]  Richard G. Baraniuk,et al.  Tag-Aware Ordinal Sparse Factor Analysis for Learning and Content Analytics , 2014, EDM.

[25]  Richard G. Baraniuk,et al.  A Field Guide to Forward-Backward Splitting with a FASTA Implementation , 2014, ArXiv.

[26]  Julian J. Bussgang,et al.  Crosscorrelation functions of amplitude-distorted gaussian signals , 1952 .

[27]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[28]  Cheng Tao,et al.  Channel Estimation and Performance Analysis of One-Bit Massive MIMO Systems , 2016, IEEE Transactions on Signal Processing.