Regularized matrix-variate logistic regression with response subject to misclassification

Abstract: Matrix-variate logistic regression is useful for modeling the relationship between a binary response and complex matrix-variate covariates, which arise commonly in medical imaging research. However, standard inference procedures based on such a model are impaired by response misclassification and by the presence of inactive covariates. It is therefore imperative to account for misclassification effects and to select active covariates when using matrix-variate logistic regression to analyze such data. In this paper, we develop penalized unbiased estimating functions using the smoothly clipped absolute deviation (SCAD) penalty to address both the sparsity of matrix-variate data and the effects of response misclassification. The proposed methods are justified both theoretically and numerically. We apply the proposed methods to the Breast Cancer Wisconsin data.
