Maximum likelihood inference for the Cox regression model with applications to missing covariates

In this paper, we carry out an in-depth theoretical investigation for existence of maximum likelihood estimates for the Cox model (Cox, 1972, 1975) both in the full data setting as well as in the presence of missing covariate data. The main motivation for this work arises from missing data problems, where models can easily become difficult to estimate with certain missing data configurations or large missing data fractions. We establish necessary and sufficient conditions for existence of the maximum partial likelihood estimate (MPLE) for completely observed data (i.e., no missing data) settings as well as sufficient conditions for existence of the maximum likelihood estimate (MLE) for survival data with missing covariates via a profile likelihood method. Several theorems are given to establish these conditions. A real dataset from a cancer clinical trial is presented to further illustrate the proposed methodology.

[1]  D. Cox Regression Models and Life-Tables , 1972 .

[2]  J. Ibrahim,et al.  Likelihood-Based Methods for Missing Covariates in the Cox Proportional Hazards Model , 2001 .

[3]  Odile Pons Estimation in the Cox Model with Missing Covariate Data , 2002 .

[4]  Kirby L. Jackson,et al.  Log-linear analysis of censored survival data with partially observed covariates , 1989 .

[5]  R. Little,et al.  Proportional hazards regression with missing covariates , 1999 .

[6]  J. Hobert,et al.  Convergence rates and asymptotic standard errors for Markov chain Monte Carlo algorithms for Bayesian probit regression , 2007 .

[7]  W. Tsai,et al.  On using the Cox proportional hazards model with missing covariates , 1997 .

[8]  Joseph G Ibrahim,et al.  Frailty models with missing covariates. , 2002, Biometrics.

[9]  J G Ibrahim,et al.  Maximum Likelihood Methods for Cure Rate Models with Missing Covariates , 2001, Biometrics.

[10]  Rupert G. Miller,et al.  Survival Analysis , 2022, The SAGE Encyclopedia of Research Design.

[11]  Michael J Schell,et al.  Phase III trial comparing a defined duration of therapy versus continuous therapy followed by second-line therapy in advanced-stage IIIB/IV non-small-cell lung cancer. , 2002, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[12]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[13]  J. Ibrahim,et al.  A Semiparametric Mixture Model for Analyzing Clustered Competing Risks Data , 2005, Biometrics.

[14]  S. Johansen An Extension of Cox's Regression Model , 1983 .

[15]  H. Y. Chen,et al.  Double-Semiparametric Method for Missing Covariates in Cox Regression Models , 2002 .

[16]  Joseph G. Ibrahim,et al.  Propriety of the Posterior Distribution and Existence of the MLE for Regression Models With Covariates Missing at Random , 2004 .

[17]  J G Ibrahim,et al.  Using the EM-algorithm for survival data with incomplete categorical covariates , 1996, Lifetime data analysis.

[18]  Ming-Hui Chen,et al.  Propriety of posterior distribution for dichotomous quantal response models , 2000 .

[19]  Z. Ying,et al.  Cox Regression with Incomplete Covariate Measurements , 1993 .

[20]  Joseph G. Ibrahim,et al.  Incomplete covariates in the Cox model with applications to biological marker data , 2001 .

[21]  Joseph G Ibrahim,et al.  Bayesian Analysis for Generalized Linear Models with Nonignorably Missing Covariates , 2005, Biometrics.

[22]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[23]  Myunghee Cho Paik Multiple Imputation for the Cox Proportional Hazards Model with Missing Covariates , 1997, Lifetime data analysis.

[24]  J. Ibrahim Incomplete Data in Generalized Linear Models , 1990 .

[25]  Joseph G. Ibrahim,et al.  Posterior propriety and computation for the Cox regression model with applications to missing covariates , 2006 .

[26]  Torben Martinussen,et al.  Cox Regression with Incomplete Covariate Measurements using the EM‐algorithm , 1999 .

[27]  J G Ibrahim,et al.  Estimating equations with incomplete categorical covariates in the Cox model. , 1998, Biometrics.

[28]  N. Breslow Covariance analysis of censored survival data. , 1974, Biometrics.

[29]  J G Ibrahim,et al.  Estimation with correlated censored survival data with missing covariates. , 2000, Biostatistics.

[30]  A. D'Amico,et al.  Cancer‐specific mortality after radiation therapy with short‐course hormonal therapy or radical prostatectomy in men with localized, intermediate‐risk to high‐risk prostate cancer , 2006, Cancer.

[31]  J. Booth,et al.  Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm , 1999 .

[32]  Joseph G. Ibrahim,et al.  Missing covariates in generalized linear models when the missing data mechanism is non‐ignorable , 1999 .

[33]  Joseph G. Ibrahim,et al.  On propriety of the posterior distribution and existence of the maximum likelihood estimator for regression models with covariates missing at random , 2004 .

[34]  M. Jacobsen Existence and unicity of MLEs in discrete exponential family distributions , 1989 .