Vertical modelling: Analysis of competing risks data with missing causes of failure

We propose vertical modelling as a natural approach to the problem of analysis of competing risks data when failure types are missing for some individuals. Under a natural missing-at-random assumption for these missing failure types, we use the observed data likelihood to estimate its parameters and show that the all-cause hazard and the relative hazards appearing in vertical modelling are indeed key quantities of this likelihood. This fact has practical implications in that it suggests vertical modelling as a simple and attractive method of analysis in competing risks with missing causes of failure; all individuals are used in estimating the all-cause hazard and only those with non-missing cause of failure for relative hazards. The relative hazards also appear in a multiple imputation approach to the same problem proposed by Lu and Tsiatis and in the EM algorithm. We compare the vertical modelling approach with the method of Goetghebeur and Ryan for a breast cancer data set, highlighting the different aspects they contribute to the data analysis.

[1]  Els Goetghebeur,et al.  Analysis of competing risks survival data when some failure types are missing , 1995 .

[2]  A A Tsiatis,et al.  Multiple Imputation Methods for Estimating Regression Coefficients in the Competing Risks Model with Missing Cause of Failure , 2001, Biometrics.

[3]  H. V. Houwelingen Dynamic Prediction by Landmarking in Event History Analysis , 2007 .

[4]  H Putter,et al.  Tutorial in biostatistics: competing risks and multi‐state models , 2007, Statistics in medicine.

[5]  Hein Putter,et al.  The mstate package for estimation and prediction in non- and semi-parametric multi-state and competing risks models , 2010, Comput. Methods Programs Biomed..

[6]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[7]  Radu V. Craiu,et al.  Inference based on the EM algorithm for the competing risks model with masked causes of failure , 2004 .

[8]  I. Langner Survival Analysis: Techniques for Censored and Truncated Data , 2006 .

[9]  J. Klein,et al.  Semiparametric Proportional Hazards Regression with Fixed Covariates , 1997 .

[10]  R. Gray,et al.  Tamoxifen versus placebo: double-blind adjuvant trial in elderly women with stage II breast cancer. , 1986, NCI monographs : a publication of the National Cancer Institute.

[11]  Wenbin Lu,et al.  ANALYSIS OF COMPETING RISKS DATA WITH MISSING CAUSE OF FAILURE UNDER ADDITIVE HAZARDS MODEL , 2008 .

[12]  M A Nicolaie,et al.  Vertical modeling: A pattern mixture approach for competing risks modeling , 2010, Statistics in medicine.

[13]  Chanseok Park,et al.  Parameter estimation of incomplete data in competing risks using the EM algorithm , 2005, IEEE Transactions on Reliability.

[14]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[15]  Roderick J. A. Little,et al.  Theory of Inference Based on the Likelihood Function , 2014 .