Parameter estimation of incomplete data in competing risks using the EM algorithm

Consider a system which is made up of multiple components connected in a series. In this case, the failure of the whole system is caused by the earliest failure of any of the components, which is commonly referred to as competing risks. In certain situations, it is observed that the determination of the cause of failure may be expensive, or may be very difficult to observe due to the lack of appropriate diagnostics. Therefore, it might be the case that the failure time is observed, but its corresponding cause of failure is not fully investigated. This is known as masking. Moreover, this competing risks problem is further complicated due to possible censoring. In practice, censoring is very common because of time and cost considerations on experiments. In this paper, we deal with parameter estimation of the incomplete lifetime data in competing risks using the EM algorithm, where incompleteness arises due to censoring and masking. Several studies have been carried out, but parameter estimation for incomplete data has mainly focused on exponential models. We provide the general likelihood method, and the parameter estimation of a variety of models including exponential, s-normal, and lognormal models. This method can be easily implemented to find the MLE of other models. Exponential and lognormal examples are illustrated with parameter estimation, and a graphical technique for checking model validity.

[1]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[2]  Frank M. Guess,et al.  Bayesian inference for masked system lifetime data , 1995 .

[3]  Ram C. Tiwari,et al.  Comparing cumulative incidence functions of a competing-risks model , 1997 .

[4]  Odd Aalen,et al.  Nonparametric Estimation of Partial Transition Probabilities in Multiple Decrement Models , 1978 .

[5]  Ian W. McKeague,et al.  Some Tests for Comparing Cumulative Incidence Functions and Cause-Specific Hazard Rates , 1994 .

[6]  Thorn J. Hodgson,et al.  Maximum likelihood analysis of component reliability using masked system life-test data , 1988 .

[7]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[8]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[9]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[10]  K. F. Lam A class of tests for the equality of k cause-specific hazard rates in a competing risks model , 1998 .

[11]  B. Efron,et al.  Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher information , 1978 .

[12]  D. Kundu,et al.  Analysis of Incomplete Data in Presence of Competing Risks , 2000 .

[13]  Frank M. Guess,et al.  Estimating system and component reliabilities under partial information on cause of failure , 1991 .

[14]  Chanseok Park,et al.  Parametric inference of incomplete data with competing risks among several groups , 2004, IEEE Transactions on Reliability.

[15]  R. J. Herman,et al.  Maximum Likelihood Estimation For Multi-Risk Model , 1971 .

[16]  D. Cox,et al.  THE ANALYSIS OF EXPONENTIALLY DISTRIBUTED LIFE-TIMES WITH Two TYPES OF FAILURE , 1959 .

[17]  M. A. Tanner,et al.  Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, 3rd Edition , 1998 .

[18]  Debasis Kundu,et al.  Parameter Estimation for Partially Complete Time and Type of Failure Data , 2004 .

[19]  Laurence A. Baxter,et al.  Applications of the EM algorithm to the analysis of life length data , 1995 .

[20]  Gregg E. Dinse,et al.  A mixture model for the regression analysis of competing risks data , 1985 .

[21]  Charles El-Nouty,et al.  The Presmoothed Nelson–Aalen Estimator in the Competing Risk Model , 2005 .

[22]  R. Gray A Class of $K$-Sample Tests for Comparing the Cumulative Incidence of a Competing Risk , 1988 .

[23]  W. J. Padgett,et al.  Analysis of strength distributions of multi-modal failures using the EM algorithm , 2006 .

[24]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[25]  R. Beran Minimum Hellinger distance estimates for parametric models , 1977 .

[26]  Daniel F. Heitjan,et al.  Inference from Grouped Continuous Data: A Review , 1989 .

[27]  Yuri K Belyaev,et al.  A Class of Non‐parametric Tests in the Competing Risks Model for Comparing Two Samples , 1998 .

[28]  H. A. David,et al.  Life Tests under Competing Causes of Failure and the Theory of Competing Risks , 1971 .

[29]  Xian Zhou,et al.  Analysis of parametric models for competing risks , 2002 .

[30]  Masami Miyakawa,et al.  Analysis of Incomplete Data in Competing Risks Model , 1984, IEEE Transactions on Reliability.

[31]  Ram C. Tiwari,et al.  Nonparametric tests for cause specific hazard rates with censored data for competing risks among several groups , 2006 .

[32]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[33]  Donald B. Rubin,et al.  Inference from Coarse Data via Multiple Imputation with Application to Age Heaping , 1990 .

[34]  Bruce W. Turnbull,et al.  COMPARING TWO TREATMENTS WITH MULTIPLE COMPETING RISKS ENDPOINTS , 1999 .

[35]  Dario Gasbarra,et al.  Testing equality of cause-specific hazard rates corresponding to m competing risks among K groups , 2002 .

[36]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[37]  T. Ishioka,et al.  Maximum likelihood estimation of Weibull parameters for two independent competing risk , 1991 .

[38]  R. Jiang,et al.  Study of n-fold weibull competing risk model , 2003 .

[39]  Frank M. Guess,et al.  An iterative approach for estimating component reliability from masked system life data , 1989 .