论文信息 - Survival estimation and testing via multiple imputation

Survival estimation and testing via multiple imputation

Multiple imputation is a technique for handling data sets with missing values. The method fills in each missing value several times, creating many augmented data sets. Each augmented data set is analyzed separately and the results combined to give a final result consisting of an estimate and a measure of uncertainty. In this paper we consider nonparametric multiple-imputation methods to handle missing event times for censored observations in the context of nonparametric survival estimation and testing. Two nonparametric imputation schemes are considered. In risk set imputation the censored time is replaced by a random draw of the observed times amongst those at risk after the censoring time. In Kaplan-Meier (KM) imputation the imputed time is a draw from the estimated distribution of event times amongst those at risk after the censoring time. We show that with a large number of imputes the estimates from both methods reproduce the KM estimator. In a simulation study we show that the inclusion of a bootstrap stage in the multiple imputation algorithm gives coverage rates of confidence intervals that are comparable to that from Greenwood's formula. Connections to the redistribute to the right algorithm are discussed.

Susan Murray | Chiu-Hsieh Hsu | Chiu-Hsieh Hsu | S. Murray | Jeremy M. G. Taylor

[1] D B Rubin,et al. Multiple imputation in health-care databases: an overview and some applications. , 1991, Statistics in medicine.

[2] M. Pagano,et al. Survival analysis. , 1996, Nutrition.

[3] Xiao-Li Meng,et al. Multiple-Imputation Inferences with Uncongenial Sources of Input , 1994 .

[4] D. Rubin,et al. Statistical Analysis with Missing Data. , 1989 .

[5] B. Efron. The two sample problem with censored data , 1967 .

[6] H. L. Le Roy,et al. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[7] D. Rubin. Multiple imputation for nonresponse in surveys , 1989 .

[8] Daniel F. Heitjan,et al. Ignorability in general incomplete-data models , 1994 .

[9] D. Rubin,et al. MULTIPLE IMPUTATIONS IN SAMPLE SURVEYS-A PHENOMENOLOGICAL BAYESIAN APPROACH TO NONRESPONSE , 2002 .

[10] Roderick J. A. Little,et al. Multiple Imputation for the Fatal Accident Reporting System , 1991 .

[11] Jeremy MG Taylor,et al. Partially parametric techniques for multiple imputation , 1996 .

[12] D. Rubin,et al. Large-sample significance levels from multiply imputed data using moment-based statistics and an F reference distribution , 1991 .

[13] D. Rubin. INFERENCE AND MISSING DATA , 1975 .