Multiple imputation for threshold-crossing data with interval censoring.

Medical statistics often involve measurements of the time when a variable crosses a threshold value. The time to threshold crossing may be the outcome variable in a survival analysis, or a time-dependent covariate in the analysis of a subsequent event. This paper presents new methods for analysing threshold-crossing data that are interval censored in that the time of threshold crossing is known only within a specified interval. Such data typically arise in event-history studies when the threshold is crossed at some time between data-collection points, such as visits to a clinic. We propose methods based on multiple imputation of the threshold-crossing time with use of models that take into account values recorded at the times of visits. We apply the methods to two real data sets, one involving hip replacements and the other on the prostate specific antigen (PSA) assay for prostate cancer. In addition, we compare our methods with the common practice of imputing the threshold-crossing time as the right endpoint of the interval. The two examples require different imputation models, but both lead to simple analyses of the multiply imputed data that automatically take into account variability due to imputation.

[1]  Roderick J. A. Little,et al.  The Analysis of Social Science Data with Missing Values , 1989 .

[2]  D. Rubin,et al.  Ignorability and Coarse Data , 1991 .

[3]  D. Rubin,et al.  Multiple Imputation for Interval Estimation from Simple Random Samples with Ignorable Nonresponse , 1986 .

[4]  D. Finkelstein,et al.  A proportional hazards model for interval-censored failure time data. , 1986, Biometrics.

[5]  Norman R. Draper,et al.  Applied regression analysis (2. ed.) , 1981, Wiley series in probability and mathematical statistics.

[6]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[7]  R. Peto,et al.  Experimental Survival Curves for Interval‐Censored Data , 1973 .

[8]  P. Bacchetti,et al.  Nonparametric estimation of the incubation period of AIDS based on a prevalent cohort with unknown infection times. , 1991, Biometrics.

[9]  B. Turnbull The Empirical Distribution Function with Arbitrarily Grouped, Censored, and Truncated Data , 1976 .

[10]  J M Taylor,et al.  Estimating the distribution of times from HIV seroconversion to AIDS using multiple imputation. Multicentre AIDS Cohort Study. , 1990, Statistics in medicine.

[11]  Donald B. Rubin,et al.  Inference from Coarse Data via Multiple Imputation with Application to Age Heaping , 1990 .

[12]  R. Wolfe,et al.  A semiparametric model for regression analysis of interval-censored failure time data. , 1985, Biometrics.

[13]  Joan S. Chmiel,et al.  Acquired immunodeficiency syndrome (AIDS)-free time after human immunodeficiency virus type 1 (HIV-1) seroconversion in homosexual men. Multicenter AIDS Cohort Study Group. , 1989, American journal of epidemiology.

[14]  D B Rubin,et al.  Multiple imputation in health-care databases: an overview and some applications. , 1991, Statistics in medicine.

[15]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[16]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[17]  Roderick J. A. Little,et al.  Multiple Imputation for the Fatal Accident Reporting System , 1991 .