Estimating medical costs from incomplete follow-up data.

Estimation of the average total cost for treating patients with a particular disease is often complicated by the fact that the survival times are censored on some study subjects and their subsequent costs are unknown. The naive sample average of the observed costs from all study subjects or from the uncensored cases only can be severely biased, and the standard survival analysis techniques are not applicable. To minimize the bias induced by censoring, we partition the entire time period of interest into a number of small intervals and estimate the average total cost either by the sum of the Kaplan-Meier estimator for the probability of dying in each interval multiplied by the sample mean of the total costs from the observed deaths in that interval or by the sum of the Kaplan-Meier estimator for the probability of being alive at the start of each interval multiplied by an appropriate estimator for the average cost over the interval conditional on surviving to the start of the interval. The resultant estimators are consistent if censoring occurs solely at the boundaries of the intervals. In addition, the estimators are asymptotically normal with easily estimated variances. Extensive numerical studies show that the asymptotic approximations are adequate for practical use and the biases of the proposed estimators are small even when censoring may occur in the interiors of the intervals. An ovarian cancer study is provided.