Regression analysis of incomplete medical cost data

The accumulation of medical cost over time for each subject is an increasing stochastic process defined up to the instant of death. The stochastic structure of this process is complex. In most applications, the process can only be observed at a limited number of time points. Furthermore, the process is subject to right censoring so that it is unobservable after the censoring time. These special features of the medical cost data, especially the presence of death and censoring, pose major challenges in the construction of plausible statistical models and the development of the corresponding inference procedures. In this paper, we propose several classes of regression models which formulate the effects of possibly time-dependent covariates on the marginal mean of cost accumulation in the presence of death or on the conditional means of cost accumulation given specific survival patterns. We then develop estimating equations for these models by combining the approach of generalized estimating equations for longitudinal data with the inverse probability of censoring weighting technique. The resultant estimators are shown to be consistent and asymptotically normal with simple variance estimators. Simulation studies indicate that the proposed inference procedures behave well in practical situations. An application to data taken from a large cancer study reveals that the Medicare enrollees who are diagnosed with less aggressive ovarian cancer tend to accumulate medical cost at lower rates than those with more aggressive disease, but tend to have higher lifetime costs because they live longer.

[1]  D. Lin,et al.  Linear regression analysis of censored medical costs. , 2000, Biostatistics.

[2]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[3]  J. V. Ryzin,et al.  Regression Analysis with Randomly Right-Censored Data , 1981 .

[4]  Anastasios A. Tsiatis,et al.  A consistent estimator for the distribution of quality adjusted survival time , 1997 .

[5]  A H Briggs,et al.  Pulling cost-effectiveness analysis up by its bootstraps: a non-parametric approach to confidence interval estimation. , 1997, Health economics.

[6]  Ruth Etzioni,et al.  Estimating the costs attributable to a disease with application to ovarian cancer. , 1996, Journal of clinical epidemiology.

[7]  D. Cox Regression Models and Life-Tables , 1972 .

[8]  M. Stumpf,et al.  Contribution to the discussion of the paper by Wilson, Weale and Balding: "Inferences from DNA data: population histories, evolutionary processes and forensic match probabilities" , 2003 .

[9]  R J Cook,et al.  Marginal analysis of recurrent events and a terminating event. , 1997, Statistics in medicine.

[10]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[11]  D. Lin,et al.  Proportional Means Regression for Censored Medical Costs , 2000, Biometrics.

[12]  Zhiliang Ying,et al.  A simple nonparametric estimator of the bivariate survival function under univariate censoring , 1993 .

[13]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[14]  Lee-Jen Wei,et al.  Confidence bands for survival curves under the proportional , 1994 .

[15]  L. Kessler,et al.  Potential for Cancer Related Health Services Research Using a Linked Medicare‐Tumor Registry Database , 1993, Medical care.

[16]  M. Kenward,et al.  Contribution to the discussion of the paper by Diggle, Tawn and Moyeed , 1998 .

[17]  J. Robins,et al.  Recovery of Information and Adjustment for Dependent Censoring Using Surrogate Markers , 1992 .

[18]  Roderick J. A. Little,et al.  Modeling the Drop-Out Mechanism in Repeated-Measures Studies , 1995 .

[19]  M Cortina Borja,et al.  Modelling Survival Data: extending the Cox model, by T.M. Therneau and P.M. Grambsch , 2002 .

[20]  E. Feuer,et al.  Estimating medical costs from incomplete follow-up data. , 1997, Biometrics.

[21]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[22]  A R Willan,et al.  Confidence intervals for cost-effectiveness ratios: an application of Fieller's theorem. , 1996, Health economics.

[23]  A O'Hagan,et al.  Bayesian cost‐effectiveness analysis from clinical trial data , 2001, Statistics in medicine.

[24]  A R Willan,et al.  Analysis, sample size, and power for estimating incremental net health benefit from clinical trial data. , 2001, Controlled clinical trials.

[25]  Z. Ying,et al.  Model‐Checking Techniques Based on Cumulative Residuals , 2002, Biometrics.

[26]  Anastasios A. Tsiatis,et al.  Estimating medical costs with censored data , 2000 .