Estimating treatment effects in observational studies with both prevalent and incident cohorts

Registry databases are increasingly being used for comparative effectiveness research in cancer. Such databases reflect the real‐world patient population and physician practice, and thus are natural sources for comparing multiple treatment scenarios and the associated long‐term clinical outcomes. Registry databases usually include both incident and prevalent cohorts, which provide valuable complementary information for patients with more recent diagnoses in the incident cohort as well as patients with long‐term follow‐up data in the prevalent cohort. However, utilizing such data to derive valid inference poses two major challenges: the data from a prevalent cohort are not random samples of the target population, and there may be substantial differences in the baseline characteristics of patients between treatment arms, which influences the decisions about treatment selection in both cohorts. In this article we extend propensity score methodology to observational studies that involve both prevalent and incident cohorts, and assess the effectiveness of radiation therapy (RT) in SEER‐Medicare patients diagnosed with stage IV breast cancer. Specifically we utilize the incident cohort to estimate the propensity for receiving RT, and then combine data from both the incident and prevalent cohorts to estimate the effect of RT by adjusting for the propensity scores in the model. We evaluate the proposed method with simulations. We demonstrate that the proposed propensity score method simultaneously removes sampling bias and selection bias under several assumptions. The Canadian Journal of Statistics 45: 202–219; 2017 © 2017 Statistical Society of Canada

[1]  Ashkan Ertefaie,et al.  Propensity score estimation in the presence of length‐biased sampling: a non‐parametric adjustment approach , 2013, Stat.

[2]  Kwun Chuen Gary Chan,et al.  Survival analysis without survival data: connecting length-biased and case-control data. , 2013, Biometrika.

[3]  Liang Li,et al.  A Weighting Analogue to Pair Matching in Propensity Score Analysis , 2013, The international journal of biostatistics.

[4]  Mei-Cheng Wang,et al.  Estimating Propensity Scores and Causal Survival Functions Using Prevalent Survival Data , 2012, Biometrics.

[5]  J. Qin,et al.  Buckley–James‐Type Estimator with Right‐Censored and Length‐Biased Data , 2011, Biometrics.

[6]  Yu Shen,et al.  Statistical Methods for Analyzing Right‐Censored Length‐Biased Data under Cox Model , 2010, Biometrics.

[7]  Jing Ning,et al.  Analyzing Length-Biased Data With Semiparametric Transformation and Accelerated Failure Time Models , 2009, Journal of the American Statistical Association.

[8]  Peter C Austin,et al.  A critical appraisal of propensity‐score matching in the medical literature between 1996 and 2003 , 2008, Statistics in medicine.

[9]  Jake Bowers,et al.  Covariate balance in simple stratified and clustered comparative studies , 2008, 0808.3857.

[10]  Marie Davidian,et al.  Comment: Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data. , 2008, Statistical science : a review journal of the Institute of Mathematical Statistics.

[11]  Gary King,et al.  Misunderstandings between experimentalists and observationalists about causal inference , 2008 .

[12]  Joseph Kang,et al.  Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2973.

[13]  Masoud Asgharian,et al.  Asymptotic behavior of the unconditional NPMLE of the length-biased survivor function from right censored prevalent cohort data , 2005, math/0602239.

[14]  Ralph B. D'Agostino Propensity score methods for bias reduction in the comparison of a treatment to a non-randomized control group. , 1998, Statistics in medicine.

[15]  N P Jewell,et al.  Statistical models for prevalent cohort data. , 1993, Biometrics.

[16]  Zhiliang Ying,et al.  Rank Regression Methods for Left-Truncated and Right-Censored Data , 1991 .

[17]  D. Rubin,et al.  Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity Score , 1985 .

[18]  Y. Vardi,et al.  Nonparametric Estimation in the Presence of Length Bias , 1982 .

[19]  D. Rubin Matched Sampling for Causal Effects: Matching to Remove Bias in Observational Studies , 1973 .

[20]  J. Chemali,et al.  Summary and discussion of “ The central role of the propensity score in observational studies for causal effects , 2014 .

[21]  D. Rubin Matched Sampling for Causal Effects: Using Multivariate Matched Sampling and Regression Adjustment to Control Bias in Observational Studies , 2006 .

[22]  Wenceslao González-Manteiga,et al.  Asymptotic properties of a generalized kaplan-meier estimator with some applications , 1994 .