Semiparametric model for bivariate survival data subject to biased sampling

To understand better the relationship between patient characteristics and their residual survival after an intermediate event such as the local recurrence of cancer, it is of interest to identify patients with the intermediate event and then to analyse their residual survival data. One challenge in analysing such data is that the observed residual survival times tend to be longer than those in the target population, since patients who die before experiencing the intermediate event are excluded from the cohort identified. We propose to model jointly the ordered bivariate survival data by using a copula model and appropriately adjusting for the sampling bias. We develop an estimating procedure to estimate simultaneously the parameters for the marginal survival functions and the association parameter in the copula model, and we use a two‐stage expectation–maximization algorithm. Using empirical process theory, we prove that the estimators have strong consistency and asymptotic normality. We conduct simulation studies to evaluate the finite sample performance of the method proposed. We apply the method to two cohort studies to evaluate the association between patient characteristics and residual survival.

[1]  Marc van Kreveld,et al.  The accelerated failure time model , 2017 .

[2]  M. Bondy,et al.  Semiparametric model for semi-competing risks data with application to breast cancer study , 2016, Lifetime data analysis.

[3]  J. Qin,et al.  Semiparametric maximum likelihood inference for truncated or biased-sampling data , 2016 .

[4]  J. Qin,et al.  Score Estimating Equations from Embedded Likelihood Functions Under Accelerated Failure Time Model , 2014, Journal of the American Statistical Association.

[5]  E. Rakovitch,et al.  Factors associated with breast cancer mortality after local recurrence. , 2014, Current oncology.

[6]  C. Wolfson,et al.  Analysis of Biased Survival Data: The Canadian Study of Health and Aging andbeyond , 2014 .

[7]  A. Recht,et al.  Survival after partial breast brachytherapy in elderly patients with nonmetastatic breast cancer. , 2013, Brachytherapy.

[8]  Zhiliang Ying,et al.  A Unified Approach to Semiparametric Transformation Models Under General Biased Sampling Schemes , 2013, Journal of the American Statistical Association.

[9]  Ya-Chen Tina Shih,et al.  Nomogram to predict the benefit of radiation for older patients with breast cancer treated with conservative surgery. , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[10]  C. Hudis,et al.  1st International consensus guidelines for advanced breast cancer (ABC 1). , 2012, Breast.

[11]  Jing Ning,et al.  Maximum Likelihood Estimations and EM Algorithms With Length-Biased Data , 2011, Journal of the American Statistical Association.

[12]  R. Gelber,et al.  Strategies for subtypes—dealing with the diversity of breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2011 , 2011, Annals of oncology : official journal of the European Society for Medical Oncology.

[13]  Yu Shen,et al.  Statistical Methods for Analyzing Right‐Censored Length‐Biased Data under Cox Model , 2010, Biometrics.

[14]  Jing Ning,et al.  Analyzing Length-Biased Data With Semiparametric Transformation and Accelerated Failure Time Models , 2009, Journal of the American Statistical Association.

[15]  W. Tsai,et al.  Pseudo-partial likelihood for proportional hazards models with biased-sampling data. , 2009, Biometrika.

[16]  Masoud Asgharian,et al.  Covariate Bias Induced by Length-Biased Sampling of Failure Times , 2008 .

[17]  Melissa Bondy,et al.  Residual risk of breast cancer recurrence 5 years after adjuvant therapy. , 2008, Journal of the National Cancer Institute.

[18]  Xiaohong Chen,et al.  A MODEL SELECTION TEST FOR BIVARIATE FAILURE-TIME DATA , 2007, Econometric Theory.

[19]  Limin Peng,et al.  Regression Modeling of Semicompeting Risks Data , 2007, Biometrics.

[20]  Masoud Asgharian,et al.  Asymptotic behavior of the unconditional NPMLE of the length-biased survivor function from right censored prevalent cohort data , 2005, math/0602239.

[21]  David Grimes,et al.  Randomized phase II trial of the efficacy and safety of trastuzumab combined with docetaxel in patients with human epidermal growth factor receptor 2-positive metastatic breast cancer administered as first-line treatment: the M77001 study group. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[22]  K. Bandeen-Roche,et al.  A Diagnostic for Association in Bivariate Survival Models , 2005, Lifetime data analysis.

[23]  Marvin Zelen,et al.  Forward and Backward Recurrence Times and Length Biased Sampling: Age Specific Models , 2004, Lifetime data analysis.

[24]  David A. Snowdon,et al.  Healthy Aging and Dementia: Findings from the Nun Study , 2003, Annals of Internal Medicine.

[25]  Mitchell H. Gail,et al.  Encyclopedia of Epidemiologic Methods , 2002 .

[26]  David B Wolfson,et al.  Length-Biased Sampling With Right Censoring , 2002 .

[27]  Jason P. Fine,et al.  On semi-competing risks data , 2001 .

[28]  T Ostbye,et al.  A reevaluation of the duration of survival after the onset of dementia. , 2001, The New England journal of medicine.

[29]  T. Fleming,et al.  Use of chemotherapy plus a monoclonal antibody against HER2 for metastatic breast cancer that overexpresses HER2. , 2001, The New England journal of medicine.

[30]  Martin T. Wells,et al.  Model Selection and Semiparametric Inference for Bivariate Failure-Time Data , 2000 .

[31]  E. Lehmann Elements of large-sample theory , 1998 .

[32]  W. Markesbery,et al.  Brain infarction and the clinical expression of Alzheimer disease. The Nun Study. , 1997, JAMA.

[33]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[34]  T. Louis,et al.  Inferences on the association parameter in copula models for bivariate survival data. , 1995, Biometrics.

[35]  C. Genest,et al.  Statistical Inference Procedures for Bivariate Archimedean Copulas , 1993 .

[36]  Z. Ying,et al.  Checking the Cox model with cumulative sums of martingale-based residuals , 1993 .

[37]  N P Jewell,et al.  Statistical models for prevalent cohort data. , 1993, Biometrics.

[38]  Yehuda Vardi,et al.  Multiplicative censoring, renewal processes, deconvolution and decreasing density: Nonparametric estimation , 1989 .

[39]  Mei-Cheng Wang,et al.  A Semiparametric Model for Randomly Truncated Data , 1989 .

[40]  D. Oakes,et al.  Bivariate survival models induced by frailties , 1989 .

[41]  R. Gray,et al.  Methodology for case-control studies with prevalent cases , 1987 .

[42]  Nicholas P. Jewell,et al.  Asymptotic Properties of the Product Limit Estimate Under Random Truncation , 1986 .

[43]  H. Høst,et al.  Age as a prognostic factor in breast cancer , 1986, Cancer.

[44]  H. Adami,et al.  Age as a prognostic factor in breast cancer , 1985, Cancer.

[45]  Y. Vardi Empirical Distributions in Selection Bias Models , 1985 .

[46]  D. Oakes A Model for Association in Bivariate Survival Data , 1982 .

[47]  Y. Vardi,et al.  Nonparametric Estimation in the Presence of Length Bias , 1982 .

[48]  Marvin Zelen,et al.  On the theory of screening for chronic diseases , 1969 .

[49]  E. Stacy A Generalization of the Gamma Distribution , 1962 .

[50]  J. Neyman Statistics—Servant of All Science , 1955 .

[51]  Soumendu Sundar Mukherjee,et al.  Weak convergence and empirical processes , 2019 .

[52]  M. Haugh,et al.  An Introduction to Copulas , 2016 .

[53]  A. Jemal,et al.  Cancer statistics, 2014 , 2014, CA: a cancer journal for clinicians.

[54]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[55]  R. Gill Non- and semi-parametric maximum likelihood estimators and the Von Mises method , 1986 .

[56]  M. Graffar [Modern epidemiology]. , 1971, Bruxelles medical.

[57]  J. Neyman Statistics; servant of all sciences. , 1955, Science.