Regression analysis for secondary response variable in a case‐cohort study

The case-cohort study design has been widely used for its cost-effectiveness. In any real study, there are always other important outcomes of interest besides the failure time on which the original case-cohort study is based. How to use the available case-cohort data to study the relationship between a secondary outcome and the primary exposure measured in the case-cohort study has not been well studied. In this article, we propose a non-parametric estimated likelihood approach for analyzing a secondary outcome in a case-cohort study. The estimation is based on maximizing a semiparametric likelihood function that is built jointly on both the time-to-failure outcome and the secondary outcome. The proposed estimator is shown to be consistent, efficient, and asymptotically normal. Finite-sample performance is evaluated via simulation studies. Data from the Sister Study are analyzed to illustrate our method.
