SEMIPARAMETRIC TRANSFORMATION MODELS WITH MULTILEVEL RANDOM EFFECTS FOR CORRELATED DISEASE ONSET IN FAMILIES.

Large cohort studies are commonly launched to study the effect of genetic variants or other risk factors on age at onset (AAO) of a chronic disorder. In these studies, family history data, including AAO of disease in family members, are collected to provide additional information and can be used to improve efficiency. Statistical analysis of these data is challenging because genotypes are missing in family members and because the heterogeneous dependence among relatives is attributable to both shared genetic background and shared environmental factors (e.g., lifestyle). In this paper, we propose a class of semiparametric transformation models with multilevel random effects to tackle these challenges. The proposed models include both the proportional hazards model and the proportional odds model as special cases. The multilevel random effects comprise individual-specific random effects, whose kinship correlation structure depends on the family pedigree, and a shared random effect that accounts for unobserved environmental exposure. We use a nonparametric maximum likelihood approach for inference and propose an expectation-maximization algorithm for computation in the presence of missing genotypes among family members. The resulting estimators are shown to be consistent, asymptotically normal, and semiparametrically efficient. Simulation studies demonstrate that the proposed method performs well in finite samples. Finally, the proposed method is applied to study genetic risks in an Alzheimer's disease study.
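
As an illustrative sketch only, a model of the kind described above could be written as follows. The notation is assumed here and is not taken from the paper: i indexes families and j family members, X_{ij} collects the genotype and other covariates, b_{ij} is the individual-specific random effect, u_i is the shared family-level random effect, \Phi_i is the pedigree kinship matrix, and G is a known increasing transformation. The Gaussian form of the random effects and the variance parameters \sigma_g^2 and \sigma_e^2 are likewise assumptions made for illustration.

\[
\Lambda(t \mid X_{ij}, b_{ij}, u_i) = G\!\left( \Lambda_0(t)\, \exp\!\left( \beta^{\top} X_{ij} + b_{ij} + u_i \right) \right),
\]
\[
b_i = (b_{i1}, \ldots, b_{in_i})^{\top} \sim N\!\left( 0,\; 2\sigma_g^2 \Phi_i \right), \qquad u_i \sim N\!\left( 0,\; \sigma_e^2 \right).
\]

Under this sketch, G(x) = x gives the proportional hazards model and G(x) = \log(1 + x) gives the proportional odds model, while the baseline cumulative hazard \Lambda_0 is left unspecified.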
