General retrospective mega‐analysis framework for rare variant association tests

Here, we describe a retrospective mega‐analysis framework for gene‐ or region‐based multimarker rare variant association tests. Our proposed mega‐analysis association tests allow investigators to combine longitudinal and cross‐sectional family‐ and/or population‐based studies. This framework can be applied to a continuous, categorical, or survival trait. In addition to autosomal variants, the tests can be applied to conduct mega‐analyses on X‐chromosome variants. Tests were built on study‐specific region‐ or gene‐level quasiscore statistics and, therefore, do not require estimates of effects of individual rare variants. We used the generalized estimating equation approach to account for complex multiple correlation structures between family members, repeated measurements, and genetic markers. While accounting for multilevel correlations and heterogeneity across studies, the test statistics were computationally efficient and feasible for large‐scale sequencing studies. The retrospective aspect of association tests helps alleviate bias due to phenotype‐related sampling and type I errors due to misspecification of phenotypic distribution. We evaluated our developed mega‐analysis methods through comprehensive simulations with varying sample sizes, covariates, population stratification structures, and study designs across multiple studies. To illustrate application of the proposed framework, we conducted a mega‐association analysis combining a longitudinal family study and a cross‐sectional case–control study from Genetic Analysis Workshop 19.

[1]  Dan-Yu Lin,et al.  Meta-analysis of gene-level associations for rare variants based on single-variant statistics. , 2013, American journal of human genetics.

[2]  You‐Gan Wang,et al.  Effects of Variance‐Function Misspecification in Analysis of Longitudinal Data , 2005, Biometrics.

[3]  Lifeng Lin,et al.  Alternative measures of between‐study heterogeneity in meta‐analysis: Reducing the impact of outlying studies , 2017, Biometrics.

[4]  V. A. Villar,et al.  G Protein–Coupled Receptor Kinases: Crucial Regulators of Blood Pressure , 2016, Journal of the American Heart Association.

[5]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[6]  Dan-Yu Lin,et al.  MASS: meta-analysis of score statistics for sequencing studies , 2013, Bioinform..

[7]  Xihong Lin,et al.  Optimal tests for rare variant effects in sequencing association studies. , 2012, Biostatistics.

[8]  Ana-Maria Bosonea,et al.  Molecular Signals Elicited by GPCR Agonists in Hypertension, Cardiovascular Remodeling: Are MMPs and ADAMs Elusive Therapeutic Targets? , 2012, Current hypertension reviews.

[9]  Dajiang J. Liu,et al.  Meta-Analysis of Gene Level Tests for Rare Variant Association , 2013, Nature Genetics.

[10]  M. Rieder,et al.  Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. , 2012, American journal of human genetics.

[11]  Robert J. Lefkowitz,et al.  Seven-transmembrane-spanning receptors and heart function , 2002, Nature.

[12]  T. Beaty,et al.  Incorporation of covariates into multipoint linkage disequilibrium mapping in case‐control studies , 2008, Genetic epidemiology.

[13]  Shuang Feng,et al.  RAREMETAL: fast and powerful meta-analysis for rare variants , 2014, Bioinform..

[14]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[15]  W. Bodmer,et al.  Common and rare variants in multifactorial susceptibility to common diseases , 2008, Nature Genetics.

[16]  V. Gurevich,et al.  G Protein-Coupled Receptor Kinases , 2016, Methods in Pharmacology and Toxicology.

[17]  Daniel J Schaid,et al.  Multiple Genetic Variant Association Testing by Collapsing and Kernel Methods With Pedigree or Population Structured Data , 2013, Genetic epidemiology.

[18]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[19]  D. Bowden,et al.  Region‐based association tests for sequencing data on survival traits , 2017, Genetic epidemiology.

[20]  P. Diggle Analysis of Longitudinal Data , 1995 .

[21]  Thomas Lumley,et al.  META-ANALYSIS OF A RARE-VARIANT ASSOCIATION TEST. , 2011 .

[22]  J. Higgins,et al.  Cochrane Handbook for Systematic Reviews of Interventions , 2010, International Coaching Psychology Review.

[23]  Leukocyte TRP channel gene expressions in patients with non-valvular atrial fibrillation , 2017, Scientific Reports.

[24]  J. Pankow,et al.  Analysis of family- and population-based samples in cohort genome-wide association studies , 2011, Human Genetics.

[25]  Seunggeun Lee,et al.  General framework for meta-analysis of rare variants in sequencing association studies. , 2013, American journal of human genetics.

[26]  Dan-Yu Lin,et al.  A general framework for detecting disease associations with rare variants in sequencing studies. , 2011, American journal of human genetics.

[27]  David C. Glahn,et al.  Omics-squared: human genomic, transcriptomic and phenotypic data for genetic analysis workshop 19 , 2016, BMC Proceedings.

[28]  Dan-Yu Lin,et al.  Meta-analysis for Discovering Rare-Variant Associations: Statistical Methods and Software Programs. , 2015, American journal of human genetics.

[29]  Debashis Ghosh,et al.  Meta-analysis based on weighted ordered P-values for genomic data with heterogeneity , 2014, BMC Bioinformatics.

[30]  Thomas Lumley,et al.  Sequence Kernel Association Test for Survival Traits , 2014, Genetic epidemiology.

[31]  Fang-Chi Hsu,et al.  Generalization of Rare Variant Association Tests for Longitudinal Family Studies , 2016, Genetic epidemiology.

[32]  Claire Infante-Rivard,et al.  Combining case-control and case-trio data from the same population in genetic association analyses: overview of approaches and illustration with a candidate gene study. , 2009, American journal of epidemiology.

[33]  Mary Sara McPeek,et al.  Retrospective Binary-Trait Association Test Elucidates Genetic Architecture of Crohn Disease. , 2016, American journal of human genetics.

[34]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[35]  A. Karr,et al.  Combining cohorts in longitudinal surveys , 2013 .

[36]  D. Thomas,et al.  Bias and efficiency in family-based gene-characterization studies: conditional, prospective, retrospective, and joint likelihoods. , 2000, American journal of human genetics.

[37]  Ellen M. Wijsman,et al.  Genetic Analysis Workshop 19: methods and strategies for analyzing human sequence and gene expression data in extended families and unrelated individuals , 2016, BMC Proceedings.

[38]  S. Bull,et al.  Strategies for genetic association analyses combining unrelated case-control individuals and family trios. , 2012, American journal of epidemiology.

[39]  A. Eckhart,et al.  Regulation of GPCR signaling in hypertension. , 2010, Biochimica et Biophysica Acta.

[40]  Ellen M Wijsman,et al.  Combining Family‐ and Population‐Based Imputation Data for Association Analysis of Rare and Common Variants in Large Pedigrees , 2014, Genetic epidemiology.

[41]  D. Kuonen Saddlepoint approximations for distributions of quadratic forms in normal variables , 1999 .

[42]  David M. Harris,et al.  GPCR signalling in hypertension: role of GRKs. , 2008, Clinical science.

[43]  Dan-Yu Lin,et al.  Meta‐Analysis of Sequencing Studies With Heterogeneous Genetic Associations , 2014, Genetic epidemiology.

[44]  A. Morris,et al.  Transethnic Meta-Analysis of Genomewide Association Studies , 2011, Genetic epidemiology.

[45]  S. Gabriel,et al.  Calibrating a coalescent simulation of human genome sequence variation. , 2005, Genome research.

[46]  Michael J Bamshad,et al.  Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project. , 2016, American journal of human genetics.

[47]  J. Meigs,et al.  Sequence Kernel Association Test for Quantitative Traits in Family Samples , 2013, Genetic epidemiology.