Gene Level Meta-Analysis of Quantitative Traits by Functional Linear Models

Meta-analysis of genetic data must account for differences among studies, including study designs, markers genotyped, and covariates. The effects of genetic variants may also differ from population to population, i.e., they may be heterogeneous. Combining data from multiple studies is therefore difficult, and novel statistical methods for meta-analysis are needed. In this article, functional linear models are developed for meta-analyses that connect genetic data to quantitative traits, adjusting for covariates. The models can be used to analyze rare variants, common variants, or a combination of the two. Both likelihood-ratio test (LRT) statistics and F-distributed statistics are introduced to test for association between quantitative traits and multiple variants in one genetic region. Extensive simulations are performed to evaluate the empirical type I error rates and power of the proposed tests. The proposed LRT and F-distributed statistics control the type I error rate very well and have higher power than the existing meta-analysis sequence kernel association test (MetaSKAT). We analyze four blood lipid levels in data from a meta-analysis of eight European studies. The proposed methods detect more significant associations than MetaSKAT, and the P-values of the proposed LRT and F-distributed statistics are usually much smaller than those of MetaSKAT. The functional linear models and related test statistics can be useful in whole-genome and whole-exome association studies.
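To make the idea of an F-distributed statistic from a functional linear model more concrete, the following is a minimal sketch and not the article's implementation. It assumes a simplified setting that the abstract does not specify: the genetic effect is a smooth function of variant position, expanded in a small Fourier basis and shared across studies (a homogeneous-effect assumption), while each study has its own intercept and covariate effects. The function names `fourier_basis`, `meta_f_test`, and `simulate_study`, the basis choice, and the toy data are all hypothetical.

```python
import numpy as np


def fourier_basis(positions, n_basis):
    """Evaluate the first n_basis Fourier basis functions at positions in [0, 1]."""
    t = np.asarray(positions, dtype=float)
    cols = [np.ones_like(t)]
    k = 1
    while len(cols) < n_basis:
        cols.append(np.sin(2 * np.pi * k * t))
        if len(cols) < n_basis:
            cols.append(np.cos(2 * np.pi * k * t))
        k += 1
    return np.column_stack(cols)  # shape (n_variants, n_basis)


def residual_ss(y, X):
    """Residual sum of squares of the least-squares fit of y on X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return float(resid @ resid)


def meta_f_test(studies, n_basis=5):
    """F statistic for a genetic effect function shared across studies (assumed).

    Each study is a dict with y (n,), Z (n, c), G (n, m) and pos (m,) in [0, 1].
    Studies may genotype different variants; only the basis is common.
    """
    n_cov = [1 + s["Z"].shape[1] for s in studies]
    n_obs = [len(s["y"]) for s in studies]
    y = np.concatenate([s["y"] for s in studies])
    # Null design: study-specific intercepts and covariate effects (block-diagonal).
    X0 = np.zeros((sum(n_obs), sum(n_cov)))
    row = col = 0
    W_blocks = []
    for s, n, c in zip(studies, n_obs, n_cov):
        X0[row:row + n, col] = 1.0
        X0[row:row + n, col + 1:col + c] = s["Z"]
        # Project genotypes onto the basis: W[i, k] = sum_j G[i, j] * B_k(pos_j).
        W_blocks.append(s["G"] @ fourier_basis(s["pos"], n_basis))
        row += n
        col += c
    # Full design: null columns plus the basis-projected genotype columns.
    X1 = np.hstack([X0, np.vstack(W_blocks)])
    rank0, rank1 = np.linalg.matrix_rank(X0), np.linalg.matrix_rank(X1)
    df1, df2 = int(rank1 - rank0), int(len(y) - rank1)
    rss0, rss1 = residual_ss(y, X0), residual_ss(y, X1)
    f_stat = ((rss0 - rss1) / df1) / (rss1 / df2)
    return f_stat, df1, df2


def simulate_study(rng, n, m, effect):
    """Hypothetical toy data: m variants at random positions, one covariate."""
    pos = np.sort(rng.uniform(0.0, 1.0, size=m))
    G = rng.binomial(2, 0.2, size=(n, m)).astype(float)
    Z = rng.normal(size=(n, 1))
    y = 0.5 * Z[:, 0] + effect * G.sum(axis=1) + rng.normal(size=n)
    return {"y": y, "Z": Z, "G": G, "pos": pos}
```

Given the returned statistic and degrees of freedom, a P-value could be read from the upper tail of the F distribution, for example with `scipy.stats.f.sf(f_stat, df1, df2)`. Allowing a separate set of basis coefficients per study would be one way to relax the homogeneous-effect assumption made here.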
