Regionally Smoothed Meta‐Analysis Methods for GWAS Datasets

Genome‐wide association studies are proven tools for finding disease genes, but it is often necessary to combine many cohorts into a meta‐analysis to detect statistically significant genetic effects. Often the component studies are performed by different investigators on different populations, using different chips with minimal SNPs overlap. In some cases, raw data are not available for imputation so that only the genotyped single nucleotide polymorphisms (SNPs) results can be used in meta‐analysis. Even when SNP sets are comparable, different cohorts may have peak association signals at different SNPs within the same gene due to population differences in linkage disequilibrium or environmental interactions. We hypothesize that the power to detect statistical signals in these situations will improve by using a method that simultaneously meta‐analyzes and smooths the signal over nearby markers. In this study, we propose regionally smoothed meta‐analysis methods and compare their performance on real and simulated data.

[1]  D. Weeks,et al.  Genome-wide Association Scan for Childhood Caries Implicates Novel Genes , 2011, Journal of dental research.

[2]  Andrew G. Clark,et al.  Gene-Based Testing of Interactions in Association Studies of Quantitative Traits , 2013, PLoS genetics.

[3]  Fan Zhang,et al.  IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis , 2012, BMC Bioinformatics.

[4]  A. Gardner Methods of Statistics , 1941 .

[5]  M H Gail,et al.  Probability that a Two‐Stage Genome‐Wide Association Study will Detect a Disease‐Associated SNP and Implications for Multistage Designs , 2008, Annals of human genetics.

[6]  Xiaodong Wang,et al.  Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data , 2012, BMC Genetics.

[7]  Terri H Beaty,et al.  A graphical assessment of p-values from sliding window haplotype tests of association to identify asthma susceptibility loci on chromosome 11q , 2006, BMC Genetics.

[8]  GusfieldDan Introduction to the IEEE/ACM Transactions on Computational Biology and Bioinformatics , 2004 .

[9]  Kai Wang,et al.  ATOM: a powerful gene-based association test by combining optimally weighted markers , 2009, Bioinform..

[10]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[11]  Joshua M. Korn,et al.  Association between microdeletion and microduplication at 16p11.2 and autism. , 2008, The New England journal of medicine.

[12]  Johnny S. H. Kwan,et al.  HYST: a hybrid set-based test for genome-wide association studies, with application to protein-protein interaction-based association analysis. , 2012, American journal of human genetics.

[13]  Evan E. Eichler,et al.  A hot spot of genetic instability in autism. , 2008, The New England journal of medicine.

[14]  D. Gudbjartsson,et al.  Common and low-frequency variants associated with genome-wide recombination rate , 2013, Nature Genetics.

[15]  Qiang Yang,et al.  HapBoost: A Fast Approach to Boosting Haplotype Association Analyses in Genome-Wide Association Studies , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  D. Curtis,et al.  A simple method for assessing the strength of evidence for association at the level of the whole gene , 2008, Advances and applications in bioinformatics and chemistry : AABC.

[17]  William Wheeler,et al.  Probability of detecting disease-associated single nucleotide polymorphisms in case-control genome-wide association studies. , 2008, Biostatistics.

[18]  Detecting susceptibility genes for rheumatoid arthritis based on a novel sliding-window approach , 2009, BMC proceedings.

[19]  Eleanor Feingold,et al.  Genetic Analysis of Variation in Human Meiotic Recombination , 2009, PLoS genetics.

[20]  Qiuying Sha,et al.  A Variable‐Sized Sliding‐Window Approach for Genetic Association Studies via Principal Component Analysis , 2009, Annals of human genetics.

[21]  P. Rosenberg,et al.  Pathway analysis by adaptive combination of P‐values , 2009, Genetic epidemiology.

[22]  C I Amos,et al.  Detecting haplotype effects in genomewide association studies , 2007, Genetic epidemiology.

[23]  D. Clayton,et al.  Transmission/disequilibrium tests for extended marker haplotypes. , 1999, American journal of human genetics.

[24]  Jia Li,et al.  An adaptively weighted statistic for detecting differential gene expression when combining multiple transcriptomic studies , 2011, 1108.3180.

[25]  Ying-Chao Lin,et al.  Using Maximal Segmental Score in Genome‐Wide Association Studies , 2012, Genetic epidemiology.

[26]  Aaron J. Lorenz,et al.  Performance of Single Nucleotide Polymorphisms versus Haplotypes for Genome-Wide Association Analysis in Barley , 2010, PloS one.

[27]  Michael Weiner,et al.  Voxelwise gene-wide association study (vGeneWAS): Multivariate gene-based association testing in 731 elderly subjects , 2011, NeuroImage.

[28]  G. Tseng,et al.  Comprehensive literature review and statistical considerations for GWAS meta-analysis , 2012, Nucleic acids research.

[29]  T. Dawber,et al.  Epidemiological approaches to heart disease: the Framingham Study. , 1951, American journal of public health and the nation's health.