Meta-Analysis in Genome-Wide Association Datasets: Strategies and Application in Parkinson Disease

Background Genome-wide association studies hold substantial promise for identifying common genetic variants that regulate susceptibility to complex diseases. However, for the detection of small genetic effects, single studies may be underpowered. Power may be improved by combining genome-wide datasets with meta-analytic techniques. Methodology/Principal Findings Both single and two-stage genome-wide data may be combined and there are several possible strategies. In the two-stage framework, we considered the options of (1) enhancement of replication data and (2) enhancement of first-stage data, and then, we also considered (3) joint meta-analyses including all first-stage and second-stage data. These strategies were examined empirically using data from two genome-wide association studies (three datasets) on Parkinson disease. In the three strategies, we derived 12, 5, and 49 single nucleotide polymorphisms that show significant associations at conventional levels of statistical significance. None of these remained significant after conservative adjustment for the number of performed analyses in each strategy. However, some may warrant further consideration: 6 SNPs were identified with at least 2 of the 3 strategies and 3 SNPs [rs1000291 on chromosome 3, rs2241743 on chromosome 4 and rs3018626 on chromosome 11] were identified with all 3 strategies and had no or minimal between-dataset heterogeneity (I2 = 0, 0 and 15%, respectively). Analyses were primarily limited by the suboptimal overlap of tested polymorphisms across different datasets (e.g., only 31,192 shared polymorphisms between the two tier 1 datasets). Conclusions/Significance Meta-analysis may be used to improve the power and examine the between-dataset heterogeneity of genome-wide association studies. Prospective designs may be most efficient, if they try to maximize the overlap of genotyping platforms and anticipate the combination of data across many genome-wide association studies.

[1]  T C Chalmers,et al.  Cumulative meta-analysis of therapeutic trials for myocardial infarction. , 1992, The New England journal of medicine.

[2]  J. Fleiss Review papers : The statistical basis of meta-analysis , 1993 .

[3]  Diana B. Petitti,et al.  Meta-Analysis, Decision Analysis, and Cost-Effectiveness Analysis: Methods for Quantitative Synthesis in Medicine , 1994 .

[4]  J. Ioannidis,et al.  Quantitative Synthesis in Systematic Reviews , 1997, Annals of Internal Medicine.

[5]  F. J. Livesey,et al.  Netrin and Netrin Receptor Expression in the Embryonic Mammalian Nervous System Suggests Roles in Retinal, Striatal, Nigral, and Cerebellar Development , 1997, Molecular and Cellular Neuroscience.

[6]  S. Ackerman,et al.  Cloning and mapping of the UNC5C gene to human chromosome 4q21-q23. , 1998, Genomics.

[7]  Christopher H Schmid,et al.  Summing up evidence: one answer is not always enough , 1998, The Lancet.

[8]  J. Ioannidis,et al.  Replication validity of genetic association studies , 2001, Nature Genetics.

[9]  L. Hedges,et al.  The power of statistical tests in meta-analysis. , 2001, Psychological methods.

[10]  Thomas A Trikalinos,et al.  Genetic associations in large versus small studies: an empirical assessment , 2003, The Lancet.

[11]  John P A Ioannidis,et al.  Genetic associations: false or true? , 2003, Trends in molecular medicine.

[12]  E. Lander,et al.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease , 2003, Nature Genetics.

[13]  B. Becker,et al.  How meta-analysis increases statistical power. , 2003, Psychological methods.

[14]  D. Altman,et al.  Measuring inconsistency in meta-analyses , 2003, BMJ : British Medical Journal.

[15]  Nathaniel Rothman,et al.  Assessing the Probability That a Positive Report is False: An Approach for Molecular Epidemiology Studies , 2004 .

[16]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[17]  Stephen P. Daiger,et al.  Was the Human Genome Project Worth the Effort? , 2005, Science.

[18]  D. Clayton,et al.  Genome-wide association studies: theoretical and practical concerns , 2005, Nature Reviews Genetics.

[19]  Mariza de Andrade,et al.  High-resolution whole-genome association study of Parkinson disease. , 2005, American journal of human genetics.

[20]  J. Ott,et al.  Complement Factor H Polymorphism in Age-Related Macular Degeneration , 2005, Science.

[21]  Paolo Vineis,et al.  A network of investigator networks in human genome epidemiology. , 2005, American journal of epidemiology.

[22]  J. Ioannidis,et al.  Relative Citation Impact of Various Study Designs in the Health Sciences , 2005, JAMA.

[23]  Sonja W. Scholz,et al.  Genome-wide genotyping in Parkinson's disease and neurologically normal controls: first stage analysis and public release of data , 2006, The Lancet Neurology.

[24]  Lon R Cardon,et al.  Evaluating coverage of genome-wide association studies , 2006, Nature Genetics.

[25]  Thomas A Trikalinos,et al.  Implications of small effect sizes of individual genetic variants on the design and interpretation of genetic association studies of complex diseases. , 2006, American journal of epidemiology.

[26]  John S Witte,et al.  Opinion: A gene-centric approach to genome-wide association studies , 2006, Nature Reviews Genetics.

[27]  Ling Lin,et al.  Axonal Growth Regulation of Fetal and Embryonic Stem Cell‐Derived Dopaminergic Neurons by Netrin‐1 and Slits , 2006, Stem cells.

[28]  L. Cardon Delivering New Disease Genes , 2006, Science.

[29]  D. Clayton,et al.  A genome-wide association study of nonsynonymous SNPs identifies a type 1 diabetes locus in the interferon-induced helicase (IFIH1) region , 2006, Nature Genetics.

[30]  R. Myers,et al.  Lack of replication of thirteen single-nucleotide polymorphisms implicated in Parkinson's disease: a large-scale international study , 2006, The Lancet Neurology.

[31]  J. Todd Statistical false positive or true disease pathway? , 2006, Nature Genetics.

[32]  G. Abecasis,et al.  Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies , 2006, Nature Genetics.

[33]  F. Hu,et al.  A Common Genetic Variant Is Associated with Adult and Childhood Obesity , 2006, Science.