Imputation-Based Meta-Analysis of Severe Malaria in Three African Populations

Combining data from genome-wide association studies (GWAS) conducted at different locations, using genotype imputation and fixed-effects meta-analysis, has been a powerful approach for dissecting complex disease genetics in populations of European ancestry. Here we investigate the feasibility of applying the same approach in Africa, where genetic diversity, both within and between populations, is far more extensive. We analyse genome-wide data from approximately 5,000 individuals with severe malaria and 7,000 population controls from three different locations in Africa. Our results show that the standard approach is well powered to detect known malaria susceptibility loci when sample sizes are large, and that modern methods for association analysis can control the potential confounding effects of population structure. We show that the pattern of association around the haemoglobin S allele differs substantially across populations due to differences in haplotype structure. Motivated by these observations, we consider new approaches to association analysis that might prove valuable for multicentre GWAS in Africa: we relax the assumptions of SNP-based fixed-effects analysis; we apply Bayesian approaches to allow for heterogeneity in the effect of an allele on risk across studies; and we introduce a region-based test to allow for heterogeneity in the location of causal alleles.
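
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of a standard inverse-variance fixed-effects meta-analysis at a single SNP, together with Cochran's Q as a simple heterogeneity statistic. It is not the authors' code: the function name `fixed_effect_meta`, the use of Cochran's Q, and the per-cohort numbers in the usage example are all assumptions made for illustration, and the inputs are assumed to be per-study log odds ratios with their standard errors.

```python
import math


def fixed_effect_meta(betas, ses):
    """Inverse-variance fixed-effects meta-analysis at one SNP.

    betas: per-study effect estimates (assumed here to be log odds ratios).
    ses:   per-study standard errors of those estimates.
    Returns a dict with the pooled estimate, its standard error, the z-score,
    a two-sided normal p-value, and Cochran's Q (heterogeneity statistic).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and of equal length")

    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in ses]
    total_weight = sum(weights)

    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se

    # Two-sided p-value under a standard normal null: P(|Z| > |z|).
    p = math.erfc(abs(z) / math.sqrt(2.0))

    # Cochran's Q: weighted squared deviations from the pooled estimate.
    # Large values suggest the single-shared-effect assumption is violated.
    q = sum(w * (b - pooled_beta) ** 2 for w, b in zip(weights, betas))

    return {"beta": pooled_beta, "se": pooled_se, "z": z, "p": p, "q": q}


if __name__ == "__main__":
    # Hypothetical numbers for three cohorts; NOT taken from the study.
    result = fixed_effect_meta([-0.35, -0.10, -0.28], [0.08, 0.09, 0.11])
    print(result)
```

The fixed-effects model assumes one shared effect size across all studies, which is the assumption the abstract describes relaxing; a large Q relative to its degrees of freedom (number of studies minus one) is one conventional sign that this assumption may not hold.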
