Identity-by-descent analyses for measuring population dynamics and selection in recombining pathogens

Identification of genomic regions that are identical by descent (IBD) has proven useful for human genetic studies where analyses have led to the discovery of familial relatedness and fine-mapping of disease critical regions. Unfortunately however, IBD analyses have been underutilized in analysis of other organisms, including human pathogens. This is in part due to the lack of statistical methodologies for non-diploid genomes in addition to the added complexity of multiclonal infections. As such, we have developed an IBD methodology, called isoRelate, for analysis of haploid recombining microorganisms in the presence of multiclonal infections. Using the inferred IBD status at genomic locations, we have also developed a novel statistic for identifying loci under positive selection and propose relatedness networks as a means of exploring shared haplotypes within populations. We evaluate the performance of our methodologies for detecting IBD and selection, including comparisons with existing tools, then perform an exploratory analysis of whole genome sequencing data from a global Plasmodium falciparum dataset of more than 2500 genomes. This analysis identifies Southeast Asia as having many highly related isolates, possibly as a result of both reduced transmission from intensified control efforts and population bottlenecks following the emergence of antimalarial drug resistance. Many signals of selection are also identified, most of which overlap genes that are known to be associated with drug resistance, in addition to two novel signals observed in multiple countries that have yet to be explored in detail. Additionally, we investigate relatedness networks over the selected loci and determine that one of these sweeps has spread between continents while the other has arisen independently in different countries. IBD analysis of microorganisms using isoRelate can be used for exploring population structure, positive selection and haplotype distributions, and will be a valuable tool for monitoring disease control and elimination efforts of many diseases.

[1]  Brian L Browning,et al.  Identity by descent between distant relatives: detection and applications. , 2012, Annual review of genetics.

[2]  Jonathan Crabtree,et al.  Comparative genomics of the neglected human malaria parasite Plasmodium vivax , 2008, Nature.

[3]  A. Hughes,et al.  Very large long–term effective population size in the virulent human malaria parasite Plasmodium falciparum , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[4]  S. Schaffner,et al.  hmmIBD: software to infer pairwise identity by descent between haploid genotypes , 2017, bioRxiv.

[5]  B. Genton,et al.  A molecular marker of artemisinin-resistant Plasmodium falciparum malaria , 2013, Nature.

[6]  Richard J Maude,et al.  Spatial and temporal epidemiology of clinical malaria in Cambodia 2004–2013 , 2014, Malaria Journal.

[7]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[8]  S. Schaffner,et al.  Harnessing genomics and genome biology to understand malaria biology , 2012, Nature Reviews Genetics.

[9]  Gil McVean,et al.  Indels, structural variation, and recombination drive genomic diversity in Plasmodium falciparum , 2016, Genome research.

[10]  R. Nielsen,et al.  Inferring Demographic History from a Spectrum of Shared Haplotype Lengths , 2013, PLoS genetics.

[11]  Daniel L. K. Yamins,et al.  Identification and Functional Validation of the Novel Antimalarial Resistance Locus PF10_0355 in Plasmodium falciparum , 2011, PLoS genetics.

[12]  S. Schaffner,et al.  Modeling malaria genomics reveals transmission decline and rebound in Senegal , 2015, Proceedings of the National Academy of Sciences.

[13]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[14]  Leping Li,et al.  ART: a next-generation sequencing read simulator , 2012, Bioinform..

[15]  Geoffrey L. Johnston,et al.  Mitotic Evolution of Plasmodium falciparum Shows a Stable Core Genome but Recombination in Antigen Families , 2013, PLoS genetics.

[16]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[17]  Nicholas P. J. Day,et al.  Genomic epidemiology of artemisinin resistant malaria. , 2016, eLife.

[18]  Anders Albrechtsen,et al.  Natural Selection and the Distribution of Identity-by-Descent in the Human Genome , 2010, Genetics.

[19]  D. Kwiatkowski,et al.  Characterizing the impact of sustained sulfadoxine/pyrimethamine use upon the Plasmodium falciparum population in Malawi , 2016, Malaria Journal.

[20]  S. Schaffner,et al.  SNP Genotyping Identifies New Signatures of Selection in a Deep Sample of West African Plasmodium falciparum Malaria Parasites , 2012, Molecular biology and evolution.

[21]  Victoria C. Corey,et al.  Mapping the malaria parasite druggable genome by using in vitro evolution and chemogenomics , 2017, Science.

[22]  L. Almasy,et al.  Multipoint quantitative-trait linkage analysis in general pedigrees. , 1998, American journal of human genetics.

[23]  G. McVean,et al.  Recombination Hotspots and Population Structure in Plasmodium falciparum , 2005, PLoS biology.

[24]  E. Holmes,et al.  Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[25]  P. Newton,et al.  Adaptive Copy Number Evolution in Malaria Parasites , 2008, PLoS genetics.

[26]  E. Thompson Identity by Descent: Variation in Meiosis, Across Genomes, and in Populations , 2013, Genetics.

[27]  Mark Abney,et al.  Using identity by descent estimation with dense genotype data to detect positive selection , 2012, European Journal of Human Genetics.

[28]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[29]  John C. Tan,et al.  Independent emergence of artemisinin resistance mutations among Plasmodium falciparum in Southeast Asia. , 2015, The Journal of infectious diseases.

[30]  Pardis C Sabeti,et al.  Sequence-based association and selection scans identify drug resistance loci in the Plasmodium falciparum malaria parasite , 2012, Proceedings of the National Academy of Sciences.

[31]  D Payne,et al.  Spread of chloroquine resistance in Plasmodium falciparum. , 1987, Parasitology today.

[32]  Yingyao Zhou,et al.  A Systematic Map of Genetic Variation in Plasmodium falciparum , 2006 .

[33]  G. N. Hannan,et al.  Estimating genotyping error rates from Mendelian errors in SNP array genotypes and their impact on inference. , 2007, Genomics.

[34]  J. T. Williams,et al.  Microsatellite markers reveal a spectrum of population structures in the malaria parasite Plasmodium falciparum. , 2000, Molecular biology and evolution.

[35]  Gil McVean,et al.  Deconvolution of multiple infections in Plasmodium falciparum from high throughput sequencing data , 2017, bioRxiv.

[36]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[37]  Anders Albrechtsen,et al.  Relatedness mapping and tracts of relatedness for genome‐wide data in the presence of linkage disequilibrium , 2009, Genetic epidemiology.

[38]  Allison D. Griggs,et al.  Genetic relatedness analysis reveals the cotransmission of genetically related Plasmodium falciparum parasites in Thiès, Senegal , 2017, Genome Medicine.

[39]  Taane G. Clark,et al.  Detecting and characterizing genomic signatures of positive selection in global populations. , 2013, American journal of human genetics.

[40]  Adele M. Lehane,et al.  Globally prevalent PfMDR1 mutations modulate Plasmodium falciparum susceptibility to artemisinin-based combination therapies , 2016, Nature Communications.

[41]  A. Cowman,et al.  Modulation of PF10_0355 (MSPDBL2) Alters Plasmodium falciparum Response to Antimalarial Drugs , 2013, Antimicrobial Agents and Chemotherapy.

[42]  David Wakeham,et al.  XIBD: software for inferring pairwise identity by descent on the X chromosome , 2016, Bioinform..

[43]  John C. Wootton,et al.  Genetic diversity and chloroquine selective sweeps in Plasmodium falciparum , 2002, Nature.

[44]  Pardis C Sabeti,et al.  Detecting recent positive selection in the human genome from haplotype structure , 2002, Nature.

[45]  Pardis C Sabeti,et al.  Genome-wide detection and characterization of positive selection in human populations , 2007, Nature.

[46]  Gilean McVean,et al.  Multiple populations of artemisinin-resistant Plasmodium falciparum in Cambodia , 2013, Nature Genetics.

[47]  T. Taylor,et al.  Return of chloroquine antimalarial efficacy in Malawi. , 2006, The New England journal of medicine.

[48]  Philipp W. Messer,et al.  SLiM: Simulating Evolution with Selection and Linkage , 2013, Genetics.

[49]  John C. Tan,et al.  Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing , 2012, Nature.

[50]  D. Falush,et al.  Inference of Population Structure using Dense Haplotype Data , 2012, PLoS genetics.

[51]  D. Conway,et al.  Exceptionally long-range haplotypes in Plasmodium falciparum chromosome 6 maintained in an endemic African population , 2016, Malaria Journal.

[52]  Philip Montgomery,et al.  Genome-wide SNP genotyping highlights the role of natural selection in Plasmodium falciparum population divergence , 2008, Genome Biology.

[53]  Chaolong Wang,et al.  Inference of unexpected genetic relatedness among individuals in HapMap Phase III. , 2010, American journal of human genetics.

[54]  P. Roepe,et al.  Evolution of a unique Plasmodium falciparum chloroquine-resistance phenotype in association with pfcrt polymorphism in Papua New Guinea and South America , 2001, Proceedings of the National Academy of Sciences of the United States of America.