hmmIBD: software to infer pairwise identity by descent between haploid genotypes

BackgroundA number of recent malaria studies have used identity by descent (IBD) to study epidemiological processes relevant to malaria control. In this paper, a software package, hmmIBD, is introduced for estimating pairwise IBD between haploid genomes, such as those of the malaria parasite, sampled from one or two populations. Source code is freely available.MethodsThe performance of hmmIBD was verified using simulated data and benchmarked against an existing method for detecting IBD within populations. Code for all tests is freely available. The utility of hmmIBD for detecting IBD across populations was demonstrated using Plasmodium falciparum data from Cambodia and Ghana.ResultsAlongside an existing method, hmmIBD was highly accurate, sensitive and specific. It is fast, requiring only 70 s on average to analyse 50 whole genome sequences on a laptop computer, and scales linearly in the number of pairwise comparisons. Treatment of different populations under hmmIBD improves detection of IBD across populations.ConclusionFast and accurate software for detecting IBD in malaria parasite genetic data sampled from one or two populations is presented. The latter will likely be a useful feature for malaria elimination efforts, since it could facilitate identification of imported malaria cases. Software is robust to possible misspecification of the genotyping error and the recombination rate. However, exclusion of data in regions whose rates vary greatly from their genome-wide average is recommended.

[1]  Caroline O Buckee,et al.  Quantifying connectivity between local Plasmodium falciparum malaria parasite populations using identity by descent , 2017, PLoS genetics.

[2]  T. Wellems,et al.  Genetic mapping of the chloroquine-resistance locus on Plasmodium falciparum chromosome 7. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Gil McVean,et al.  Indels, structural variation, and recombination drive genomic diversity in Plasmodium falciparum , 2016, Genome research.

[4]  S. Schaffner,et al.  Modeling malaria genomics reveals transmission decline and rebound in Senegal , 2015, Proceedings of the National Academy of Sciences.

[5]  Brian L Browning,et al.  Identity by descent between distant relatives: detection and applications. , 2012, Annual review of genetics.

[6]  John Blangero,et al.  Benchmarking Relatedness Inference Methods with Genome-Wide Data from Thousands of Relatives , 2017, Genetics.

[7]  David Wakeham,et al.  XIBD: software for inferring pairwise identity by descent on the X chromosome , 2016, Bioinform..

[8]  John C. Wootton,et al.  Genetic diversity and chloroquine selective sweeps in Plasmodium falciparum , 2002, Nature.

[9]  Edward A. Wenger,et al.  Modeling the genetic relatedness of Plasmodium falciparum parasites following meiotic recombination and cotransmission , 2018, PLoS Comput. Biol..

[10]  E. Thompson Identity by Descent: Variation in Meiosis, Across Genomes, and in Populations , 2013, Genetics.

[11]  Xiaofeng Zhu,et al.  Single-trait and multi-trait genome-wide association analyses identify novel loci for blood pressure in African-ancestry populations , 2017, PLoS genetics.

[12]  François Nosten,et al.  Longitudinal genomic surveillance of Plasmodium falciparum malaria parasites reveals complex genomic architecture of emerging artemisinin resistance , 2017, Genome Biology.

[13]  J. Le bras,et al.  Invasion of Africa by a single pfcrt allele of South East Asian type , 2006, Malaria Journal.

[14]  Allison D. Griggs,et al.  Genetic relatedness analysis reveals the cotransmission of genetically related Plasmodium falciparum parasites in Thiès, Senegal , 2017, Genome Medicine.

[15]  Amy L. Williams,et al.  A performance assessment of relatedness inference methods using genome-wide data from thousands of relatives , 2017, bioRxiv.

[16]  Melanie Bahlo,et al.  Detecting Selection Signals In Plasmodium falciparum Using Identity-By-Descent Analysis , 2016, bioRxiv.

[17]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.