coiaf: Directly estimating complexity of infection with allele frequencies

In malaria, individuals are often infected with different parasite strains; the complexity of infection (COI) is defined as the number of genetically distinct parasite strains in an individual. Changes in the mean COI in a population have been shown to be informative of changes in transmission intensity with a number of probabilistic likelihood and Bayesian models now developed to estimate the COI. However, rapid, direct measures based on heterozygosity or FwS do not properly represent the COI. In this work, we present two new methods that use easily calculated measures to directly estimate the COI from allele frequency data. Using a simulation framework, we show that our methods are computationally efficient and comparably accurate to current methods in the literature. Through a sensitivity analysis, we characterize how the bias and accuracy of our two methods are impacted by the distribution of parasite densities and the assumed sequencing depth and number of sampled loci. We further estimate the COI globally from Plasmodium falciparum sequencing data using our developed methods and compare the results against the literature. We show significant differences in estimated COI globally between continents and a weak relationship between malaria prevalence and COI. Author summary Computational models, used in conjunction with rapidly advancing sequencing technologies, are increasingly being used to help inform surveillance efforts and understand the epidemiological dynamics of malaria. One such important metric, the complexity of infection (COI), indirectly quantifies the level of transmission. Existing “gold-standard” COI measures rely on complex probabilistic likelihood and Bayesian models. As an alternative, we have developed the statistics and software package coiaf, which features two rapid, direct measures to estimate of the number of genetically distinct parasite strains in an individual (the COI). Our methods were evaluated using simulated data and subsequently compared to current “state-of-the-art” methods, yielding comparable results. Lastly, we examined the distribution of the COI in several locations across the world, identifying significant differences in COI between continents. coiaf, therefore, provides a new, promising framework for rapidly characterizing polyclonal infections.

[1]  Soumita Modak Finding groups in data: an introduction to cluster analysis , 2023, Journal of Applied Statistics.

[2]  D. Monos,et al.  Next-generation sequencing technologies: An overview. , 2021, Human immunology.

[3]  C. Koepfli,et al.  Systematic review of Plasmodium falciparum and Plasmodium vivax polyclonal infections: Impact of prevalence, study population characteristics, and laboratory procedures , 2021, bioRxiv.

[4]  T. Lefèvre,et al.  Determinants of Plasmodium falciparum multiplicity of infection and genetic diversity in Burkina Faso , 2020, Parasites & Vectors.

[5]  M. Grillet,et al.  Malaria in Venezuela: changes in the complexity of infection reflects the increment in transmission intensity , 2020, Malaria Journal.

[6]  Xiaoping Zhou,et al.  Genetic diversity of Plasmodium falciparum and genetic profile in children affected by uncomplicated malaria in Cameroon , 2020, Malaria Journal.

[7]  J. Hester High Precision Timing of R Expressions [R package bench version 1.1.1] , 2020 .

[8]  T. Anderson,et al.  Co-transmission of Related Malaria Parasite Lineages Shapes Within-Host Parasite Diversity. , 2019, Cell host & microbe.

[9]  Anna E. Jeffreys,et al.  Insights into malaria susceptibility using genome-wide data on 17,000 individuals from Africa, Asia and Oceania , 2019, Nature Communications.

[10]  Theerarat Kochakarn,et al.  The resistome and genomic reconnaissance in the age of malaria elimination , 2019, Disease Models & Mechanisms.

[11]  D. Kwiatkowski,et al.  An open dataset of Plasmodium falciparum genome variation in 7,000 worldwide samples , 2019, bioRxiv.

[12]  Hannah C. Slater,et al.  Evaluating the Performance of Malaria Genetics for Inferring Changes in Transmission Intensity Using Transmission Modeling , 2019, bioRxiv.

[13]  Su Yun Kang,et al.  Mapping the global prevalence, incidence, and mortality of Plasmodium falciparum, 2000–17: a spatial and temporal modelling study , 2019, The Lancet.

[14]  J. Bailey,et al.  The impact of antimalarial resistance on the genetic structure of Plasmodium falciparum in the DRC , 2019, Nature Communications.

[15]  Hannah C. Slater,et al.  Impact of seasonal variations in Plasmodium falciparum malaria transmission on the surveillance of pfhrp2 gene deletions , 2019, eLife.

[16]  P. Gething,et al.  malariaAtlas: an R interface to global malariometric data hosted by the Malaria Atlas Project , 2018, Malaria Journal.

[17]  Vincent Arel-Bundock,et al.  countrycode: An R package to convert country names and country codes , 2018, J. Open Source Softw..

[18]  Gil McVean,et al.  The origins and relatedness structure of mixed infections vary with local prevalence of P. falciparum malaria , 2018, bioRxiv.

[19]  Vineeta Singh,et al.  Role of genetic factors and ethnicity on the multiplicity of Plasmodium falciparum infection in children with asymptomatic malaria in Yaoundé, Cameroon , 2018, Heliyon.

[20]  G. Yan,et al.  Molecular approaches to determine the multiplicity of Plasmodium infections , 2018, Malaria Journal.

[21]  J. Ohashi,et al.  Rapid selection of sulphadoxine-resistant Plasmodium falciparum and its effect on within-population genetic diversity in Papua New Guinea , 2018, Scientific Reports.

[22]  D. Hartl,et al.  Dramatic Changes in Malaria Population Genetic Complexity in Dielmo and Ndiop, Senegal, Revealed Using Genomic Surveillance , 2018, The Journal of infectious diseases.

[23]  Edward A. Wenger,et al.  Modeling the genetic relatedness of Plasmodium falciparum parasites following meiotic recombination and cotransmission , 2018, PLoS Comput. Biol..

[24]  G. Benelli,et al.  Current vector control challenges in the fight against malaria. , 2017, Acta tropica.

[25]  David L. Smith,et al.  Quantification of anti-parasite and anti-disease immunity to malaria as a function of age and exposure , 2017, bioRxiv.

[26]  Hannah C. Slater,et al.  Modelling the drivers of the spread of Plasmodium falciparum hrp2 gene deletions in sub-Saharan Africa , 2017, eLife.

[27]  P. Siba,et al.  Higher Complexity of Infection and Genetic Diversity of Plasmodium vivax Than Plasmodium falciparum Across All Malaria Transmission Zones of Papua New Guinea. , 2017, The American journal of tropical medicine and hygiene.

[28]  Gil McVean,et al.  Deconvolution of multiple infections in Plasmodium falciparum from high throughput sequencing data , 2017, bioRxiv.

[29]  Allison D. Griggs,et al.  Genetic relatedness analysis reveals the cotransmission of genetically related Plasmodium falciparum parasites in Thiès, Senegal , 2017, Genome Medicine.

[30]  Allison D. Griggs,et al.  Genetic relatedness analysis reveals the cotransmission of genetically related Plasmodium falciparum parasites in Thiès, Senegal , 2017, Genome Medicine.

[31]  Caroline O. Buckee,et al.  THE REAL McCOIL: A method for the concurrent estimation of the complexity of infection and SNP allele frequency for malaria parasites , 2017, PLoS Comput. Biol..

[32]  D. Gurarie,et al.  Spatial Effects on the Multiplicity of Plasmodium falciparum Infections , 2016, PloS one.

[33]  N. Duah,et al.  Genetic diversity of Plasmodium falciparum isolates from uncomplicated malaria cases in Ghana over a decade , 2016, Parasites & Vectors.

[34]  R. Haque,et al.  Plasmodium falciparum Genetic Diversity in Bangladesh Does Not Suggest a Hypoendemic Population Structure. , 2016, The American journal of tropical medicine and hygiene.

[35]  Jian Wang,et al.  Empirical estimation of sequencing error rates using smoothing splines , 2016, BMC Bioinformatics.

[36]  S. P. Kachur,et al.  Within-host competition and drug resistance in the human malaria parasite Plasmodium falciparum , 2016, Proceedings of the Royal Society B: Biological Sciences.

[37]  B. Grenfell,et al.  The potential impact of coinfection on antimicrobial chemotherapy and drug resistance. , 2015, Trends in microbiology.

[38]  S. Ghogomu,et al.  Plasmodium falciparum merozoite surface protein 1 block 2 gene polymorphism in field isolates along the slope of mount Cameroon: a cross – sectional study , 2015, BMC Infectious Diseases.

[39]  S. Schaffner,et al.  Modeling malaria genomics reveals transmission decline and rebound in Senegal , 2015, Proceedings of the National Academy of Sciences.

[40]  Pardis C Sabeti,et al.  COIL: a methodology for evaluating malarial complexity of infection using likelihood from single nucleotide polymorphism data , 2015, Malaria Journal.

[41]  Samuel A. Assefa,et al.  estMOI: estimating multiplicity of infection using parasite deep sequencing data , 2014, Bioinform..

[42]  Joel C. Mouatcho,et al.  Malaria rapid diagnostic tests: challenges and prospects. , 2013, Journal of medical microbiology.

[43]  P. Siba,et al.  High levels of genetic diversity of Plasmodium falciparum populations in Papua New Guinea despite variable infection prevalence. , 2013, The American journal of tropical medicine and hygiene.

[44]  Andre Franke,et al.  Genome-wide association study indicates two novel resistance loci for severe malaria , 2012, Nature.

[45]  D. Kwiatkowski,et al.  Characterization of Within-Host Plasmodium falciparum Diversity Using Next-Generation Sequence Data , 2012, PloS one.

[46]  H. Al-Mekhlafi,et al.  Genetic diversity of Plasmodium falciparum isolates from Pahang, Malaysia based on MSP-1 and MSP-2 genes , 2011, Parasites & Vectors.

[47]  M. Mota,et al.  Superinfection in malaria: Plasmodium shows its iron will , 2011, EMBO reports.

[48]  C. Newbold,et al.  Host-mediated regulation of superinfection in malaria , 2011, Nature Medicine.

[49]  T. K. Barik,et al.  Malaria vector control: from past to future , 2011, Parasitology Research.

[50]  C. Nsanzabana,et al.  A comparison of methods to detect and quantify the markers of antimalarial drug resistance. , 2010, The American journal of tropical medicine and hygiene.

[51]  Aldina Barral,et al.  Towards a precise test for malaria diagnosis in the Brazilian Amazon: comparison among field microscopy, a rapid diagnostic test, nested PCR, and a computational expert system based on artificial neural networks , 2010, Malaria Journal.

[52]  C. Wongsrichanalai,et al.  Drug-Resistant Malaria: The Era of ACT , 2010, Current infectious disease reports.

[53]  W. Takken,et al.  Malaria vector control: current and future strategies. , 2009, Trends in parasitology.

[54]  N. White,et al.  Qinghaosu (Artemisinin): The Price of Success , 2008, Science.

[55]  Andrew J Tatem,et al.  Assembling a global database of malaria parasite prevalence for the Malaria Atlas Project , 2007, Malaria Journal.

[56]  S. Hay,et al.  The Malaria Atlas Project: Developing Global Maps of Malaria Risk , 2006, PLoS medicine.

[57]  T. Mutabingwa,et al.  Artemisinin-based combination therapies (ACTs): best hope for malaria treatment but inaccessible to the needy! , 2005, Acta tropica.

[58]  C. Wongsrichanalai,et al.  Rapid diagnostic testing for malaria , 2003, Tropical medicine & international health : TM & IH.

[59]  Thierry Bardini Bootstrapping , 2000 .

[60]  H. Contamin,et al.  Plasmodium falciparum: a comparative analysis of the genetic diversity in malaria-mesoendemic areas of Brazil and Madagascar , 2000, Parasitology Research.

[61]  G. Snounou,et al.  The use of PCR genotyping in the assessment of recrudescence or reinfection after antimalarial drug treatment. , 1998, Parasitology today.

[62]  D B Rubin,et al.  Markov chain Monte Carlo methods in biostatistics , 1996, Statistical methods in medical research.

[63]  B. Efron,et al.  Bootstrap confidence intervals , 1996 .

[64]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[65]  E. Ziegel,et al.  Bootstrapping: A Nonparametric Approach to Statistical Inference , 1993 .

[66]  J. Warden The price of success , 1988, British medical journal.

[67]  G. Barbujani,et al.  Autocorrelation of gene frequencies under isolation by distance. , 1987, Genetics.

[68]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[69]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[70]  C. Rogier,et al.  The epidemiology of multiple Plasmodium falciparum infections. 5. Variation of Plasmodium falciparum msp1 block 2 and msp2 allele prevalence and of infection complexity in two neighbouring Senegalese villages with different transmission conditions , 1999 .

[71]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[72]  S. Paul A three-parameter generalization of the binomial distribution , 1985 .

[73]  L. Varga Bootstrap methods and their applications , 2022 .