THE REAL McCOIL: A method for the concurrent estimation of the complexity of infection and SNP allele frequency for malaria parasites

As many malaria-endemic countries move towards elimination of Plasmodium falciparum, the most virulent human malaria parasite, effective tools for monitoring malaria epidemiology are urgent priorities. P. falciparum population genetic approaches offer promising tools for understanding transmission and spread of the disease, but a high prevalence of multi-clone or polygenomic infections can render estimation of even the most basic parameters, such as allele frequencies, challenging. A previous method, COIL, was developed to estimate complexity of infection (COI) from single nucleotide polymorphism (SNP) data, but relies on monogenomic infections to estimate allele frequencies or requires external allele frequency data which may not available. Estimates limited to monogenomic infections may not be representative, however, and when the average COI is high, they can be difficult or impossible to obtain. Therefore, we developed THE REAL McCOIL, Turning HEterozygous SNP data into Robust Estimates of ALelle frequency, via Markov chain Monte Carlo, and Complexity Of Infection using Likelihood, to incorporate polygenomic samples and simultaneously estimate allele frequency and COI. This approach was tested via simulations then applied to SNP data from cross-sectional surveys performed in three Ugandan sites with varying malaria transmission. We show that THE REAL McCOIL consistently outperforms COIL on simulated data, particularly when most infections are polygenomic. Using field data we show that, unlike with COIL, we can distinguish epidemiologically relevant differences in COI between and within these sites. Surprisingly, for example, we estimated high average COI in a peri-urban subregion with lower transmission intensity, suggesting that many of these cases were imported from surrounding regions with higher transmission intensity. THE REAL McCOIL therefore provides a robust tool for understanding the molecular epidemiology of malaria across transmission settings.

[1]  David S. Roos,et al.  Population Genetics, Evolutionary Genomics, and Genome-Wide Studies of Malaria: A View across the International Centers of Excellence for Malaria Research , 2015, The American journal of tropical medicine and hygiene.

[2]  H. Contamin,et al.  PCR typing of field isolates of Plasmodium falciparum , 1995, Journal of clinical microbiology.

[3]  David L Smith,et al.  A high-throughput method for quantifying alleles and haplotypes of the malaria vaccine candidate Plasmodium falciparum merozoite surface protein-1 19 kDa , 2006, Malaria Journal.

[4]  M Tanner,et al.  A prospective study of Plasmodium falciparum multiplicity of infection and morbidity in Tanzanian children. , 2004, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[5]  Matthew Stephens,et al.  USING LINEAR PREDICTORS TO IMPUTE ALLELE FREQUENCIES FROM SUMMARY OR POOLED GENOTYPE DATA. , 2010, The annals of applied statistics.

[6]  A. Guerra-Neira,et al.  Plasmodium diversity in non-malaria individuals from the Bioko Island in Equatorial Guinea (West Central-Africa) , 2006, International journal of health geographics.

[7]  G. Snounou,et al.  The use of PCR genotyping in the assessment of recrudescence or reinfection after antimalarial drug treatment. , 1998, Parasitology today.

[8]  Diego F. Echeverry,et al.  Long term persistence of clonal malaria parasite Plasmodium falciparum lineages in the Colombian Pacific region , 2013, BMC Genetics.

[9]  David Serre,et al.  Complexity of Infection and Genetic Diversity in Cambodian Plasmodium vivax , 2016, PLoS neglected tropical diseases.

[10]  M Tanner,et al.  Age dependence of the multiplicity of Plasmodium falciparum infections and of other malariological indices in an area of high endemicity. , 1999, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[11]  Caroline O. Buckee,et al.  Dissecting the determinants of malaria chronicity: why within-host models struggle to reproduce infection dynamics , 2015, Journal of The Royal Society Interface.

[12]  Alan E Hubbard,et al.  Gel versus capillary electrophoresis genotyping for categorizing treatment outcomes in two anti-malarial trials in Uganda , 2010, Malaria Journal.

[13]  Harold Jaffe,et al.  Relationship between Plasmodium falciparum malaria prevalence, genetic diversity and endemic Burkitt lymphoma in Malawi , 2014, Scientific Reports.

[14]  Thomas A. Smith,et al.  6. Multiple Plasmodium falciparum infections in Tanzanian infants , 1999 .

[15]  W. Jarra,et al.  Genotyping of Plasmodium falciparum isolates by the polymerase chain reaction and potential uses in epidemiological studies. , 1995, Bulletin of the World Health Organization.

[16]  Umberto D'Alessandro,et al.  Variation in malaria transmission intensity in seven sites throughout Uganda. , 2006, The American journal of tropical medicine and hygiene.

[17]  Bryan Greenhouse,et al.  Factors Associated with Malaria Parasitemia, Anemia and Serological Responses in a Spectrum of Epidemiological Settings in Uganda , 2015, PloS one.

[18]  Taane G. Clark,et al.  Genome-Wide Analysis of Selection on the Malaria Parasite Plasmodium falciparum in West African Populations of Differing Infection Endemicity , 2014, Molecular biology and evolution.

[19]  Hsiao-Han Chang,et al.  Clonal outbreak of Plasmodium falciparum infection in eastern Panama. , 2014, The Journal of infectious diseases.

[20]  Peter D. Crompton,et al.  Novel serologic biomarkers provide accurate estimates of recent Plasmodium falciparum exposure for individuals and communities , 2015, Proceedings of the National Academy of Sciences.

[21]  P. Ross,et al.  High level multiplex genotyping by MALDI-TOF mass spectrometry , 1998, Nature Biotechnology.

[22]  Teun Bousema,et al.  Identification of hot spots of malaria transmission for targeted malaria control. , 2010, The Journal of infectious diseases.

[23]  Andrew J Tatem,et al.  Malaria transmission, infection, and disease at three sites with varied transmission intensity in Uganda: implications for malaria control. , 2015, The American journal of tropical medicine and hygiene.

[24]  D. Conway,et al.  Molecular Epidemiology of Malaria , 2007, Clinical Microbiology Reviews.

[25]  John C. Tan,et al.  Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing , 2012, Nature.

[26]  D. Hartl,et al.  Plasmodium falciparum: worldwide sequence diversity and evolution of the malaria vaccine candidate merozoite surface protein-2 (MSP-2). , 2007, Experimental parasitology.

[27]  Andrea S Foulkes,et al.  An Expectation Maximization Approach to Estimate Malaria Haplotype Frequencies in Multiply Infected Children , 2007, Statistical applications in genetics and molecular biology.

[28]  Zamin Iqbal,et al.  Inferring Strain Mixture within Clinical Plasmodium falciparum Isolates from Genomic Sequence Data , 2016, PLoS Comput. Biol..

[29]  Taane G. Clark,et al.  Characterization of Within-Host Plasmodium falciparum Diversity Using Next-Generation Sequence Data , 2012, PloS one.

[30]  Teun Bousema,et al.  Persistence of Plasmodium falciparum parasitemia after artemisinin combination therapy: evidence from a randomized trial in Uganda , 2016, Scientific Reports.

[31]  Terrie E. Taylor,et al.  Subtle changes in Plasmodium falciparum infection complexity following enhanced intervention in Malawi , 2015, Acta tropica.

[32]  Steven R. Meshnick,et al.  Genetic Evidence of Importation of Drug-Resistant Plasmodium falciparum to Guatemala from the Democratic Republic of the Congo , 2014, Emerging infectious diseases.

[33]  François Nosten,et al.  Population genetic correlates of declining transmission in a human pathogen , 2012, Molecular ecology.

[34]  C. Rogier,et al.  Age-dependent carriage of multiple Plasmodium falciparum merozoite surface antigen-2 alleles in asymptomatic malaria infections. , 1995, American Journal of Tropical Medicine and Hygiene.

[35]  J. T. Williams,et al.  Microsatellite markers reveal a spectrum of population structures in the malaria parasite Plasmodium falciparum. , 2000, Molecular biology and evolution.

[36]  W. Pan,et al.  Evaluation of the population structure and genetic diversity of Plasmodium falciparum in southern China , 2015, Malaria Journal.

[37]  Trevor Bedford,et al.  Genetic Diversity and Protective Efficacy of the RTS,S/AS01 Malaria Vaccine. , 2015, The New England journal of medicine.

[38]  David L Smith,et al.  Estimating the annual entomological inoculation rate for Plasmodium falciparum transmitted by Anopheles gambiae s.l. using three sampling methods in three sites in Uganda , 2014, Malaria Journal.

[39]  Pardis C Sabeti,et al.  COIL: a methodology for evaluating malarial complexity of infection using likelihood from single nucleotide polymorphism data , 2015, Malaria Journal.

[40]  Pardis C Sabeti,et al.  A general SNP-based molecular barcode for Plasmodium falciparum identification and tracking , 2008 .

[41]  Samuel A. Assefa,et al.  Microsatellite genotyping and genome-wide single nucleotide polymorphism-based indices of Plasmodium falciparum diversity within clinical infections , 2016, Malaria Journal.

[42]  W H TALIAFERRO,et al.  Acquired immunity in malaria. , 1948, Abstracts. International Congress on Tropical Medicine and Malaria.

[43]  F. Ayala,et al.  Genetic polymorphism and natural selection in the malaria parasite Plasmodium falciparum. , 1998, Genetics.

[44]  Svensson,et al.  Complexity of Plasmodium falciparum infections is consistent over time and protects against clinical disease in Tanzanian children. , 1999, The Journal of infectious diseases.

[45]  David L Smith,et al.  Malaria genotyping for epidemiologic surveillance , 2015, Proceedings of the National Academy of Sciences.

[46]  Bryan Greenhouse,et al.  Estimating malaria parasite prevalence from community surveys in Uganda: a comparison of microscopy, rapid diagnostic tests and polymerase chain reaction , 2015, Malaria Journal.

[47]  Pardis C. Sabeti,et al.  Genetic Surveillance Detects Both Clonal and Epidemic Transmission of Malaria following Enhanced Intervention in Senegal , 2013, PloS one.

[48]  Hsiao-Han Chang,et al.  Genomic sequencing of Plasmodium falciparum malaria parasites from Senegal reveals the demographic history of the population. , 2012, Molecular biology and evolution.

[49]  Marcel Tanner,et al.  Prevalence and implications of multiple-strain infections. , 2011, The Lancet. Infectious diseases.

[50]  O. Branch,et al.  Plasmodium falciparum Genotypes, Low Complexity of Infection, and Resistance to Subsequent Malaria in Participants in the Asembo Bay Cohort Project , 2001, Infection and Immunity.

[51]  C. Rogier,et al.  The epidemiology of multiple Plasmodium falciparum infections. 5. Variation of Plasmodium falciparum msp1 block 2 and msp2 allele prevalence and of infection complexity in two neighbouring Senegalese villages with different transmission conditions , 1999 .

[52]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[53]  P Pamilo,et al.  On the estimation of population size from allele frequency changes. , 1980, Genetics.

[54]  Samuel A. Assefa,et al.  estMOI: estimating multiplicity of infection using parasite deep sequencing data , 2014, Bioinform..

[55]  B. Birren,et al.  Genetic Diversity and Protective Efficacy of the RTS , S / AS 01 Malaria Vaccine , 2015 .

[56]  S. Schaffner,et al.  Modeling malaria genomics reveals transmission decline and rebound in Senegal , 2015, Proceedings of the National Academy of Sciences.

[57]  Rajendra Maharaj,et al.  Operational strategies to achieve and maintain malaria elimination , 2010, The Lancet.

[58]  T. Jelínek,et al.  Genetic diversity of Plasmodium falciparum and its relationship to parasite density in an area with different malaria endemicities in West Uganda , 2001, Tropical medicine & international health : TM & IH.

[59]  M Tanner,et al.  The epidemiology of multiple Plasmodium falciparum infections. 11. Premunition in Plasmodium falciparum infection : Insights from the epidemiology of multiple infections , 1999 .

[60]  Teun Bousema,et al.  Hitting Hotspots: Spatial Targeting of Malaria for Control and Elimination , 2012, PLoS medicine.

[61]  Bryan Greenhouse,et al.  Validation of microsatellite markers for use in genotyping polyclonal Plasmodium falciparum infections. , 2006, The American journal of tropical medicine and hygiene.

[62]  Dionicia Gamboa,et al.  Malaria Molecular Epidemiology: Lessons from the International Centers of Excellence for Malaria Research Network , 2015, The American journal of tropical medicine and hygiene.

[63]  Gilean McVean,et al.  Genetic architecture of artemisinin-resistant Plasmodium falciparum , 2015, Nature Genetics.

[64]  Nicholas P. J. Day,et al.  Genomic epidemiology of artemisinin resistant malaria. , 2016, eLife.

[65]  F. Nosten,et al.  Close kinship within multiple-genotype malaria parasite infections , 2012, Proceedings of the Royal Society B: Biological Sciences.

[66]  D. Conway,et al.  Population genetic structure of Plasmodium falciparum across a region of diverse endemicity in West Africa , 2012, Malaria Journal.

[67]  Ryan J. Haasl,et al.  Multi-locus inference of population structure: a comparison between single nucleotide polymorphisms and microsatellites , 2011, Heredity.

[68]  J C Reeder,et al.  Reduced risk of clinical malaria in children infected with multiple clones of Plasmodium falciparum in a highly endemic area: a prospective community study. , 1997, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[69]  André Garcia,et al.  Multiplicity of Plasmodium falciparum infection in asymptomatic children in Senegal: relation to transmission, age and erythrocyte variants , 2008, Malaria Journal.

[70]  M. Farooq,et al.  Parasite density and the spectrum of clinical illness in falciparum malaria. , 2008, Journal of the College of Physicians and Surgeons--Pakistan : JCPSP.

[71]  M Tanner,et al.  Multiple Plasmodium falciparum infections in Tanzanian infants. , 1999, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[72]  W. Kidima,et al.  Plasmodium falciparum msp2 Genotypes and Multiplicity of Infections among Children under Five Years with Uncomplicated Malaria in Kibaha, Tanzania , 2015, Journal of parasitology research.

[73]  Kwaku Poku Asante,et al.  Trends in multiplicity of Plasmodium falciparum infections among asymptomatic residents in the middle belt of Ghana , 2013, Malaria Journal.

[74]  J. Bailey,et al.  Use of massively parallel pyrosequencing to evaluate the diversity of and selection on Plasmodium falciparum csp T-cell epitopes in Lilongwe, Malawi. , 2012, The Journal of infectious diseases.

[75]  Andrew J. Tatem,et al.  Associations between urbanicity and malaria at local scales in Uganda , 2015, Malaria Journal.

[76]  M. Dgedge,et al.  Plasmodium falciparum multiple infections in Mozambique, its relation to other malariological indices and to prospective risk of malaria morbidity , 2003, Tropical medicine & international health : TM & IH.