COIL: a methodology for evaluating malarial complexity of infection using likelihood from single nucleotide polymorphism data

BackgroundComplex malaria infections are defined as those containing more than one genetically distinct lineage of Plasmodium parasite. Complexity of infection (COI) is a useful parameter to estimate from patient blood samples because it is associated with clinical outcome, epidemiology and disease transmission rate. This manuscript describes a method for estimating COI using likelihood, called COIL, from a panel of bi-allelic genotyping assays.MethodsCOIL assumes that distinct parasite lineages in complex infections are unrelated and that genotyped loci do not exhibit significant linkage disequilibrium. Using the population minor allele frequency (MAF) of the genotyped loci, COIL uses the binomial distribution to estimate the likelihood of a COI level given the prevalence of observed monomorphic or polymorphic genotypes within each sample.ResultsCOIL reliably estimates COI up to a level of three or five with at least 24 or 96 unlinked genotyped loci, respectively, as determined by in silico simulation and empirical validation. Evaluation of COI levels greater than five in patient samples may require a very large collection of genotype data, making sequencing a more cost-effective approach for evaluating COI under conditions when disease transmission is extremely high. Performance of the method is positively correlated with the MAF of the genotyped loci. COI estimates from existing SNP genotype datasets create a more detailed portrait of disease than analyses based simply on the number of polymorphic genotypes observed within samples.ConclusionsThe capacity to reliably estimate COI from a genome-wide panel of SNP genotypes provides a potentially more accurate alternative to methods relying on PCR amplification of a small number of loci for estimating COI. This approach will also increase the number of applications of SNP genotype data, providing additional motivation to employ SNP barcodes for studies of disease epidemiology or control measure efficacy. The COIL program is available for download from GitHub, and users may also upload their SNP genotype data to a web interface for simple and efficient determination of sample COI.

[1]  C. Rogier,et al.  Age-dependent carriage of multiple Plasmodium falciparum merozoite surface antigen-2 alleles in asymptomatic malaria infections. , 1995, American Journal of Tropical Medicine and Hygiene.

[2]  W. G. Hill,et al.  Estimation of numbers of malaria clones in blood samples , 1995, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[3]  H. Contamin,et al.  PCR typing of field isolates of Plasmodium falciparum , 1995, Journal of clinical microbiology.

[4]  M. Tanner,et al.  Analysis of multiple Plasmodium falciparum infections in Tanzanian children during the phase III trial of the malaria vaccine SPf66. , 1997, The Journal of infectious diseases.

[5]  J C Reeder,et al.  Reduced risk of clinical malaria in children infected with multiple clones of Plasmodium falciparum in a highly endemic area: a prospective community study. , 1997, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[6]  O. Kaneko,et al.  A PCR method for molecular epidemiology of Plasmodium falciparum Msp-1. , 1998, The Tokai Journal of Experimental and Clinical Medicine.

[7]  H. Babiker Unstable malaria in Sudan: the influence of the dry season. Plasmodium falciparum population in the unstable malaria area of eastern Sudan is stable and genetically complex. , 1998, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[8]  D. Arnot,et al.  Unstable malaria in Sudan: the influence of the dry season. Clone multiplicity of Plasmodium falciparum infections in individuals exposed to variable levels of disease transmission. , 1998, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[9]  G. Langsley,et al.  A primary malarial infection is composed of a very wide range of genetically diverse but related parasites. , 1998, The Journal of clinical investigation.

[10]  M Tanner,et al.  Age dependence of the multiplicity of Plasmodium falciparum infections and of other malariological indices in an area of high endemicity. , 1999, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[11]  Svensson,et al.  Complexity of Plasmodium falciparum infections is consistent over time and protects against clinical disease in Tanzanian children. , 1999, The Journal of infectious diseases.

[12]  Thomas A. Smith,et al.  6. Multiple Plasmodium falciparum infections in Tanzanian infants , 1999 .

[13]  M Tanner,et al.  Multiple Plasmodium falciparum infections in Tanzanian infants. , 1999, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[14]  O. Branch,et al.  Plasmodium falciparum Genotypes, Low Complexity of Infection, and Resistance to Subsequent Malaria in Participants in the Asembo Bay Cohort Project , 2001, Infection and Immunity.

[15]  T. Smith,et al.  Prospective risk of morbidity in relation to multiplicity of infection with Plasmodium falciparum in São Tomé. , 2001, Acta tropica.

[16]  M. D. Wilson,et al.  Novel Plasmodium falciparum clones and rising clone multiplicities are associated with the increase in malaria morbidity in Ghanaian children during the transition into the high transmission season , 2001, Parasitology.

[17]  A. Guerra-Neira,et al.  Plasmodium diversity in non-malaria individuals from the Bioko Island in Equatorial Guinea (West Central-Africa) , 2006, International journal of health geographics.

[18]  D. Conway,et al.  RTS,S/AS02A Malaria Vaccine Does Not Induce Parasite CSP T Cell Epitope Selection and Reduces Multiplicity of Infection , 2006, PLoS clinical trials.

[19]  André Garcia,et al.  Multiplicity of Plasmodium falciparum infection in asymptomatic children in Senegal: relation to transmission, age and erythrocyte variants , 2008, Malaria Journal.

[20]  Pardis C Sabeti,et al.  A genome-wide map of diversity in Plasmodium falciparum , 2007, Nature Genetics.

[21]  Pardis C Sabeti,et al.  A general SNP-based molecular barcode for Plasmodium falciparum identification and tracking , 2008 .

[22]  Juliana M. Sá,et al.  Single-nucleotide polymorphism, linkage disequilibrium and geographic structure in the malaria parasite Plasmodium vivax: prospects for genome-wide association studies , 2010, BMC Genetics.

[23]  P. Deloukas,et al.  Population Genetic Analysis of Plasmodium falciparum Parasites Using a Customized Illumina GoldenGate Genotyping Assay , 2011, PloS one.

[24]  F. Nosten,et al.  Close kinship within multiple-genotype malaria parasite infections , 2012, Proceedings of the Royal Society B: Biological Sciences.

[25]  Diego F. Echeverry,et al.  Long term persistence of clonal malaria parasite Plasmodium falciparum lineages in the Colombian Pacific region , 2013, BMC Genetics.

[26]  Taane G. Clark,et al.  Characterization of Within-Host Plasmodium falciparum Diversity Using Next-Generation Sequence Data , 2012, PloS one.

[27]  François Nosten,et al.  Population genetic correlates of declining transmission in a human pathogen , 2012, Molecular ecology.

[28]  Pardis C. Sabeti,et al.  Genetic Surveillance Detects Both Clonal and Epidemic Transmission of Malaria following Enhanced Intervention in Senegal , 2013, PloS one.

[29]  M. Wahlgren,et al.  Genetic diversity of Plasmodium falciparum infections in mild and severe malaria of children from Kampala, Uganda , 2013, Parasitology Research.

[30]  D. Serre,et al.  Single-cell genomics for dissection of complex malaria infections , 2014, Genome research.

[31]  Samuel A. Assefa,et al.  estMOI: estimating multiplicity of infection using parasite deep sequencing data , 2014, Bioinform..

[32]  Samuel A. Assefa,et al.  A barcode of organellar genome polymorphisms identifies the geographic origin of Plasmodium falciparum strains , 2014, Nature Communications.