An open dataset of Plasmodium falciparum genome variation in 7,000 worldwide samples [version 1; peer review: 2 approved]

MalariaGEN is a data-sharing network that enables groups around the world to work together on the genomic epidemiology of malaria. Here we describe a new release of curated genome variation data on 7,000 Plasmodium falciparum samples from MalariaGEN partner studies in 28 malaria-endemic countries. High-quality genotype calls on 3 million single nucleotide polymorphisms (SNPs) and short indels were produced using a standardised analysis pipeline. Copy number variants associated with drug resistance and structural variants that cause failure of rapid diagnostic tests were also analysed. Almost all samples showed genetic evidence of resistance to at least one antimalarial drug, and some samples from Southeast Asia carried markers of resistance to six commonly-used drugs. Genes expressed during the mosquito stage of the parasite life-cycle are prominent among loci that show strong geographic differentiation. By continuing to enlarge this open data resource we aim to facilitate research into the evolutionary processes affecting malaria control and to accelerate development of the surveillance toolkit required for malaria elimination. This manuscript from the MalariaGEN consortium, a data-sharing community of teams working on Plasmodium falciparum genomic epidemiology, presents the new release of curated P. falciparum genomes from isolates collected in 73 locations in Africa, Asia, South America and Oceania. Based on robust and perfectly detailed methods (ranging from the treatment of the blood samples, the DNA extraction, the Illumina and computational platforms developed to produce genome sequencing for variant discovery and genotype calling), they analyzed 7000 P. falciparum genome sequences and provided numerous exciting data. For instance, they found that variations (SNPs and indels) in P. falciparum genome affected about a quarter of the 23 Mb genome (and mostly coding regions), or that duplication genotypes are frequent around mdr1, plasmepsin2/3 and gch1, which are known to be associated with antimalarial drug resistance (including mefloquine, piperaquine and sulfadoxine/pyrimethamine). Moreover, population genetic analyses conducted on this largest available data resource, depict a comprehensive picture of P. falciparum parasite populations globally and sub populations at continental level. In the results, a large section is devoted to the description of the geographic patterns of validated molecular markers (SNPs and CNVs) associated with antimalarial drug resistance. By compiling data on all samples collected from 2002–2015, they present clear profiles of drug resistance by regional sub-populations for the most used antimalarial drugs. Finally, they reveal a global landscape regarding a major challenge for malaria elimination, that are deletions in hrp2 and 3 genes linked The analysis of whole-genome sequences obtained from Plasmodium falciparum is particularly challenging due to the presence of hypervariable regions, highly repetitive sequences, and frequent mixture of parasites due to multiple infections of the host. The authors of this study describe a curated list of over three million high-confidence polymorphisms obtained from the genome sequence analysis of more than 7000 samples of P. falciparum collected by several studies in 73 locations in Africa, Asia, South America and Oceania. This work, reporting a laudable effort to substantially enrich publicly available genome data of P. falciparum worldwide, is of paramount importance for the field. The contribution goes in line with authors' previous consortia publications, extending largely the number of available data that can be analysed via web with powerful data analysis pipelines. By providing open access to a curated list of polymorphisms based on reproducible and high-quality protocols for the sequencing and analysis of P. falciparum genomes this study is likely to decrease the difficulties that have delayed the research on genomic epidemiology and population genomics of P. falciparum . Among other advances, studies in this area are likely to have important implications for a better understanding of the evolution towards drug resistance of the different global parasite populations ultimately contributing for a better control of this devastating disease. The manuscript is very well written and clear. It presents eight genetically distinct populations of parasites each endemic to different word regions, including South America, West Africa, Central Africa, East Africa, South Asia, West Southeast Asia, East Southeast Asia and Oceania. An interesting genetic and geographic characterization of the eight parasite populations is also shown. Of note, the finding of higher within-host diversity in the parasite populations endemic to Africa, the identification of single nucleotide polymorphism with high levels of geographic differentiation, and further characterization of geographic patterns of drug resistance and polymorphisms with potential

William L. Hamilton | Diego F. Echeverry | J. Rayner | D. Kwiatkowski | D. Conway | K. Rockett | O. Branch | X. Su | R. Price | M. Manske | P. Newton | S. Jackson | A. Craig | C. Dolecek | J. Stalker | A. Nzila | T. Wellems | T. Bousema | K. Marsh | C. Drakeley | M. F. Boni | R. Pearson | M. Diakité | P. Siba | A. Oduro | A. Miles | F. Nosten | U. d’Alessandro | J. Ouédraogo | P. Bejon | I. Mueller | A. Djimde | E. Ashley | N. Day | M. Mayxay | N. White | N. Peshu | T. Hien | A. Barry | Eleanor Drury | A. Jeffreys | C. Hubbart | K. Rowlands | D. Jyothi | Cinzia Malangone | Ben Jeffery | Ian Wright | Lee Hart | K. Kluczynski | Christa Henrichs | R. Amato | S. Auburn | Olivo Miotto | Jacob Almagro-Garcia | C. Amaratunga | J. Marfurt | L. Tavul | M. Randrianarivelojosia | M. Kekre | R. Fairhurst | G. Busby | Christopher G. Jacob | D. Saunders | M. Fukuda | P. Ringwald | C. Plowe | K. Chotivanich | L. Golassa | C. Ariani | A. Ghansah | C. Hill | A. Tshefu | G. Awandare | A. Dondorp | M. Imwong | H. Noedl | D. Gamboa | S. Borrmann | A. Hodgson | D. Ishengoma | C. Sutherland | A. Claessens | P. Duffy | Vito Baraka | A. Amambua-Ngwa | Pharath Lim | S. Pukrittayakamee | O. Mokuolu | M. Onyamboko | A. Ahouidi | E. Kamau | Irene Omedo | J. Vinetz | G. Bertin | L. Amenga-Etego | Kimberly J. Johnson | W. Yavo | T. Apinjoh | B. Andagalu | V. Thathy | V. Mobegi | K. Loua | M. Kyaw | B. Denis | Thanat Chookajorn | T. Egwang | L. Ochola-Oyier | C. Kamaliddin | B. Erko | F. Verra | S. Gonçalves | Thuy-Nhien Nguyen | J. Wendler | H. Ba | Theerarat Kochakarn | Souleymane Dama | A. Konaté | Victoria Simpson | J. Montgomery | V. Andrianaranjaka | K. Oyebola | C. Lon | Tim J. C. Anderson | Shannon Takala-Harrison | Peter Bull | O. Maïga-Ascofaré | Alex Shayo | G. A. Harrison | Harold Ocholla | Mozam Ali | Abdul Faiz | L. Ruiz | Htut Ye | A. Phyo | Rintis Noviyanti | Caterina A Fanello

[1]  J. Grimsby,et al.  Detection of low-density Plasmodium falciparum infections using amplicon deep sequencing , 2019, Malaria Journal.

[2]  T. Kwan-Gett,et al.  Use cases for genetic epidemiology in malaria elimination , 2019, Malaria Journal.

[3]  J. Flowers,et al.  Origins and geographic diversification of African rice (Oryza glaberrima) , 2018, bioRxiv.

[4]  J. Bailey,et al.  Reuse of malaria rapid diagnostic tests for amplicon deep sequencing to estimate Plasmodium falciparum transmission intensity in western Uganda , 2018, Scientific Reports.

[5]  C. Severini,et al.  Identification of Plasmodium falciparum isolates lacking histidine-rich protein 2 and 3 in Eritrea. , 2017, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[6]  Joana C. Silva,et al.  Association of a Novel Mutation in the Plasmodium falciparum Chloroquine Resistance Transporter With Decreased Piperaquine Sensitivity , 2017, The Journal of infectious diseases.

[7]  J. Barnwell,et al.  Histidine-rich protein 2 (pfhrp2) and pfhrp3 gene deletions in Plasmodium falciparum isolates from select sites in Brazil and Bolivia , 2017, PloS one.

[8]  Hannah C. Slater,et al.  Pfhrp2-Deleted Plasmodium falciparum Parasites in the Democratic Republic of the Congo: A National Cross-sectional Survey , 2016, The Journal of infectious diseases.

[9]  H. Chandel,et al.  Prevalence of pfhrp2 and/or pfhrp3 Gene Deletion in Plasmodium falciparum Population in Eight Highly Endemic States in India , 2016, PloS one.

[10]  Frank H. Collins,et al.  Human malaria diagnosis using a single-step direct-PCR based on the Plasmodium cytochrome oxidase III gene , 2016, Malaria Journal.

[11]  J. Barnwell,et al.  Variation in Plasmodium falciparum Histidine-Rich Protein 2 (Pfhrp2) and Plasmodium falciparum Histidine-Rich Protein 3 (Pfhrp3) Gene Deletions in Guyana and Suriname , 2015, PloS one.

[12]  Adina Heinberg,et al.  The molecular basis of antifolate resistance in Plasmodium falciparum: looking beyond point mutations , 2015, Annals of the New York Academy of Sciences.

[13]  J. Skarbinski,et al.  Return of widespread chloroquine-sensitive Plasmodium falciparum to Malawi. , 2014, The Journal of infectious diseases.

[14]  C. Roper,et al.  Mapping 'partially resistant', 'fully resistant', and 'super resistant' malaria. , 2013, Trends in parasitology.

[15]  A. M. de Oliveira,et al.  Multiple genetic origins of histidine-rich protein 2 gene deletion in Plasmodium falciparum parasites from Peru , 2013, Scientific Reports.

[16]  R. Sauerwein,et al.  The Human Malaria Parasite Pfs47 Gene Mediates Evasion of the Mosquito Immune System , 2013, Science.

[17]  F. Nosten,et al.  pfmdr1 Amplification Is Related to Increased Plasmodium falciparum In Vitro Sensitivity to the Bisquinoline Piperaquine , 2012, Antimicrobial Agents and Chemotherapy.

[18]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[19]  O. Stine,et al.  Return of chloroquine-susceptible falciparum malaria in Malawi was a reexpansion of diverse susceptible parasites. , 2010, The Journal of infectious diseases.

[20]  M. Perkins,et al.  A Large Proportion of P. falciparum Isolates in the Amazon Region of Peru Lack pfhrp2 and pfhrp3: Implications for Malaria Rapid Diagnostic Tests , 2010, PloS one.

[21]  Toshihiro Mita,et al.  Spread and evolution of Plasmodium falciparum drug resistance. , 2009, Parasitology international.

[22]  Taane G. Clark,et al.  A global network for investigating the genomic epidemiology of malaria , 2008, Nature.

[23]  P. Nilsson,et al.  Genome wide gene amplifications and deletions in Plasmodium falciparum. , 2007, Molecular and biochemical parasitology.

[24]  F. Nosten,et al.  Recurrent gene amplification and soft selective sweeps during evolution of multidrug resistance in malaria parasites. , 2006, Molecular biology and evolution.

[25]  T. Taylor,et al.  Return of chloroquine antimalarial efficacy in Malawi. , 2006, The New England journal of medicine.

[26]  R. Price,et al.  Mefloquine resistance in Plasmodium falciparum and increased pfmdr1 gene copy number , 2004, The Lancet.

[27]  John C. Wootton,et al.  Genetic diversity and chloroquine selective sweeps in Plasmodium falciparum , 2002, Nature.

[28]  R. Sinden,et al.  CTRP is essential for mosquito infection by malaria ookinetes , 1999, The EMBO journal.

[29]  T. Triglia,et al.  Amplification of the multidrug resistance gene pfmdr1 in Plasmodium falciparum has arisen as multiple independent events , 1991, Molecular and cellular biology.

[30]  R. Konings,et al.  A novel protein antigen of the malaria parasite Plasmodium falciparum, located on the surface of gametes and sporozoites. , 1991, Molecular and biochemical parasitology.

[31]  A. Cowman,et al.  Amplification of the multidrug resistance gene in some chloroquine-resistant isolates of P. falciparum , 1989, Cell.

[32]  Diego F. Echeverry,et al.  Global sequence variation in the histidine-rich proteins 2 and 3 of Plasmodium falciparum: implications for the performance of malaria rapid diagnostic tests. , 2022 .

[33]  Claude-Alain H. Roten,et al.  Fast and accurate short read alignment with Burrows–Wheeler transform , 2009, Bioinform..