Machine learning enhances prediction of plants as potential sources of antimalarials

Plants are a rich source of bioactive compounds, and a number of plant-derived antiplasmodial compounds have been developed into pharmaceutical drugs for the prevention and treatment of malaria, a major public health challenge. However, identifying plants with antiplasmodial potential can be time-consuming and costly. One approach to selecting plants for investigation is based on ethnobotanical knowledge which, although it has yielded some major successes, covers only a relatively small group of plant species. Machine learning that incorporates ethnobotanical and plant trait data offers a promising way to improve the identification of antiplasmodial plants and accelerate the search for new plant-derived antiplasmodial compounds. In this paper we present a novel dataset on antiplasmodial activity for three flowering plant families, Apocynaceae, Loganiaceae and Rubiaceae (together comprising c. 21,100 species), and demonstrate the ability of machine learning algorithms to predict the antiplasmodial potential of plant species. We evaluate the predictive capability of a variety of algorithms (Support Vector Machines, Logistic Regression, Gradient Boosted Trees and Bayesian Neural Networks) and compare these to two ethnobotanical selection approaches, one based on usage as an antimalarial and one based on general usage as a medicine. We evaluate the approaches both on the data as given and after reweighting the samples to correct for sampling biases. In both evaluation settings, each of the machine learning models has a higher precision than the ethnobotanical approaches. In the bias-corrected scenario, the Support Vector classifier performs best, attaining a mean precision of 0.67 compared with a mean precision of 0.46 for the best-performing ethnobotanical approach. We also use the bias correction method and the Support Vector classifier to estimate the potential of plants to provide novel antiplasmodial compounds. We estimate that 7,677 species in Apocynaceae, Loganiaceae and Rubiaceae warrant further investigation and that at least 1,300 active antiplasmodial species are highly unlikely to be investigated by conventional approaches. While traditional and Indigenous knowledge remains vital to our understanding of people-plant relationships and is an invaluable source of information, these results indicate a vast and relatively untapped source in the search for new plant-derived antiplasmodial compounds.
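To illustrate what evaluating precision on reweighted samples can look like, the following is a minimal sketch only. It assumes inverse-probability weighting, one common form of sample selection bias correction; the abstract does not specify the method actually used. The function name `weighted_precision`, the toy labels and the `selection_prob` values are all hypothetical.

```python
def weighted_precision(y_true, y_pred, weights):
    """Precision in which each sample counts in proportion to its weight."""
    # Weighted true positives: predicted active and actually active.
    tp = sum(w for t, p, w in zip(y_true, y_pred, weights) if p == 1 and t == 1)
    # Weighted false positives: predicted active but actually inactive.
    fp = sum(w for t, p, w in zip(y_true, y_pred, weights) if p == 1 and t == 0)
    if tp + fp == 0:
        return float("nan")  # no positive predictions, precision undefined
    return tp / (tp + fp)


# Hypothetical toy data: 1 = active, 0 = inactive.
y_true = [1, 1, 0, 0, 1]
y_pred = [1, 1, 1, 0, 0]

# Assumed probability that each species was selected for testing;
# well-studied species get high values, neglected ones low values.
selection_prob = [0.8, 0.8, 0.2, 0.5, 0.5]

# Inverse-probability weights up-weight under-sampled species.
weights = [1.0 / p for p in selection_prob]

unweighted = weighted_precision(y_true, y_pred, [1.0] * len(y_true))
corrected = weighted_precision(y_true, y_pred, weights)
print(f"unweighted precision: {unweighted:.3f}")
print(f"bias-corrected precision: {corrected:.3f}")
```

In this toy case the single false positive comes from a rarely sampled species, so it is up-weighted and the bias-corrected precision falls below the unweighted value. This shows why the two evaluation settings can rank selection approaches differently.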
