Study of Chromatographic Retention of Natural Terpenoids by Chemoinformatic Tools

The study of chromatographic retention of natural products can be used to increase their identification speed in complex biological matrices. In this work, six variables were used to study the retention behavior in reversed phase liquid chromatography of 39 sesquiterpene lactones (SL) from an in-house database using chemoinformatics tools. To evaluate the retention of the SL, retention parameters on an ODS C-18 column in two different solvent systems were experimentally obtained, namely, MeOH-H2O 55:45 and MeCN-H2O 35:75. The chemoinformatics approach involved three descriptor type sets (one 2D and two 3D) comprising three groups of each (four, five, and six descriptors), two different training and test sets, four algorithms for variable selection (best first, linear forward, greedy stepwise, and genetic algorithm), and two modeling methods (partial least-squares regression and back-propagation artificial neural network). The influence of the six variables used in this study was assessed in a holistic context, and influences on the best model for each solvent system were analyzed. The best set for MeOH-H2O showed acceptable correlation statistics with training R(2) = 0.91, cross-validation Q(2) = 0.88, and external validation P(2) = 0.80, and the best MeCN-H2O model showed much higher correlation statistics with training R(2) = 0.96, cross-validation Q(2) = 0.92, and external validation P(2) = 0.91. Consensus models were built for each chromatographic system, and although all of them showed an improved statistical performance, only one for the MeCN-H2O system was able to separate isomers as well as to improve the performance. The approach described herein can therefore be used to generate reproducible and robust models for QSRR studies of natural products as well as an aid for dereplication of complex biological matrices using plant metabolomics-based techniques.

[1]  B. Widom Statistical Mechanics: A Concise Introduction for Chemists , 2002 .

[2]  A. A. D’Archivio,et al.  Quantitative structure-retention relationships of pesticides in reversed-phase high-performance liquid chromatography. , 2007, Analytica chimica acta.

[3]  Michael H. Abraham,et al.  Scales of solute hydrogen-bonding: their construction and application to physicochemical and biochemical processes , 2010 .

[4]  Oliver Fiehn,et al.  Applying in-silico retention index and mass spectra matching for identification of unknown metabolites in accurate mass GC-TOF mass spectrometry. , 2011, Analytical chemistry.

[5]  K. Klempnauer,et al.  Natural sesquiterpene lactones as inhibitors of Myb-dependent gene expression: structure-activity relationships. , 2013, European journal of medicinal chemistry.

[6]  Emmanuel Mikros,et al.  Recent advances and new strategies in the NMR-based identification of natural products. , 2014, Current opinion in biotechnology.

[7]  C. Wagstaff,et al.  Sesquiterpenoids Lactones: Benefits to Plants and People , 2013, International journal of molecular sciences.

[8]  Fozia Batool,et al.  Predicting Retention Times of Naturally Occurring Phenolic Compounds in Reversed-Phase Liquid Chromatography: A Quantitative Structure-Retention Relationship (QSRR) Approach , 2012, International journal of molecular sciences.

[9]  Marcelo J. P. Ferreira,et al.  The application of Bayes' theorem in natural products as a guide for skeletons identification , 1998 .

[10]  H. Noorizadeh,et al.  QSRR-based estimation of the retention time of opiate and sedative drugs by comprehensive two-dimensional gas chromatography , 2012, Medicinal Chemistry Research.

[11]  Roger G. Linington,et al.  Molecular networking as a dereplication strategy. , 2013, Journal of natural products.

[12]  L. A. Smith,et al.  Feature Subset Selection: A Correlation Based Filter Approach , 1997, ICONIP.

[13]  Johann Gasteiger,et al.  Chemoinformatics - An Important Scientific Discipline , 2006 .

[14]  Kazuki Saito,et al.  Metabolomics for unknown plant metabolites , 2013, Analytical and Bioanalytical Chemistry.

[15]  S C Basak,et al.  Comparative study of lipophilicity versus topological molecular descriptors in biological correlations. , 1984, Journal of pharmaceutical sciences.

[16]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[17]  Z. Herceg,et al.  Parthenolide: from plant shoots to cancer roots. , 2013, Drug discovery today.

[18]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[19]  P. Carrupt,et al.  Molecular fields in quantitative structure–permeation relationships: the VolSurf approach , 2000 .

[20]  John W. Dolan,et al.  Introduction to modern liquid chromatography , 1974 .

[21]  B. Rocha,et al.  Ethnobotany, Chemistry, and Biological Activities of the Genus Tithonia (Asteraceae) , 2012, Chemistry & biodiversity.

[22]  BioChem Press,et al.  Introduction of Extended Topochemical Atom (ETA) Indices in the Valence Electron Mobile (VEM) Environment as Tools for QSAR/QSPR Studies # , 2003 .

[23]  E. Want,et al.  Liquid chromatography-mass spectrometry based global metabolite profiling: a review. , 2012, Analytica chimica acta.

[24]  Károly Héberger,et al.  Ranking and similarity for quantitative structure-retention relationship models in predicting Lee retention indices of polycyclic aromatic hydrocarbons. , 2012, Analytica chimica acta.

[25]  F. Costa,et al.  Topical anti-inflammatory activity of yacon leaf extracts , 2013 .

[26]  P. Fernandes,et al.  Analysis of van der Waals surface area properties for human ether-a-go-go-related gene blocking activity: computational study on structurally diverse compounds , 2012, SAR and QSAR in environmental research.

[27]  S. Gibbons,et al.  Sesquiterpenes from Warburgia ugandensis and their antimycobacterial activity. , 2005, Phytochemistry.

[28]  Paola Gramatica,et al.  Statistical external validation and consensus modeling: a QSPR case study for Koc prediction. , 2007, Journal of molecular graphics & modelling.

[29]  Nicolas Foloppe,et al.  Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening , 2008, J. Chem. Inf. Model..

[30]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[31]  Alexander Tropsha,et al.  Quantitative structure-activity relationship modeling of rat acute toxicity by oral exposure. , 2009, Chemical research in toxicology.

[32]  O. Fiehn,et al.  FiehnLib: mass spectral and retention index libraries for metabolomics based on quadrupole and time-of-flight gas chromatography/mass spectrometry. , 2009, Analytical chemistry.

[33]  Synthesis and QSRR Study for a Series of Phosphoramidic Acid Derivatives , 2013 .

[34]  Irini Doytchinova,et al.  Quantitative structure--plasma protein binding relationships of acidic drugs. , 2012, Journal of pharmaceutical sciences.

[35]  J. Heilmann,et al.  Quantitative Structure-Cytotoxicity Relationships of Sesquiterpene Lactones derived from partial charge (Q)-based fractional Accessible Surface Area Descriptors (Q_frASAs) , 2002 .

[36]  A. Hensel,et al.  An unusual dimeric guaianolide with antiprotozoal activity and further sesquiterpene lactones from Eupatoriumperfoliatum. , 2011, Phytochemistry.

[37]  F Carnevale Neto,et al.  Interval multivariate curve resolution in the dereplication of HPLC-DAD data from Jatropha gossypifolia. , 2013, Phytochemical analysis : PCA.

[38]  K. Roy,et al.  QSPR with extended topochemical atom (ETA) indices, 3: Modeling of critical micelle concentration of cationic surfactants , 2012 .

[39]  Dimitar Hristozov,et al.  Sesquiterpene Lactones-Based Classification of the Family Asteraceae Using Neural Networks and k-Nearest Neighbors , 2007, J. Chem. Inf. Model..

[40]  Igor V. Tetko,et al.  Combinatorial QSAR Modeling of Chemical Toxicants Tested against Tetrahymena pyriformis , 2008, J. Chem. Inf. Model..

[41]  Han Zuilhof,et al.  Rapid control of Chinese star anise fruits and teas for neurotoxic anisatin by Direct Analysis in Real Time high resolution mass spectrometry. , 2012, Journal of chromatography. A.

[42]  J. C. Chaves,et al.  Constituents of glandular trichomes of Tithonia diversifolia: relationships to herbivory and antifeedant activity. , 2008, Phytochemistry.

[43]  B. Sepehri,et al.  Investigation of retention behavior of polychlorinated biphenyl congeners on 18 different HRGC columns using molecular surface average local ionization energy descriptors. , 2012, Journal of chromatography. A.

[44]  CHUN WEI YAP,et al.  PaDEL‐descriptor: An open source software to calculate molecular descriptors and fingerprints , 2011, J. Comput. Chem..

[45]  J. Teixidó,et al.  Quantitative structure-retention relationships applied to liquid chromatography gradient elution method for the determination of carbonyl-2,4-dinitrophenylhydrazone compounds. , 2013, Journal of chromatography. A.

[46]  Z. Ali,et al.  Drimane-type sesquiterpenes from Polygonum hydropiper. , 2011, Planta medica.

[47]  J. Schulte‐Mönting,et al.  Quantitative structure-activity relationship of sesquiterpene lactones as inhibitors of the transcription factor NF-kappaB. , 2004, Journal of medicinal chemistry.

[48]  A. Farmany,et al.  Investigation of Retention Behaviors of Essential Oils by Using QSRR , 2010 .

[49]  A. Jouyban,et al.  A principal component analysis approach for developing retention models in liquid chromatography. , 2012, Journal of chromatography. A.

[50]  F. Costa,et al.  A proposal for the quality control of Tanacetum parthenium (feverfew) and its hydroalcoholic extract , 2008 .

[51]  E. Bosch,et al.  Retention of Ionizable Compounds on HPLC. pH Scale in Methanol−Water and the pK and pH Values of Buffers , 1996 .

[52]  I. Helland ON THE STRUCTURE OF PARTIAL LEAST SQUARES REGRESSION , 1988 .

[53]  A. Skaltsounis,et al.  New Concepts, Experimental Approaches, and Dereplication Strategies for the Discovery of Novel Phytoestrogens from Natural Sources , 2013, Planta Medica.

[54]  A. Farmany,et al.  Quantitative structure-retention relationship for retention behavior of organic pollutants in textile wastewaters and landfill leachate in LC-APCI-MS , 2012, Environmental Science and Pollution Research.

[55]  H. Gali-Muhtasib,et al.  What made sesquiterpene lactones reach cancer clinical trials? , 2010, Drug discovery today.

[56]  E. Schilling,et al.  Infraspecific variation in the chemistry of glandular trichomes of two Brazilian Viguiera species (Heliantheae; Asteraceae) , 2001 .

[57]  I. Merfort,et al.  A Novel Dimeric Melampolide and Further Terpenoids from Smallanthus sonchifolius (Asteraceae) and the Inhibition of the Transcription Factor NF-κB , 2007 .

[58]  Yizeng Liang,et al.  Comparison of quantitative structure-retention relationship models on four stationary phases with different polarity for a diverse set of flavor compounds. , 2012, Journal of chromatography. A.

[59]  Johann Gasteiger,et al.  Sesquiterpene lactone-based classification of three Asteraceae tribes: a study based on self-organizing neural networks applied to chemosystematics. , 2005, Phytochemistry.

[60]  F. Costa,et al.  Guaianolides from Viguiera gardneri inhibit the transcription factor NF-κB , 2002 .

[61]  W. Setzer,et al.  The potential of secondary metabolites from plants as drugs or leads against protozoan neglected diseases - part II. , 2012, Current medicinal chemistry.

[62]  Sadegh Masoudi,et al.  A QSRR Study of Liquid Chromatography Retention Time of Pesticides using Linear and Nonlinear Chemometric Models , 2012 .

[63]  F. Seaman Sesquiterpene lactones as taxonomic characters in the asteraceae , 1982, The Botanical Review.

[64]  William N. Setzer,et al.  The Potential of Secondary Metabolites from Plants as Drugs or Leads against Protozoan Neglected Diseases—Part III: In-Silico Molecular Docking Investigations , 2016, Molecules.

[65]  O. Spring,et al.  Sesquiterpene lactones from glandular trichomes of Viguiera radula (Heliantheae; Asteraceae). , 2003, Phytochemistry.