Classification of central nervous system agents by least squares support vector machine based on their structural descriptors: A comparative study

Abstract Linear and quadratic discriminant analysis and least squares support vector machine (LS-SVM) were used to classify a data set of 326 central nervous system (CNS) drugs as active or inactive CNS agents according to their permeation into the blood–brain barrier. A pool of descriptors was calculated by DRAGON software and nine of them were selected based on Wilk's lambda and classification accuracy and used for classification of drugs in data set. The classification models were validated based on accuracy, sensitivity, specificity, Matthew's correlation coefficient and Cohen's kappa values. The developed LS-SVM model, as the superior model has the accuracy of 96.5% and 96.0%, Matthew's correlation coefficient of 0.930 and 0.920, Cohen's kappa value of 0.963 and 0.917, and area under recursive operating characteristic curve of 0.95 and 0.98 for training and test sets, respectively. The results of this study indicated the applicability of LS-SVM in classification of CNS drugs based on their structural descriptors.

[1]  Ruisheng Zhang,et al.  Classification study of novel piperazines as antagonists for the melanocortin-4 receptor based on least-squares support vector machines , 2009 .

[2]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[3]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[4]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[5]  M. Fatemi,et al.  Classification of drugs according to their milk/plasma concentration ratio. , 2010, European journal of medicinal chemistry.

[6]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  William A Banks,et al.  Characteristics of compounds that cross the blood-brain barrier , 2009, BMC neurology.

[9]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[10]  M Gumbleton,et al.  Progress and limitations in the use of in vitro cell cultures to serve as a permeability screen for the blood-brain barrier. , 2001, Journal of pharmaceutical sciences.

[11]  H. van de Waterbeemd,et al.  ADMET in silico modelling: towards prediction paradise? , 2003, Nature reviews. Drug discovery.

[12]  L. Molnár,et al.  Recent advances in the prediction of blood-brain partitioning from molecular structure. , 2003, Journal of pharmaceutical sciences.

[13]  Thomas Hofmann,et al.  Predicting CNS Permeability of Drug Molecules: Comparison of Neural Network and Support Vector Machine Algorithms , 2002, J. Comput. Biol..

[14]  Jure Zupan,et al.  Kohonen and counterpropagation artificial neural networks in analytical chemistry , 1997 .

[15]  Károly Héberger,et al.  Classification of olive oils using high throughput flow 1H NMR fingerprinting with principal component analysis, linear discriminant analysis and probabilistic neural networks , 2005 .

[16]  Francklin Rivas,et al.  Classification of Venezuelan spirituous beverages by means of discriminant analysis and artificial neural networks based on their Zn, Cu and Fe concentrations. , 2003, Talanta.

[17]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[18]  Mohammed Hossein Fatemi,et al.  Predictions of chromatographic retention indices of alkylphenols with support vector machines and multiple linear regression. , 2009, Journal of separation science.

[19]  M. A. Cabrera Pérez,et al.  In silico prediction of central nervous system activity of compounds. Identification of potential pharmacophores by the TOPS-MODE approach. , 2004, Bioorganic & medicinal chemistry.

[20]  D. E. Clark,et al.  Prediction of intestinal absorption and blood-brain barrier penetration by computational methods. , 2001, Combinatorial chemistry & high throughput screening.

[21]  Lijuan Xie,et al.  Classification of tomatoes with different genotypes by visible and short-wave near-infrared spectroscopy with least-squares support vector machines and other chemometrics. , 2009 .

[22]  Lemont B. Kier,et al.  Modeling Blood-Brain Barrier Partitioning Using the Electrotopological State , 2002, J. Chem. Inf. Comput. Sci..

[23]  Marc Adenot,et al.  Blood-Brain Barrier Permeation Models: Discriminating between Potential CNS and Non-CNS Drugs Including P-Glycoprotein Substrates , 2004, J. Chem. Inf. Model..

[24]  U. Norinder,et al.  Computational approaches to the prediction of the blood-brain distribution. , 2002, Advanced drug delivery reviews.

[25]  Johann Gasteiger,et al.  Comparison of Different Classification Methods Applied to a Mode of Toxic Action Data Set , 2004 .

[26]  Pablo R Duchowicz,et al.  QSAR analysis for quinoxaline-2-carboxylate 1,4-di-N-oxides as anti-mycobacterial agents. , 2009, Journal of molecular graphics & modelling.

[27]  Christopher D. Brown,et al.  Receiver operating characteristics curves and related decision measures: A tutorial , 2006 .

[28]  M. Abraham,et al.  A data base for partition of volatile organic compounds and drugs from blood/plasma/serum to brain, and an LFER analysis of the data. , 2006, Journal of pharmaceutical sciences.

[29]  Johann Gasteiger,et al.  Neural networks with counter-propagation learning strategy used for modelling , 1995 .

[30]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[31]  Harpreet S. Chadha,et al.  Hydrogen bonding. 33. Factors that influence the distribution of solutes between blood and brain. , 1994, Journal of pharmaceutical sciences.

[32]  D. E. Clark,et al.  In Silico Predictions of Blood-Brain Barrier Penetration: Considerations to “Keep in Mind” , 2005, Journal of Pharmacology and Experimental Therapeutics.

[33]  David J. Begley,et al.  Potential of Immobilized Artificial Membranes for Predicting Drug Penetration Across the Blood−Brain Barrier , 1998, Pharmaceutical Research.

[34]  Ram Sagar,et al.  C-3 alkyl/arylalkyl-2,3-dideoxy hex-2-enopyranosides as antitubercular agents: synthesis, biological evaluation, and QSAR study. , 2007, Journal of medicinal chemistry.

[35]  Forbes Ad,et al.  Classification-algorithm evaluation: five performance measures based on confusion matrices. , 1995 .

[36]  J R Chretien,et al.  Estimation of blood-brain barrier crossing of drugs using molecular size and shape, and H-bonding descriptors. , 1998, Journal of drug targeting.

[37]  S. Manel,et al.  Evaluating presence-absence models in ecology: the need to account for prevalence , 2001 .

[38]  Alexandru T. Balaban,et al.  Topological indices and real number vertex invariants based on graph eigenvalues or eigenvectors , 1991, J. Chem. Inf. Comput. Sci..

[39]  Alexandre Varnek,et al.  Correlation of blood-brain penetration using structural descriptors. , 2006, Bioorganic & medicinal chemistry.

[40]  D. E. Clark,et al.  Rapid calculation of polar molecular surface area and its application to the prediction of transport phenomena. 2. Prediction of blood-brain barrier penetration. , 1999, Journal of pharmaceutical sciences.

[41]  Meihua Tu,et al.  Development of a computational approach to predict blood-brain barrier permeability. , 2004, Drug metabolism and disposition: the biological fate of chemicals.

[42]  F. Lombardo,et al.  Computation of brain-blood partitioning of organic solutes via free energy calculations. , 1996, Journal of medicinal chemistry.

[43]  Andreas Reichel,et al.  An overview of in vitro techniques for blood-brain barrier studies. , 2003, Methods in molecular medicine.

[44]  Ola Engkvist,et al.  Prediction of CNS Activity of Compound Libraries Using Substructure Analysis , 2003, J. Chem. Inf. Comput. Sci..

[45]  Ajay,et al.  Designing libraries with CNS activity. , 1999, Journal of medicinal chemistry.

[46]  Roberto Kawakami Harrop Galvão,et al.  A method for calibration and validation subset partitioning. , 2005, Talanta.

[47]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[48]  Feng Luan,et al.  Classification of the carcinogenicity of N-nitroso compounds based on support vector machines and linear discriminant analysis. , 2005, Chemical research in toxicology.

[49]  Peteris Prusis,et al.  QSAR and proteo-chemometric analysis of the interaction of a series of organic compounds with melanocortin receptor subtypes. , 2003, Journal of medicinal chemistry.

[50]  Q. Smith,et al.  A review of blood-brain barrier transport techniques. , 2003, Methods in molecular medicine.

[51]  P. Seybold,et al.  Molecular modeling of the physical properties of the alkanes , 1988 .

[52]  M. Burl,et al.  Comparison of the performance of different discriminant algorithms in analyte discrimination tasks using an array of carbon black--polymer composite vapor detectors. , 2001, Analytical chemistry.

[53]  Milan Randic,et al.  Molecular Shape Profiles , 1995, J. Chem. Inf. Comput. Sci..

[54]  Davide Ballabio,et al.  Characterization of the traditional Cypriot spirit Zivania by means of Counterpropagation Artificial Neural Networks , 2007 .

[55]  Zhi-Wei Cao,et al.  Effect of Selection of Molecular Descriptors on the Prediction of Blood-Brain Barrier Penetrating and Nonpenetrating Agents by Statistical Learning Methods , 2005, J. Chem. Inf. Model..

[56]  Philip L. Smith,et al.  In vitro models to predict blood-brain barrier permeability , 1997 .

[57]  J. Suykens,et al.  Classification of brain tumours using short echo time 1H MR spectra. , 2004, Journal of magnetic resonance.

[58]  D J Begley,et al.  Understanding and circumventing the blood‐brain barrier , 2003, Acta paediatrica (Oslo, Norway : 1992). Supplement.

[59]  Mark L. Lewis,et al.  Predicting Penetration Across the Blood-Brain Barrier from Simple Descriptors and Fragmentation Schemes , 2007, J. Chem. Inf. Model..