Prediction of factor Xa inhibitors by machine learning methods.

Factor Xa (FXa) inhibitors have been explored as anticoagulants for treatment and prevention of thrombotic diseases. Molecular docking, pharmacophore, quantitative structure-activity relationships, and support vector machines (SVM) have been used for computer prediction of FXa inhibitors. These methods achieve promising prediction accuracies of 69-80% for FXa inhibitors and 85-99% for non-inhibitors. Prediction performance, particularly for inhibitors, may be further improved by exploring methods applicable to more diverse range of compounds and by using more appropriate set of molecular descriptors. We tested the capability of several machine learning methods (C4.5 decision tree, k-nearest neighbor, probabilistic neural network, and support vector machine) by using a much more diverse set of 1098 compounds (360 inhibitors and 738 non-inhibitors) than those in other studies. A feature selection method was used for selecting molecular descriptors appropriate for distinguishing FXa inhibitors and non-inhibitors. The prediction accuracies of these methods are 89.1-97.5% for FXa inhibitors and 92.3-98.1% for non-inhibitors. In particular, compared to other studies, support vector machine gives a substantially improved accuracy of 94.6% for FXa non-inhibitors and maintains a comparable accuracy of 98.1% for inhibitors, based-on a more rigorous test with more diverse range of compounds. Our study suggests that machine learning methods such as SVM are useful for facilitating the prediction of FXa inhibitors.

[1]  J E Roulston,et al.  Screening with tumor markers , 2002, Molecular biotechnology.

[2]  Y Xue,et al.  Prediction of torsade-causing potential of drugs by support vector machine approach. , 2004, Toxicological sciences : an official journal of the Society of Toxicology.

[3]  D. Fairlie,et al.  Protease inhibitors: current status and future prospects. , 2000, Journal of medicinal chemistry.

[4]  P. Lam,et al.  Discovery of 1-[3-(aminomethyl)phenyl]-N-3-fluoro-2'-(methylsulfonyl)-[1,1'-biphenyl]-4-yl]-3-(trifluoromethyl)-1H-pyrazole-5-carboxamide (DPC423), a highly potent, selective, and orally bioavailable inhibitor of blood coagulation factor Xa. , 2001, Journal of medicinal chemistry.

[5]  J. A. Kyle,et al.  N(2)-Aroylanthranilamide inhibitors of human factor Xa. , 2000, Journal of medicinal chemistry.

[6]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[7]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[8]  J. A. Kyle,et al.  Structure-based design of potent, amidine-derived inhibitors of factor Xa: evaluation of selectivity, anticoagulant activity, and antithrombotic activity. , 2000, Journal of Medicinal Chemistry.

[9]  H Martin,et al.  The design of phenylglycine containing benzamidine carboxamides as potent and selective inhibitors of factor Xa. , 2001, Bioorganic & medicinal chemistry letters.

[10]  D M Wallace,et al.  The Efficacy and Safety of Oral Anticoagulation in Patients with Cancer , 1995, Thrombosis and Haemostasis.

[11]  C. Dominguez,et al.  The de novo design and synthesis of cyclic urea inhibitors of factor Xa: initial SAR studies. , 1998, Bioorganic & medicinal chemistry letters.

[12]  Wei Xu,et al.  Design, synthesis, and activity of a novel series of factor Xa inhibitors: optimization of arylamidine groups. , 2002, Journal of medicinal chemistry.

[13]  Thomas Hofmann,et al.  Predicting CNS Permeability of Drug Molecules: Comparison of Neural Network and Support Vector Machine Algorithms , 2002, J. Comput. Biol..

[14]  P. Sprengeler,et al.  Neutral inhibitors of the serine protease factor Xa. , 2001, Bioorganic & medicinal chemistry letters.

[15]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[16]  R. Knabb,et al.  Preparation of meta-amidino-N,N-disubstituted anilines as potent inhibitors of coagulation factor Xa. , 1998, Bioorganic & Medicinal Chemistry Letters.

[17]  Takako Aizawa,et al.  Quantitative structure-activity relationships for estrogen receptor binding affinity of phenolic chemicals. , 2003, Water research.

[18]  D L Cheney,et al.  Design and structure-activity relationships of potent and selective inhibitors of blood coagulation factor Xa. , 1999, Journal of medicinal chemistry.

[19]  A. Olson,et al.  Modelling of Factor Xa‐inhibitor complexes: a computational flexible docking approach , 1999, Proteins.

[20]  M. Whitlow,et al.  Discovery of N-[2-[5-[Amino(imino)methyl]-2-hydroxyphenoxy]-3, 5-difluoro-6-[3-(4, 5-dihydro-1-methyl-1H-imidazol-2-yl)phenoxy]pyridin-4-yl]-N-methylgl y cine (ZK-807834): a potent, selective, and orally active inhibitor of the blood coagulation enzyme factor Xa. , 1998, Journal of medicinal chemistry.

[21]  Angela Smallwood,et al.  SAR and factor IXa crystal structure of a dual inhibitor of factors IXa and Xa. , 2004, Bioorganic & medicinal chemistry letters.

[22]  K. Ishikawa,et al.  Rational design, synthesis, and structure-activity relationships of novel factor Xa inhibitors: (2-substituted-4-amidinophenyl)pyruvic and -propionic acids. , 2003, Journal of medicinal chemistry.

[23]  R. Huber,et al.  Structural and functional analyses of benzamidine-based inhibitors in complex with trypsin: implications for the inhibition of factor Xa, tPA, and urokinase. , 1998, Journal of medicinal chemistry.

[24]  C. Dunwiddie,et al.  Optimization of the beta-aminoester class of factor Xa inhibitors. Part 1: P(4) and side-chain modifications for improved in vitro potency. , 2002, Bioorganic & medicinal chemistry letters.

[25]  Sayan Mukherjee,et al.  Choosing Multiple Parameters for Support Vector Machines , 2002, Machine Learning.

[26]  Z. R. Li,et al.  Prediction of estrogen receptor agonists and characterization of associated molecular descriptors by statistical learning methods. , 2006, Journal of molecular graphics & modelling.

[27]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[28]  Xin Chen,et al.  Effect of Molecular Descriptor Feature Selection in Support Vector Machine Classification of Pharmacokinetic and Toxicological Properties of Chemical Agents , 2004, J. Chem. Inf. Model..

[29]  R. Knabb,et al.  Nonpeptide factor Xa inhibitors: I. Studies with SF303 and SK549, a new class of potent antithrombotics. , 2000, The Journal of pharmacology and experimental therapeutics.

[30]  R. Knabb,et al.  Isoxazolines and isoxazoles as factor Xa inhibitors. , 2000, Bioorganic & Medicinal Chemistry Letters.

[31]  J. F. Wang,et al.  Prediction of P-Glycoprotein Substrates by a Support Vector Machine Approach , 2004, J. Chem. Inf. Model..

[32]  M. Myers,et al.  Nonbenzamidine compounds as selective factor Xa inhibitors. , 2000, Bioorganic & medicinal chemistry letters.

[33]  Hans Matter,et al.  Design and quantitative structure-activity relationship of 3-amidinobenzyl-1H-indole-2-carboxamides as potent, nonchiral, and selective inhibitors of blood coagulation factor Xa. , 2002, Journal of medicinal chemistry.

[34]  R. Bentley,et al.  Synthesis, SAR and in vivo activity of novel thienopyridine sulfonamide pyrrolidinones as factor Xa inhibitors. , 1999, Bioorganic & medicinal chemistry letters.

[35]  Andrew C. Good,et al.  Measuring CAMD technique performance: A virtual screening case study in the design of validation experiments , 2004, J. Comput. Aided Mol. Des..

[36]  M. Taha,et al.  Ligand-based assessment of factor Xa binding site flexibility via elaborate pharmacophore exploration and genetic algorithm-based QSAR modeling. , 2005, European journal of medicinal chemistry.

[37]  Y. Z. Chen,et al.  Quantitative Structure-Pharmacokinetic Relationships for drug distribution properties by using general regression neural network. , 2005, Journal of pharmaceutical sciences.

[38]  F. Sanz,et al.  Anchor-GRIND: filling the gap between standard 3D QSAR and the GRid-INdependent descriptors. , 2005 .

[39]  J. Maraganore,et al.  Thrombin-specific inhibition by and slow cleavage of hirulog-1. , 1992, The Biochemical journal.

[40]  Cesare Furlanello,et al.  An accelerated procedure for recursive feature ranking on microarray data , 2003, Neural Networks.

[41]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[42]  Y. Ru,et al.  Rational design and synthesis of novel, potent bis-phenylamidine carboxylate factor Xa inhibitors. , 1998, Journal of medicinal chemistry.

[43]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[44]  K. Grandien,et al.  Printed in U.S.A. Copyright © 1997 by The Endocrine Society Comparison of the Ligand Binding Specificity and Transcript Tissue Distribution of Estrogen Receptors � and � , 2022 .

[45]  M. Morrissey,et al.  Design, synthesis, and biological activity of novel purine and bicyclic pyrimidine factor Xa inhibitors. , 1998, Bioorganic & medicinal chemistry letters.

[46]  Thierry Langer,et al.  Pharmacophore Identification, in Silico Screening, and Virtual Library Design for Inhibitors of the Human Factor Xa , 2005, J. Chem. Inf. Model..

[47]  A. Spada,et al.  Amido-(propyl and allyl)-hydroxybenzamidines: development of achiral inhibitors of factor Xa. , 2000, Bioorganic & medicinal chemistry letters.

[48]  Weida Tong,et al.  Phytoestrogens and mycoestrogens bind to the rat uterine estrogen receptor. , 2002, The Journal of nutrition.

[49]  Brian Carnahan,et al.  Comparing Statistical and Machine Learning Classifiers: Alternatives for Predictive Modeling in Human Factors Research , 2003, Hum. Factors.

[50]  Juan J Perez,et al.  Managing molecular diversity. , 2005, Chemical Society reviews.

[51]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[52]  K. Roy,et al.  QSAR of human factor Xa inhibitor N2-aroylanthranilamides using principal component factor analysis. , 2002, Drug design and discovery.

[53]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[54]  Marta Murcia,et al.  Virtual screening with flexible docking and COMBINE-based models. Application to a series of factor Xa inhibitors. , 2004, Journal of medicinal chemistry.

[55]  Yu Zong Chen,et al.  Prediction of Cytochrome P450 3A4, 2D6, and 2C9 Inhibitors and Substrates by Using Support Vector Machines , 2005, J. Chem. Inf. Model..

[56]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[57]  Z R Li,et al.  Prediction of genotoxicity of chemical compounds by statistical learning methods. , 2005, Chemical research in toxicology.

[58]  Bernard F. Buxton,et al.  Support Vector Machines in Combinatorial Chemistry , 2001 .

[59]  Sulfonamidopyrrolidinone factor Xa inhibitors: potency and selectivity enhancements via P-1 and P-4 optimization. , 1999 .

[60]  Michael K. Gilson,et al.  Virtual Screening of Molecular Databases Using a Support Vector Machine , 2005, J. Chem. Inf. Model..

[61]  Y. Matsumoto,et al.  Antithrombotic effect of YM-75466 is separated from its effect on bleeding time and coagulation time. , 1998, European journal of pharmacology.

[62]  B. Furie,et al.  The molecular basis of blood coagulation , 1988, Cell.

[63]  Peter C Jurs,et al.  Predicting the genotoxicity of polycyclic aromatic compounds from molecular structure with different classifiers. , 2003, Chemical research in toxicology.

[64]  Bernard F. Buxton,et al.  Drug Design by Machine Learning: Support Vector Machines for Pharmaceutical Data Analysis , 2001, Comput. Chem..

[65]  M. Whitlow,et al.  Design, synthesis, and in vitro biological activity of indole-based factor Xa inhibitors. , 2000, Bioorganic & medicinal chemistry letters.

[66]  D. Case,et al.  Design and synthesis of highly constrained factor Xa inhibitors: amidine-substituted bis(benzoyl)--diazepan-2-ones and bis(benzylidene)-bis(gem-dimethyl)cycloketones. , 2003, Bioorganic & medicinal chemistry.

[67]  Hans Matter,et al.  Probing the subpockets of factor Xa reveals two binding modes for inhibitors based on a 2-carboxyindole scaffold: a study combining structure-activity relationship and X-ray crystallography. , 2005, Journal of medicinal chemistry.

[68]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[69]  D. R. Holland,et al.  Design, synthesis, and biological activity of potent and selective inhibitors of blood coagulation factor Xa. , 2004, Journal of medicinal chemistry.

[70]  Igor V. Pletnev,et al.  Drug Discovery Using Support Vector Machines. The Case Studies of Drug-likeness, Agrochemical-likeness, and Enzyme Inhibition Predictions , 2003, J. Chem. Inf. Comput. Sci..

[71]  K. Eagen,et al.  Design, synthesis, and biological activity of novel factor Xa inhibitors: 4-aryloxy substituents of 2,6-diphenoxypyridines. , 2002, Bioorganic & medicinal chemistry.

[72]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[73]  M. Whitlow,et al.  Synthesis, characterization, and structure-activity relationships of amidine-substituted (bis)benzylidene-cycloketone olefin isomers as potent and selective factor Xa inhibitors. , 1999, Journal of medicinal chemistry.

[74]  H. Yu,et al.  Discovering compact and highly discriminative features or combinations of drug activities using support vector machines , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[75]  M. Whitlow,et al.  Design, synthesis, and in vitro biological activity of benzimidazole based factor Xa inhibitors. , 2000, Bioorganic & medicinal chemistry letters.

[76]  N. Haginoya,et al.  Design, synthesis, and biological activity of non-amidine factor Xa inhibitors containing pyridine N-oxide and 2-carbamoylthiazole units. , 2004, Bioorganic & medicinal chemistry.

[77]  D. Hadjipavlou-Litina,et al.  Current trends in quantitative structure activity relationships on FXa inhibitors: Evaluation and comparative analysis , 2004, Medicinal research reviews.

[78]  G Klebe,et al.  Three-dimensional quantitative structure-activity relationship analyses using comparative molecular field analysis and comparative molecular similarity indices analysis to elucidate selectivity differences of inhibitors binding to trypsin, thrombin, and factor Xa. , 1999, Journal of medicinal chemistry.

[79]  K. Roy,et al.  QSAR with electrotopological state atom index: human factor Xa inhibitor N2-aroylanthranilamides. , 2002, Drug design and discovery.

[80]  P. Labute A widely applicable set of descriptors. , 2000, Journal of molecular graphics & modelling.

[81]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[82]  György M Keseru,et al.  A neural network based virtual screening of cytochrome P450 3A4 inhibitors. , 2002, Bioorganic & medicinal chemistry letters.

[83]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[84]  B. Katz,et al.  Development of serine protease inhibitors displaying a multicentered short (<2.3 A) hydrogen bond binding mode: inhibitors of urokinase-type plasminogen activator and factor Xa. , 2001, Journal of medicinal chemistry.

[85]  J R Pruitt,et al.  Design and synthesis of isoxazoline derivatives as factor Xa inhibitors. 1. , 1999, Journal of medicinal chemistry.

[86]  Bernard De Baets,et al.  Feature subset selection for splice site prediction , 2002, ECCB.

[87]  A. Spada,et al.  Crystal structures of human factor Xa complexed with potent inhibitors. , 2000, Journal of medicinal chemistry.

[88]  T. Cacoullos Estimation of a multivariate density , 1966 .

[89]  W. Bode,et al.  Design of benzamidine-type inhibitors of factor Xa. , 1998, Journal of medicinal chemistry.

[90]  Daniel L Cheney,et al.  Optimization of the beta-aminoester class of factor Xa inhibitors. Part 2: Identification of FXV673 as a potent and selective inhibitor with excellent In vivo anticoagulant activity. , 2002, Bioorganic & medicinal chemistry letters.

[91]  G. Broze Tissue factor pathway inhibitor and the current concept of blood coagulation , 1995, Blood coagulation & fibrinolysis : an international journal in haemostasis and thrombosis.

[92]  R. Czerminski,et al.  Use of Support Vector Machine in Pattern Classification: Application to QSAR Studies , 2001 .

[93]  C. Dominguez,et al.  Design, synthesis, and biological evaluation of potent and selective amidino bicyclic factor Xa inhibitors. , 2000, Journal of medicinal chemistry.

[94]  R. Snyder,et al.  Assessment of the sensitivity of the computational programs DEREK, TOPKAT, and MCASE in the prediction of the genotoxicity of pharmaceutical molecules , 2004, Environmental and molecular mutagenesis.

[95]  W D Shrader,et al.  Development of potent and selective factor Xa inhibitors. , 2001, Bioorganic & medicinal chemistry letters.