A lazy learning-based QSAR classification study for screening potential histone deacetylase 8 (HDAC8) inhibitors

Histone deacetylases 8 (HDAC8) is an enzyme repressing the transcription of various genes including tumour suppressor gene and has already become a target of human cancer treatment. In an effort to facilitate the discovery of HDAC8 inhibitors, two quantitative structure–activity relationship (QSAR) classification models were developed using K nearest neighbours (KNN) and neighbourhood classifier (NEC). Molecular descriptors were calculated for the data set and database compounds using ADRIANA.Code of Molecular Networks. Principal components analysis (PCA) was used to select the descriptors. The developed models were validated by leave-one-out cross validation (LOO CV). The performances of the developed models were evaluated with an external test set. Highly predictive models were used for database virtual screening. Furthermore, hit compounds were subsequently subject to molecular docking. Five hits were obtained based on consensus scoring function and binding affinity as potential HDAC8 inhibitors. Finally, HDAC8 structures in complex with five hits were also subjected to 5 ns molecular dynamics (MD) simulations to evaluate the complex structure stability. To the best of our knowledge, the NEC classification model used in this study is the first application of NEC to virtual screening for drug discovery.

[1]  Wim F Vranken,et al.  ACPYPE - AnteChamber PYthon Parser interfacE , 2012, BMC Research Notes.

[2]  C. Van Lint,et al.  Characterization of a human RPD3 ortholog, HDAC3. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Yu Zong Chen,et al.  Support vector machines approach for predicting druggable proteins: recent progress in its exploration and investigation of its usefulness. , 2007, Drug discovery today.

[4]  Abdullah Al Rubaish,et al.  Appraisal of Using Global Student Rating Items in Quality Management of Higher Education in Saudi Arabian University , 2012 .

[5]  Wim F. Vranken,et al.  ACPYPE - AnteChamber PYthon Parser , 2012 .

[6]  Alexander Golbraikh,et al.  A Novel Automated Lazy Learning QSAR (ALL-QSAR) Approach: Method Development, Applications, and Virtual Screening of Chemical Databases Using Validated ALL-QSAR Models , 2006, J. Chem. Inf. Model..

[7]  Olivier Taboureau,et al.  Classification of Cytochrome P450 1A2 Inhibitors and Noninhibitors by Machine Learning Techniques , 2009, Drug Metabolism and Disposition.

[8]  Jim Euchner Design , 2014, Catalysis from A to Z.

[9]  Chunyan Tan,et al.  A Two-Step Target Binding and Selectivity Support Vector Machines Approach for Virtual Screening of Dopamine Receptor Subtype-Selective Ligands , 2012, PloS one.

[10]  Keun Woo Lee,et al.  Dynamic Structure-Based Pharmacophore Model Development: A New and Effective Addition in the Histone Deacetylase 8 (HDAC8) Inhibitor Discovery , 2011, International journal of molecular sciences.

[11]  H. Berendsen,et al.  Interaction Models for Water in Relation to Protein Hydration , 1981 .

[12]  Martha S. Head,et al.  Validation Studies of the Site-Directed Docking Program LibDock , 2007, J. Chem. Inf. Model..

[13]  C. Horvath,et al.  Carbonyl- and sulfur-containing analogs of suberoylanilide hydroxamic acid: Potent inhibition of histone deacetylases. , 2006, Bioorganic & medicinal chemistry.

[14]  A. Harel-Bellan,et al.  Histone acetyltransferases and deacetylases in the control of cell proliferation and differentiation. , 2002, Advances in cancer research.

[15]  Ronald Breslow,et al.  Dimethyl sulfoxide to vorinostat: development of this histone deacetylase inhibitor as an anticancer drug , 2007, Nature Biotechnology.

[16]  R. De Francesco,et al.  Substrate binding to histone deacetylases as shown by the crystal structure of the HDAC8–substrate complex , 2007, EMBO reports.

[17]  A. Kazantsev,et al.  Biological and Potential Therapeutic Roles of Sirtuin Deacetylases , 2008, Cellular and Molecular Life Sciences.

[18]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[19]  Alexander Golbraikh,et al.  Differentiation of AmpC beta-lactamase binders vs. decoys using classification kNN QSAR modeling and application of the QSAR classifier to virtual screening , 2008, J. Comput. Aided Mol. Des..

[20]  Qinghua Hu,et al.  Neighborhood classifiers , 2008, Expert Syst. Appl..

[21]  L. Guarente,et al.  Transcriptional silencing and longevity protein Sir2 is an NAD-dependent histone deacetylase , 2000, Nature.

[22]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[23]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[24]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[25]  Barbara Hero,et al.  Histone Deacetylase 8 in Neuroblastoma Tumorigenesis , 2009, Clinical Cancer Research.

[26]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[27]  M. Parrinello,et al.  Polymorphic transitions in single crystals: A new molecular dynamics method , 1981 .

[28]  P. Ji,et al.  Discovery of Novel Bruton’s Tyrosine Kinase Inhibitors Using a Hybrid Protocol of Virtual Screening Approaches Based on SVM Model, Pharmacophore and Molecular Docking , 2012, Chemical biology & drug design.

[29]  T. Kummalue,et al.  The inv(16) Fusion Protein Associates with Corepressors via a Smooth Muscle Myosin Heavy-Chain Domain , 2003, Molecular and Cellular Biology.

[30]  T. Y. Lin,et al.  Neighborhood systems and relational databases , 1988, CSC '88.

[31]  F. Sánchez‐Madrid,et al.  HDAC6: a key regulator of cytoskeleton, cell migration and cell-cell interactions. , 2008, Trends in cell biology.

[32]  Xianghui Liu,et al.  SVM Model for Virtual Screening of Lck Inhibitors , 2009, J. Chem. Inf. Model..

[33]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[34]  Hao Tang,et al.  Novel Inhibitors of Human Histone Deacetylase (HDAC) Identified by QSAR Modeling of Known Inhibitors, Virtual Screening, and Experimental Validation , 2009, J. Chem. Inf. Model..

[35]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[36]  D. Diller,et al.  Kinases, homology models, and high throughput docking. , 2003, Journal of medicinal chemistry.

[37]  G. Brodeur Neuroblastoma: biological insights into a clinical enigma , 2003, Nature Reviews Cancer.

[38]  Ying Dai,et al.  Principal component analysis based methods in bioinformatics studies , 2011, Briefings Bioinform..

[39]  Y Z Chen,et al.  Virtual screening of selective multitarget kinase inhibitors by combinatorial support vector machines. , 2010, Molecular pharmaceutics.

[40]  Yu-Dong Cai,et al.  Support vector machine for SAR/QSAR of phenethyl-amines , 2007, Acta Pharmacologica Sinica.

[41]  Guang Ping Cao,et al.  Classification of HDAC8 Inhibitors and Non-Inhibitors Using Support Vector Machines , 2012 .

[42]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[43]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[44]  P. Schultz,et al.  Design, synthesis, and activity of HDAC inhibitors with a N-formyl hydroxylamine head group. , 2004, Bioorganic & medicinal chemistry letters.

[45]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[46]  R. Sternglanz,et al.  Role of NAD(+) in the deacetylase activity of the SIR2-like proteins. , 2000, Biochemical and biophysical research communications.

[47]  Y Z Chen,et al.  Combinatorial support vector machines approach for virtual screening of selective multi-target serotonin reuptake inhibitors from large compound libraries. , 2012, Journal of molecular graphics & modelling.

[48]  Aixia Yan,et al.  Classification of HCV NS5B Polymerase Inhibitors Using Support Vector Machine , 2012, International journal of molecular sciences.

[49]  Chi-Woong Mun,et al.  Comparison of k-nearest neighbor, quadratic discriminant and linear discriminant analysis in classification of electromyogram signals based on the wrist-motion directions , 2011 .

[50]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[51]  Abed,et al.  Characterization of a human RPD 3 ortholog , HDAC 3 , 1998 .

[52]  Jessica E. Bolden,et al.  Anticancer activities of histone deacetylase inhibitors , 2006, Nature Reviews Drug Discovery.

[53]  Yu Zong Chen,et al.  Prediction of Cytochrome P450 3A4, 2D6, and 2C9 Inhibitors and Substrates by Using Support Vector Machines , 2005, J. Chem. Inf. Model..

[54]  Tatsuya Akutsu,et al.  Graph Kernels for Molecular Structure-Activity Relationship Analysis with Support Vector Machines , 2005, J. Chem. Inf. Model..

[55]  E. Seto,et al.  The Rpd3/Hda1 family of lysine deacetylases: from bacteria and yeast to mice and men , 2008, Nature Reviews Molecular Cell Biology.