Predicting DPP-IV inhibitors with machine learning approaches

Dipeptidyl peptidase IV (DPP-IV) is a promising Type 2 diabetes mellitus (T2DM) drug target. DPP-IV inhibitors prolong the action of glucagon-like peptide-1 (GLP-1) and gastric inhibitory peptide (GIP), improve glucose homeostasis without weight gain, edema, and hypoglycemia. However, the marketed DPP-IV inhibitors have adverse effects such as nasopharyngitis, headache, nausea, hypersensitivity, skin reactions and pancreatitis. Therefore, it is still expected for novel DPP-IV inhibitors with minimal adverse effects. The scaffolds of existing DPP-IV inhibitors are structurally diversified. This makes it difficult to build virtual screening models based upon the known DPP-IV inhibitor libraries using conventional QSAR approaches. In this paper, we report a new strategy to predict DPP-IV inhibitors with machine learning approaches involving naïve Bayesian (NB) and recursive partitioning (RP) methods. We built 247 machine learning models based on 1307 known DPP-IV inhibitors with optimized molecular properties and topological fingerprints as descriptors. The overall predictive accuracies of the optimized models were greater than 80%. An external test set, composed of 65 recently reported compounds, was employed to validate the optimized models. The results demonstrated that both NB and RP models have a good predictive ability based on different combinations of descriptors. Twenty “good” and twenty “bad” structural fragments for DPP-IV inhibitors can also be derived from these models for inspiring the new DPP-IV inhibitor scaffold design.

[1]  Lei Chen,et al.  ADME evaluation in drug discovery. 10. Predictions of P-glycoprotein inhibitors using recursive partitioning and naive Bayesian classification techniques. , 2011, Molecular pharmaceutics.

[2]  G. Glenner,et al.  A new dipeptide naphthylamidase hydrolyzing glycyl-prolyl-β-naphthylamide , 2004, Histochemie.

[3]  A. Scheen,et al.  Cardiovascular effects of gliptins , 2013, Nature Reviews Cardiology.

[4]  Mitsuru Oka,et al.  Anagliptin, a potent dipeptidyl peptidase IV inhibitor: its single-crystal structure and enzyme interactions , 2015, Journal of enzyme inhibition and medicinal chemistry.

[5]  L. Juillerat-Jeanneret Dipeptidyl peptidase IV and its inhibitors: therapeutics for type 2 diabetes and what else? , 2014, Journal of medicinal chemistry.

[6]  T. A. McIntyre,et al.  Prediction of animal clearance using naïve Bayesian classification and extended connectivity fingerprints , 2009, Xenobiotica; the fate of foreign compounds in biological systems.

[7]  Rommie E. Amaro,et al.  POVME 2.0: An Enhanced Tool for Determining Pocket Shape and Volume Characteristics , 2014, Journal of chemical theory and computation.

[8]  Bhumika D. Patel,et al.  3D-QSAR studies of dipeptidyl peptidase-4 inhibitors using various alignment methods , 2014, Medicinal Chemistry Research.

[9]  Jiansong Fang,et al.  Predictions of BuChE Inhibitors Using Support Vector Machine and Naive Bayesian Classification Techniques in Drug Discovery , 2013, J. Chem. Inf. Model..

[10]  Jacob D. Durrant,et al.  POVME: an algorithm for measuring binding-pocket volumes. , 2011, Journal of molecular graphics & modelling.

[11]  G. Glenner,et al.  A new dipeptide naphthylamidase hydrolyzing glycyl-prolyl-beta-naphthylamide. , 1966, Histochemie. Histochemistry. Histochimie.

[12]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[13]  B. Seed,et al.  Cloning and functional expression of the T cell activation antigen CD26. , 1992, Journal of immunology.

[14]  Hiroshi Sakashita,et al.  Fused bicyclic heteroarylpiperazine-substituted L-prolylthiazolidines as highly potent DPP-4 inhibitors lacking the electrophilic nitrile group. , 2012, Bioorganic & medicinal chemistry.

[15]  Sheng Tian,et al.  ADME evaluation in drug discovery. 9. Prediction of oral bioavailability in humans based on molecular properties and structural fingerprints. , 2011, Molecular pharmaceutics.

[16]  Vimal K. Narula,et al.  A Potential Role for Dendritic Cell/Macrophage-Expressing DPP4 in Obesity-Induced Visceral Inflammation , 2012, Diabetes.

[17]  G. De’ath,et al.  CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE FOR ECOLOGICAL DATA ANALYSIS , 2000 .

[18]  Hideaki Shima,et al.  The structure and function of human dipeptidyl peptidase IV, possessing a unique eight-bladed beta-propeller fold. , 2003, Biochemical and biophysical research communications.

[19]  Shinichi Ishii,et al.  A comparative study of the binding modes of recently launched dipeptidyl peptidase IV inhibitors in the active site. , 2013, Biochemical and biophysical research communications.

[20]  Gustavo Henrique Goulart Trossini,et al.  Use of machine learning approaches for novel drug discovery , 2016, Expert opinion on drug discovery.

[21]  H. Dijkman,et al.  Organ distribution of aminopeptidase A and dipeptidyl peptidase IV in normal mice. , 1996, The journal of histochemistry and cytochemistry : official journal of the Histochemistry Society.

[22]  Chong Hak Chae,et al.  Docking-based 3D-QSAR study for selectivity of DPP4, DPP8, and DPP9 inhibitors. , 2007, Bioorganic & medicinal chemistry letters.

[23]  Matthew L Albert,et al.  Dipeptidylpeptidase 4 inhibition enhances lymphocyte trafficking, improving both naturally occurring tumor immunity and immunotherapy , 2015, Nature Immunology.

[24]  C. Bailey,et al.  Dipeptidyl peptidase IV (DPP IV) inhibitors: a newly emerging drug class for the treatment of type 2 diabetes , 2006, Diabetes & vascular disease research.

[25]  Andreas Bender,et al.  Ligand-Target Prediction Using Winnow and Naive Bayesian Algorithms and the Implications of Overall Performance Statistics , 2008, J. Chem. Inf. Model..

[26]  Ling Wang,et al.  Predicting mTOR Inhibitors with a Classifier Using Recursive Partitioning and Naïve Bayesian Approaches , 2014, PloS one.

[27]  Sven Branner,et al.  Crystal structure of human dipeptidyl peptidase IV/CD26 in complex with a substrate analog , 2003, Nature Structural Biology.

[28]  Meir Glick,et al.  Enrichment of High-Throughput Screening Data with Increasing Levels of Noise Using Support Vector Machines, Recursive Partitioning, and Laplacian-Modified Naive Bayesian Classifiers , 2006, J. Chem. Inf. Model..

[29]  Antonio Lavecchia,et al.  Machine-learning approaches in drug discovery: methods and applications. , 2015, Drug discovery today.

[30]  Koji Inaka,et al.  The structure and function of human dipeptidyl peptidase IV, possessing a unique eight-bladed β-propeller fold , 2003 .

[31]  M. Ghate,et al.  Recent approaches to medicinal chemistry and therapeutic potential of dipeptidyl peptidase-4 (DPP-4) inhibitors. , 2014, European journal of medicinal chemistry.

[32]  D. Drucker,et al.  The incretin system: glucagon-like peptide-1 receptor agonists and dipeptidyl peptidase-4 inhibitors in type 2 diabetes , 2006, The Lancet.

[33]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[34]  Xiaoyang Xia,et al.  Classification of kinase inhibitors using a Bayesian model. , 2004, Journal of medicinal chemistry.

[35]  Jun Xu A new approach to finding natural chemical structure classes. , 2002, Journal of medicinal chemistry.