KinasepKipred: A Predictive Model for Estimating Ligand-Kinase Inhibitor Constant (pKi)

Kinases are one of the most important classes of drug targets for therapeutic use. Algorithms that can accurately predict the drug-kinase inhibitor constant (pKi) of kinases can considerably accelerate the drug discovery process. In this study, we have developed computational models, leveraging machine learning techniques, to predict ligand-kinase (pKi) values. Kinase-ligand inhibitor constant (Ki) data was retrieved from Drug Target Commons (DTC) and Metz databases. Machine learning models were developed based on structural and physicochemical features of the protein and, topological pharmacophore atomic triplets fingerprints of the ligands. Three machine learning models [random forest (RFR), extreme gradient boosting (XGBoost) and artificial neural network (ANN)] were tested for model development. The performance of our models were evaluated using several metrics with 95% confidence interval. RFR model was finally selected based on the evaluation metrics on test datasets and used for web implementation. The best and selected model achieved a Pearson correlation coefficient (R) of 0.887 (0.881, 0.893), root-mean-square error (RMSE) of 0.475 (0.465, 0.486), Concordance index (Con. Index) of 0.854 (0.851, 0.858), and an area under the curve of receiver operating characteristic curve (AUC-ROC) of 0.957 (0.954, 0.960) during the internal 5-fold cross validation. Availability GitHub: https://github.com/sirimullalab/KinasepKipred, Docker: sirimullalab/kinasepkipred Implementation https://drugdiscovery.utep.edu/pki/ Graphical TOC Entry

[1]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[2]  P. Hajduk,et al.  Navigating the kinome. , 2011, Nature chemical biology.

[3]  Miklos Feher,et al.  Novel 2D Fingerprints for Ligand-Based Virtual Screening , 2006, J. Chem. Inf. Model..

[4]  Artem Cherkasov,et al.  SimBoost: a read-across approach for predicting drug–target binding affinities using gradient boosting machines , 2017, Journal of Cheminformatics.

[5]  D. Fabbro 25 Years of Small Molecular Weight Kinase Inhibitors: Potentials and Limitations , 2015, Molecular Pharmacology.

[6]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[7]  Qiang Huang,et al.  Role of Mg2+ ions in protein kinase phosphorylation: insights from molecular dynamics simulations of ATP-kinase complexes , 2011 .

[8]  Andrew L. Hopkins,et al.  Drug discovery: Predicting promiscuity , 2009, Nature.

[9]  Nicolas Stransky,et al.  Targeting cancer with kinase inhibitors. , 2015, The Journal of clinical investigation.

[10]  Laetitia Martin-Chanas,et al.  Identify drug repurposing candidates by mining the Protein Data Bank , 2011, Briefings Bioinform..

[11]  Paul Watson,et al.  Naïve Bayes Classification Using 2D Pharmacophore Feature Triplet Vectors , 2008, J. Chem. Inf. Model..

[12]  Malcolm J. McGregor,et al.  Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design , 1999, J. Chem. Inf. Comput. Sci..

[13]  Benjamin Parent,et al.  Fuzzy Tricentric Pharmacophore Fingerprints, 1. Topological Fuzzy Pharmacophore Triplets and Adapted Molecular Similarity Scoring Schemes , 2006, J. Chem. Inf. Model..

[14]  D. Bojanic,et al.  Keynote review: in vitro safety pharmacology profiling: an essential tool for successful drug development. , 2005, Drug discovery today.

[15]  A. Anderson The process of structure-based drug design. , 2003, Chemistry & biology.

[16]  Manish Sud,et al.  MayaChemTools: An Open Source Package for Computational Drug Discovery , 2016, J. Chem. Inf. Model..

[17]  Mindy I. Davis,et al.  Comprehensive analysis of kinase inhibitor selectivity , 2011, Nature Biotechnology.

[18]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[19]  Luhua Lai,et al.  Prediction of potential drug targets based on simple sequence properties , 2007, BMC Bioinformatics.

[20]  Tapio Pahikkala,et al.  Toward more realistic drug^target interaction predictions , 2014 .

[21]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[22]  Goutam Paul,et al.  A machine learning approach towards the prediction of protein–ligand binding affinity based on fundamental molecular properties , 2018, RSC advances.

[23]  Elina Parri,et al.  Drug Target Commons: A Community Effort to Build a Consensus Knowledge Base for Drug-Target Interactions , 2017, Cell chemical biology.

[24]  Arzucan Özgür,et al.  DeepDTA: deep drug–target binding affinity prediction , 2018, Bioinform..

[25]  Dong-Sheng Cao,et al.  propy: a tool to generate various modes of Chou's PseAAC , 2013, Bioinform..

[26]  F. McCoy,et al.  Janus-faced PIDD: a sensor for DNA damage-induced cell death or survival? , 2012, Molecular cell.