Kinome-wide Activity Modeling from Diverse Public High-Quality Data Sets

Large corpora of kinase small molecule inhibitor data are accessible to public sector research from thousands of journal article and patent publications. These data have been generated employing a wide variety of assay methodologies and experimental procedures by numerous laboratories. Here we ask the question how applicable these heterogeneous data sets are to predict kinase activities and which characteristics of the data sets contribute to their utility. We accessed almost 500,000 molecules from the Kinase Knowledge Base (KKB) and after rigorous aggregation and standardization generated over 180 distinct data sets covering all major groups of the human kinome. To assess the value of the data sets, we generated hundreds of classification and regression models. Their rigorous cross-validation and characterization demonstrated highly predictive classification and quantitative models for the majority of kinase targets if a minimum required number of active compounds or structure-activity data points were available. We then applied the best classifiers to compounds most recently profiled in the NIH Library of Integrated Network-based Cellular Signatures (LINCS) program and found good agreement of profiling results with predicted activities. Our results indicate that, although heterogeneous in nature, the publically accessible data sets are exceedingly valuable and well suited to develop highly accurate predictors for practical Kinome-wide virtual screening applications and to complement experimental kinase profiling.

[1]  M. Vieth,et al.  Kinomics-structural biology and chemogenomics of kinase inhibitors and targets. , 2004, Biochimica et biophysica acta.

[2]  K. Shokat,et al.  Targeting the cancer kinome through polypharmacology , 2010, Nature Reviews Cancer.

[3]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[4]  William L. Jorgensen,et al.  Journal of Chemical Information and Modeling , 2005, J. Chem. Inf. Model..

[5]  M. Vieth,et al.  Kinomics: characterizing the therapeutically validated kinase space. , 2005, Drug discovery today.

[6]  P. Willett,et al.  Comparison of topological descriptors for similarity-based virtual screening using multiple bioactive reference structures. , 2004, Organic & biomolecular chemistry.

[7]  Michal Brylinski,et al.  Comprehensive Structural and Functional Characterization of the Human Kinome by Protein Structure Modeling and Ligand Virtual Screening , 2010, J. Chem. Inf. Model..

[8]  P. Zarrinkar,et al.  High-throughput kinase profiling as a platform for drug discovery , 2008, Nature Reviews Drug Discovery.

[9]  D. Rogers,et al.  Using Extended-Connectivity Fingerprints with Laplacian-Modified Bayesian Analysis in High-Throughput Screening Follow-Up , 2005, Journal of biomolecular screening.

[10]  R. Cramer Partial Least Squares (PLS): Its strengths and limitations , 1993 .

[11]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[12]  M. Murcko,et al.  Kinase-likeness and kinase-privileged fragments: toward virtual polypharmacology. , 2008, Journal of medicinal chemistry.

[13]  Xiaoyang Xia,et al.  Classification of kinase inhibitors using a Bayesian model. , 2004, Journal of medicinal chemistry.

[14]  P. Cohen Protein kinases — the major drug targets of the twenty-first century? , 2002, Nature reviews. Drug discovery.

[15]  Hans Briem,et al.  Classifying “Kinase Inhibitor‐Likeness” by Using Machine‐Learning Methods , 2005, Chembiochem : a European journal of chemical biology.

[16]  N. Gray,et al.  Targeting cancer with small molecule kinase inhibitors , 2009, Nature Reviews Cancer.

[17]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[18]  N. Gray,et al.  Rational design of inhibitors that bind to inactive kinase conformations , 2006, Nature chemical biology.

[19]  L. Wodicka,et al.  A small molecule–kinase interaction map for clinical kinase inhibitors , 2005, Nature Biotechnology.

[20]  R. Morphy Selectively nonselective kinase inhibition: striking the right balance. , 2010, Journal of medicinal chemistry.

[21]  S. Knapp,et al.  A systematic interaction map of validated kinase inhibitors with Ser/Thr kinases , 2007, Proceedings of the National Academy of Sciences.

[22]  Meir Glick,et al.  Enrichment of High-Throughput Screening Data with Increasing Levels of Noise Using Support Vector Machines, Recursive Partitioning, and Laplacian-Modified Naive Bayesian Classifiers , 2006, J. Chem. Inf. Model..

[23]  P. Cohen,et al.  Specificity and mechanism of action of some commonly used protein kinase inhibitors. , 2000, The Biochemical journal.

[24]  D. Manallack,et al.  Selecting Screening Candidates for Kinase and G Protein‐Coupled Receptor Targets Using Neural Networks. , 2002 .

[25]  E. Casale,et al.  Through the "gatekeeper door": exploiting the active kinase conformation. , 2010, Journal of medicinal chemistry.

[26]  Woody Sherman,et al.  Large-Scale Systematic Analysis of 2D Fingerprint Methods and Parameters to Improve Virtual Screening Enrichments , 2010, J. Chem. Inf. Model..

[27]  Mindy I. Davis,et al.  A quantitative analysis of kinase inhibitor selectivity , 2008, Nature Biotechnology.

[28]  Ian A. Watson,et al.  Chemical fragments as foundations for understanding target space and activity prediction. , 2008, Journal of medicinal chemistry.

[29]  Gavin Harper,et al.  Assessment of chemical coverage of kinome space and its implications for kinase drug discovery. , 2008, Journal of medicinal chemistry.

[30]  Beat Ernst,et al.  Drug discovery today. , 2003, Current topics in medicinal chemistry.

[31]  Irini Akritopoulou-Zanze,et al.  Kinase-targeted libraries: the design and synthesis of novel, potent, and selective kinase inhibitors. , 2009, Drug discovery today.

[32]  Stephen R. Johnson,et al.  Trends in kinase selectivity: insights for target class-focused library screening. , 2011, Journal of medicinal chemistry.

[33]  S. Bryant,et al.  PubChem as a public resource for drug discovery. , 2010, Drug discovery today.