Use of ligand based models for protein domains to predict novel molecular targets and applications to triage affinity chromatography data.

The elucidation of drug targets is important both to optimize desired compound action and to understand drug side-effects. In this study, we created statistical models which link chemical substructures of ligands to protein domains in a probabilistic manner and employ the model to triage the results of affinity chromatography experiments. By annotating targets with their InterPro domains, general rules of ligand-protein domain associations were derived and successfully employed to predict protein targets outside the scope of the training set. This methodology was then tested on a proteomics affinity chromatography data set containing 699 compounds. The domain prediction model correctly detected 31.6% of the experimental targets at a specificity of 46.8%. This is striking since 86% of the predicted targets are not part of them (but share InterPro domains with them), and thus could not have been predicted by conventional target prediction approaches. Target predictions improve drastically when significance (FDR) scores for target pulldowns are employed, emphasizing their importance for eliminating artifacts. Filament proteins (such as actin and tubulin) are detected to be 'frequent hitters' in proteomics experiments and their presence in pulldowns is not supported by the target predictions. On the other hand, membrane-bound receptors such as serotonin and dopamine receptors are noticeably absent in the affinity chromatography sets, although their presence would be expected from the predicted targets of compounds. While this can partly be explained by the experimental setup, we suggest the computational methods employed here as a complementary step of identifying protein targets of small molecules. Affinity chromatography results for gefitinib are discussed in detail and while two out of the three kinases with the highest affinity to gefitinib in biochemical assays are detected by affinity chromatography, also the possible involvement of NSF as a target for modulating cancer progressions via beta-arrestin can be proposed by this method.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  R. Lefkowitz,et al.  Identification of NSF as a beta-arrestin1-binding protein. Implications for beta2-adrenergic receptor regulation. , 1999, The Journal of biological chemistry.

[3]  J Mottram,et al.  Intracellular targets of cyclin-dependent kinase inhibitors: identification by affinity chromatography using immobilised inhibitors. , 2000, Chemistry & biology.

[4]  Y.Z. Chen,et al.  Ligand–protein inverse docking and its potential use in the computer search of protein targets of a small molecule , 2001, Proteins.

[5]  Alex Bateman,et al.  The InterPro database, an integrated documentation resource for protein families, domains and functional sites , 2001, Nucleic Acids Res..

[6]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[7]  Yue Sun,et al.  β-Arrestin2 Is Critically Involved in CXCR4-mediated Chemotaxis, and This Is Mediated by Its Enhancement of p38 MAPK Activation* , 2002, The Journal of Biological Chemistry.

[8]  G. Superti-Furga,et al.  Rediscovering the sweet spot in drug discovery. , 2003, Drug discovery today.

[9]  Jean-Loup Faulon,et al.  The Signature Molecular Descriptor. 1. Using Extended Valence Sequences in QSAR and QSPR Studies , 2003, J. Chem. Inf. Comput. Sci..

[10]  Yoshiya Oda,et al.  Quantitative chemical proteomics for identifying candidate drug targets. , 2003, Analytical chemistry.

[11]  T. Hampton,et al.  "Promiscuous" anticancer drugs that hit multiple targets may thwart resistance. , 2004, JAMA.

[12]  Min Wu,et al.  Fishing for targets: novel approaches using small molecule baits. , 2004, Drug discovery today. Technologies.

[13]  Janet M Thornton,et al.  Ligand selectivity and competition between enzymes in silico , 2004, Nature Biotechnology.

[14]  T. Terada,et al.  Design and synthesis of novel hydrophilic spacers for the reduction of nonspecific binding proteins on affinity resins. , 2004, Bioorganic & medicinal chemistry.

[15]  R. Botting,et al.  Cyclooxygenase Isozymes: The Biology of Prostaglandin Synthesis and Inhibition , 2004, Pharmacological Reviews.

[16]  N. Paul,et al.  Recovering the true targets of specific ligands by virtual screening of the protein data bank , 2004, Proteins.

[17]  Michelle R. Arkin,et al.  Small-molecule inhibitors of protein–protein interactions: progressing towards the dream , 2004, Nature Reviews Drug Discovery.

[18]  G. Kéri,et al.  Multidrug transporter ABCG2 prevents tumor cell death induced by the epidermal growth factor receptor inhibitor Iressa (ZD1839, Gefitinib). , 2005, Cancer research.

[19]  Cathy H. Wu,et al.  InterPro, progress and status in 2005 , 2004, Nucleic Acids Res..

[20]  Cathy H. Wu,et al.  InterPro, progress and status in 2005 , 2004, Nucleic Acids Res..

[21]  L. Wodicka,et al.  A small molecule–kinase interaction map for clinical kinase inhibitors , 2005, Nature Biotechnology.

[22]  R. Aebersold,et al.  Scoring proteomes with proteotypic peptide probes , 2005, Nature Reviews Molecular Cell Biology.

[23]  Christopher W. V. Hogue,et al.  Domain-based small molecule binding site annotation , 2006, BMC Bioinformatics.

[24]  Ajay N. Jain,et al.  Robust ligand-based modeling of the biological targets of known drugs. , 2006, Journal of medicinal chemistry.

[25]  J. Jenkins,et al.  Prediction of Biological Targets for Compounds Using Multiple‐Category Bayesian Models Trained on Chemogenomics Databases. , 2006 .

[26]  Peteris Prusis,et al.  Rough set‐based proteochemometrics modeling of G‐protein‐coupled receptor‐ligand interactions , 2006, Proteins.

[27]  R. DuBois,et al.  Emerging roles of beta-arrestins. , 2006, Cell cycle.

[28]  A. Bender,et al.  Circular fingerprints: flexible molecular descriptors with applications from physical chemistry to ADME. , 2006, IDrugs : the investigational drugs journal.

[29]  K. Fidelis,et al.  Generalized modeling of enzyme–ligand interactions using proteochemometrics and local protein substructures , 2006, Proteins.

[30]  Qiong Shi,et al.  Role of beta-arrestin 1 in the metastatic progression of colorectal cancer. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[31]  A. Bender,et al.  In silico target fishing: Predicting biological targets from chemical structure , 2006 .

[32]  Z. Deng,et al.  Bridging chemical and biological space: "target fishing" using 2D and 3D molecular descriptors. , 2006, Journal of medicinal chemistry.

[33]  C. E. Peishoff,et al.  A critical assessment of docking programs and scoring functions. , 2006, Journal of medicinal chemistry.

[34]  Specific affinity extraction method for small molecule-binding proteins. , 2006, Analytical chemistry.

[35]  Rapid computational identification of the targets of protein kinase inhibitors. , 2006, Current opinion in drug discovery & development.

[36]  Y. Oda,et al.  Chemical proteomics for drug discovery based on compound-immobilized affinity chromatography. , 2007, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[37]  P. Clemons,et al.  Chemogenomic data analysis: prediction of small-molecule targets and the advent of biological fingerprint. , 2007, Combinatorial chemistry & high throughput screening.

[38]  A. Bender,et al.  Modeling Promiscuity Based on in vitro Safety Pharmacology Profiling Data , 2007, ChemMedChem.

[39]  Bernhard Kuster,et al.  Quantitative chemical proteomics reveals mechanisms of action of clinical ABL kinase inhibitors , 2007, Nature Biotechnology.

[40]  Andreas Bender,et al.  Understanding False Positives in Reporter Gene Assays: in Silico Chemogenomics Approaches To Prioritize Cell-Based HTS Data , 2007, J. Chem. Inf. Model..

[41]  Jing Li,et al.  Association of variant ABCG2 and the pharmacokinetics of epidermal growth factor receptor tyrosine kinase inhibitors in cancer patients , 2007, Cancer biology & therapy.

[42]  Robert D. Finn,et al.  New developments in the InterPro database , 2007, Nucleic Acids Res..

[43]  A. Bender,et al.  Analysis of Pharmacology Data and the Prediction of Adverse Drug Reactions and Off‐Target Effects from Chemical Structure , 2007, ChemMedChem.

[44]  Kevin K. Anderson,et al.  A Bayesian estimator of protein-protein association probabilities , 2008, Bioinform..

[45]  Jean-Loup Faulon,et al.  Genome scale enzyme–metabolite and drug–target interaction predictions using the signature molecular descriptor , 2008 .

[46]  Mindy I. Davis,et al.  A quantitative analysis of kinase inhibitor selectivity , 2008, Nature Biotechnology.