Likelihood of side effects depends on desired clinical impact: Affinities within a very small set of targets enables inference of promiscuity or specificity of kinase inhibitors

As the heterogeneous nature of cancer starts to emerge, the focus of molecular therapy is shifting progressively towards multi-target drugs. For example, drug-based interference with several signaling pathways controlling different aspects of cell fate provides a multi-pronged attack that is proving more effective than magic bullets in hampering development and progression of malignancy. Such therapeutic agents typically target kinases, the basic signal transducers of the cell. Because kinases share common evolutionary backgrounds, they also share structural attributes, making it difficult for drugs to tell apart paralogs of clinical importance from off-target kinases. Thus, multi-target kinase inhibitors (KIs) tend to have undesired cross-reactivities with potentially lethal or debilitating side effects. As multi-target therapies are favored, a pressing issue takes the stakes: which type of clinical impact can only be achieved with a promiscuous drug, and conversely, which clinical effect lends itself to drug specificity? Combining statistical analysis with data mining and machine learning, we determine extremely small inferential bases with 3-5 targets that enable a kinomewide assessment of promiscuity and specificity with over 97% accuracy. Thus, the likelihood of side effects in molecular therapy arising from undesired cross-activities is pivotally dependent on the intended clinical impact restricted to checking a few relevant targets.

[1]  Quoc-Nam Tran Microarray Data Mining: A New Algorithm for Gene Selection Using Lorenz Curves & Gini Ratios , 2010, 2010 Seventh International Conference on Information Technology: New Generations.

[2]  John P. Overington,et al.  Can we rationally design promiscuous drugs? , 2006, Current opinion in structural biology.

[3]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[4]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[5]  B. Stockwell,et al.  Multicomponent therapeutics for networked systems , 2005, Nature Reviews Drug Discovery.

[6]  T. Hampton,et al.  "Promiscuous" anticancer drugs that hit multiple targets may thwart resistance. , 2004, JAMA.

[7]  Chuong B Do,et al.  What is the expectation maximization algorithm? , 2008, Nature Biotechnology.

[8]  Simon K. Mencher,et al.  BMC Clinical Pharmacology BioMed Central Debate , 2005 .

[9]  K. Kinzler,et al.  Cancer genes and the pathways they control , 2004, Nature Medicine.

[10]  Quocnam Tran,et al.  Mining Medical Databases with Modified Gini Index Classification , 2008, Fifth International Conference on Information Technology: New Generations (itng 2008).

[11]  S. Frantz Drug discovery: Playing dirty , 2005, Nature.

[12]  G. Parmigiani,et al.  The Consensus Coding Sequences of Human Breast and Colorectal Cancers , 2006, Science.

[13]  Paul D. P. Pharoah,et al.  p53 polymorphisms: cancer implications , 2009, Nature Reviews Cancer.

[14]  Stefan Knapp,et al.  Activation segment dimerization: a mechanism for kinase autophosphorylation of non-consensus sites , 2008, The EMBO journal.

[15]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[16]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[17]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[18]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[19]  Mindy I. Davis,et al.  A quantitative analysis of kinase inhibitor selectivity , 2008, Nature Biotechnology.