ChemGPS‐NP Mapping of Chemical Compounds for Prediction of Anticancer Mode of Action

Chemical space is basically infinite, and comprises all molecules that could possibly exist. Intelligent ways to efficiently navigate through chemical space and to select promising compounds in drug discovery are important tasks, and the focus of this thesis. In this work a new model for chemical space navigation, ChemGPS-NP, was developed. This model is based on a methodology where a global chemical space map is defined through principal component analysis of physico-chemical properties of a reference set of compounds. Through interpolation from the reference set, positions of novel compounds can be defined on this map and interpreted as chemical properties. ChemGPS-NP was demonstrated to be able to chart the entire biologically relevant chemical space, including both drug-like and natural compounds. This is an important improvement considering the present interest in natural products (NPs) in the pharmaceutical industry, as well as the track record of NPs to serve as basis for more than 50% of all marketed drugs. ChemGPS-NP proved able to handle and process large data sets, to aid in efficient selection of test objects, and to extract useful information from the results of high-throughput screening campaigns. Using ChemGPS-NP, it was shown that NPs occupy unique regions of chemical property space in comparison to drug-like compounds, and a number of features distinguishing NPs from medicinal chemistry compounds were identified. ChemGPS-NP was also shown to be able to reliably predict mode of action of anticancer agents based on chemical structure, a finding that has potential to improve cancer research efficiency. Applying a property based similarity search based on calculated eight dimensional Euclidean distances from ChemGPS-NP rendered a tool to identify NP inspired potential leads for drug discovery. Furthermore, ChemGPS-NPWeb, an online version of ChemGPS-NP, was developed, which provides scientists with open access to the tool via http://chemgps.bmc.uu.se/.

[1]  Erik Johansson,et al.  Unlocking interpretation in near infrared multivariate calibrations by orthogonal partial least squares. , 2009, Analytical chemistry.

[2]  E. Johansson,et al.  On the impact of uncorrelated variation in regression mathematics , 2008 .

[3]  U. Edlund,et al.  Visualization of GC/TOF-MS-based metabolomics data for identification of biochemically interesting compounds using OPLS class models. , 2008, Analytical chemistry.

[4]  Sorel Muresan,et al.  ChemGPS-NP: tuned for navigation in biologically relevant chemical space. , 2006, Journal of natural products.

[5]  M. Rantalainen,et al.  OPLS discriminant analysis: combining the strengths of PLS‐DA and SIMCA classification , 2006 .

[6]  A. Isaksson,et al.  Screening of an annotated compound library for drug activity in a resistant myeloma cell line , 2006, Cancer Chemotherapy and Pharmacology.

[7]  A. Tolcher,et al.  Development of novel alkylating drugs as anticancer agents. , 2004, Current opinion in investigational drugs.

[8]  D. Condorelli,et al.  Structure-based rationalization of antitumor drugs mechanism of action by a MIF approach. , 2004, European journal of medicinal chemistry.

[9]  John J McGuire,et al.  Anticancer antifolates: current status and future directions. , 2003, Current pharmaceutical design.

[10]  A. Escargueil,et al.  Catalytic topoisomerase II inhibitors in cancer therapy. , 2003, Pharmacology & therapeutics.

[11]  E. Sausville,et al.  Issues and progress with protein kinase inhibitors for cancer treatment , 2003, Nature Reviews Drug Discovery.

[12]  L. Gundersen,et al.  Cytotoxic activity of 6-alkynyl- and 6-alkenylpurines. , 2003, Bioorganic & medicinal chemistry letters.

[13]  S. Wold,et al.  Orthogonal projections to latent structures (O‐PLS) , 2002 .

[14]  R. Martino,et al.  The prodrugs of 5-fluorouracil. , 2002, Current medicinal chemistry. Anti-cancer agents.

[15]  M. Braña,et al.  Intercalators as anticancer drugs. , 2001, Current pharmaceutical design.

[16]  Giuseppe Musumarra,et al.  A multivariate insight into the in vitro antitumour screen database of the National Cancer Institute: classification of compounds, similarities among cell lines and the influence of molecular targets , 2001, J. Comput. Aided Mol. Des..

[17]  Tudor I. Oprea,et al.  Chemography: the Art of Navigating in Chemical Space , 2000 .

[18]  G. S. Johnson,et al.  An Information-Intensive Approach to the Molecular Pharmacology of Cancer , 1997, Science.

[19]  K. Nilsson,et al.  Anti-cancer drug characterisation using a human cell line panel representing defined types of drug resistance. , 1996, British Journal of Cancer.

[20]  M. S. Khots,et al.  D-optimal designs , 1995 .

[21]  M. Boyd,et al.  Some practical considerations and applications of the national cancer institute in vitro anticancer drug discovery screen , 1995 .

[22]  J N Weinstein,et al.  Neural computing in cancer drug development: predicting mechanism of action. , 1992, Science.

[23]  D. Scudiero,et al.  Feasibility of a high-flux anticancer drug screen using a diverse panel of cultured human tumor cell lines. , 1991, Journal of the National Cancer Institute.

[24]  R. Larsson,et al.  Pharmacological modification of multi‐drug resistance (mdr) in vitro detected by a novel fluorometric microculture cytotoxicity assay. Reversal of resistance and selective cytotoxic actions of cyclosporin a and verapamil on mdr leukemia t‐cells , 1990, International journal of cancer.

[25]  D A Scudiero,et al.  Display and analysis of patterns of differential activity of drugs against human tumor cell lines: development of mean graph and COMPARE algorithm. , 1989, Journal of the National Cancer Institute.

[26]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[27]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[28]  D L Massart,et al.  Use of a microcomputer for the definition of multivariate confidence regions in medical diagnosis based on clinical laboratory profiles. , 1984, Computers and biomedical research, an international journal.

[29]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..