Determination of minimal transcriptional signatures of compounds for target prediction

The identification of molecular target and mechanism of action of compounds is a key hurdle in drug discovery. Multiplexed techniques for bead-based expression profiling allow the measurement of transcriptional signatures of compound-treated cells in high-throughput mode. Such profiles can be used to gain insight into compounds' mode of action and the protein targets they are modulating. Through the proxy of target prediction from such gene signatures we explored important aspects of the use of transcriptional profiles to capture biological variability of perturbed cellular assays. We found that signatures derived from expression data and signatures derived from biological interaction networks performed equally well, and we showed that gene signatures can be optimised using a genetic algorithm. Gene signatures of approximately 128 genes seemed to be most generic, capturing a maximum of the perturbation inflicted on cells through compound treatment. Moreover, we found evidence for oxidative phosphorylation to be one of the most general ways to capture compound perturbation.

[1]  J. Collins,et al.  Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks , 2005, Nature Biotechnology.

[2]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[3]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[4]  Gary D Bader,et al.  The human genome and drug discovery after a decade. Roads (still) not taken , 2011, 1102.0448.

[5]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[6]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[7]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[8]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[9]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[10]  Dorothea Emig,et al.  Measuring and analyzing tissue specificity of human genes and protein complexes , 2011, EURASIP J. Bioinform. Syst. Biol..

[11]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[12]  Angelo D. Favia,et al.  Protein promiscuity and its implications for biotechnology , 2009, Nature Biotechnology.

[13]  Peer Bork,et al.  Drug-Induced Regulation of Target Expression , 2010, PLoS Comput. Biol..

[14]  Dragos Horvath,et al.  Predicting ADME properties and side effects: the BioPrint approach. , 2003, Current opinion in drug discovery & development.

[15]  M. Orešič,et al.  Pathways to the analysis of microarray data. , 2005, Trends in biotechnology.

[16]  R. Tagliaferri,et al.  Discovery of drug mode of action and drug repositioning from transcriptional responses , 2010, Proceedings of the National Academy of Sciences.

[17]  Richard Morphy,et al.  From magic bullets to designed multiple ligands. , 2004, Drug discovery today.

[18]  J. Weinstein,et al.  Pharmacogenomic analysis: correlating molecular substructure classes with microarray gene expression data , 2002, The Pharmacogenomics Journal.

[19]  Douglas W. Selinger,et al.  Pathway Reporter Assays Reveal Small Molecule Mechanisms of Action , 2009 .

[20]  Andreas Bender,et al.  Ligand-Target Prediction Using Winnow and Naive Bayesian Algorithms and the Implications of Overall Performance Statistics , 2008, J. Chem. Inf. Model..

[21]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[22]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[23]  Emanuel F Petricoin,et al.  Causal reasoning identifies mechanisms of sensitivity for a novel AKT kinase inhibitor, GSK690693 , 2010, BMC Genomics.