Expression2Kinases: mRNA profiling linked to multiple upstream regulatory layers

MOTIVATION Genome-wide mRNA profiling provides a snapshot of the global state of cells under different conditions. However, mRNA levels do not provide direct understanding of upstream regulatory mechanisms. Here, we present a new approach called Expression2Kinases (X2K) to identify upstream regulators likely responsible for observed patterns in genome-wide gene expression. By integrating chromatin immuno-precipitation (ChIP)-seq/chip and position weight matrices (PWMs) data, protein-protein interactions and kinase-substrate phosphorylation reactions, we can better identify regulatory mechanisms upstream of genome-wide differences in gene expression. We validated X2K by applying it to recover drug targets of food and drug administration (FDA)-approved drugs from drug perturbations followed by mRNA expression profiling; to map the regulatory landscape of 44 stem cells and their differentiating progeny; to profile upstream regulatory mechanisms of 327 breast cancer tumors; and to detect pathways from profiled hepatic stellate cells and hippocampal neurons. The X2K approach can advance our understanding of cell signaling and unravel drugs mechanisms of action. AVAILABILITY The software and source code are freely available at: http://www.maayanlab.net/X2K. CONTACT avi.maayan@mssm.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Alexei A. Sharov,et al.  Defining Developmental Potency and Cell Lineage Trajectories by Expression Profiling of Differentiating Mouse Embryonic Stem Cells , 2008, DNA research : an international journal for rapid publication of reports on genes and genomes.

[2]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[3]  Peer Bork,et al.  Drug-Induced Regulation of Target Expression , 2010, PLoS Comput. Biol..

[4]  S. Friedman Hepatic stellate cells: protean, multifunctional, and enigmatic cells of the liver. , 2008, Physiological reviews.

[5]  A. Means,et al.  Calcium/calmodulin-dependent protein kinase types II and IV differentially regulate CREB-dependent gene expression , 1994, Molecular and cellular biology.

[6]  Gordon K Smyth,et al.  Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2004, Statistical applications in genetics and molecular biology.

[7]  David J. Arenillas,et al.  JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles , 2009, Nucleic Acids Res..

[8]  H. Bading,et al.  Coupling of extrasynaptic NMDA receptors to a CREB shut-off pathway is developmentally regulated. , 2002, Biochimica et biophysica acta.

[9]  Alexander E. Kel,et al.  TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes , 2005, Nucleic Acids Res..

[10]  Avi Ma'ayan,et al.  SNAVI: Desktop application for analysis and visualization of large-scale signaling networks , 2009, BMC Systems Biology.

[11]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[12]  Avi Ma'ayan,et al.  Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases , 2007, BMC Bioinformatics.

[13]  Guido Sanguinetti,et al.  Large-scale learning of combinatorial transcriptional dynamics from gene expression , 2011, Bioinform..

[14]  David A. Brenner,et al.  The Role of Focal Adhesion Kinase-Phosphatidylinositol 3-Kinase-Akt Signaling in Hepatic Stellate Cell Proliferation and Type I Collagen Expression* , 2003, The Journal of Biological Chemistry.

[15]  M. Wilson,et al.  An Important Role of Neural Activity-Dependent CaMKIV Signaling in the Consolidation of Long-Term Memory , 2001, Cell.

[16]  M. Buck,et al.  A Ribosomal S-6 Kinase–Mediated Signal to C/EBP-β Is Critical for the Development of Liver Fibrosis , 2007, PloS one.

[17]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[18]  H. Bading,et al.  Control of Recruitment and Transcription-Activating Function of CBP Determines Gene Regulation by NMDA Receptors and L-Type Calcium Channels , 1999, Neuron.

[19]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[20]  J. Kornhauser,et al.  PhosphoSite: A bioinformatics resource dedicated to physiological protein phosphorylation , 2004, Proteomics.

[21]  C. Ledent,et al.  CB1 cannabinoid receptor antagonism: a new strategy for the treatment of liver fibrosis , 2006, Nature Medicine.

[22]  D. Schuppan,et al.  Differential modulation of rat hepatic stellate phenotype by natural and synthetic retinoids , 2004, Hepatology.

[23]  Nir Friedman,et al.  Physical Module Networks: an integrative approach for reconstructing transcription regulation , 2011, Bioinform..

[24]  Giorgio Valle,et al.  The Gene Ontology in 2010: extensions and refinements , 2009, Nucleic Acids Res..

[25]  M. Shapira,et al.  Transcriptome Profiling Reveals TGF-β Signaling Involvement in Epileptogenesis , 2009, The Journal of Neuroscience.

[26]  Y. Okazaki,et al.  Novel homeodomain‐interacting protein kinase family member, HIPK4, phosphorylates human p53 at serine 9 , 2007, FEBS letters.

[27]  F. Anania,et al.  Leptin as a novel profibrogenic cytokine in hepatic stellate cells: mitogenesis and inhibition of apoptosis mediated by extracellular regulated kinase (Erk) and Akt phosphorylation , 2004, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[28]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[29]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[30]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[31]  Nikolaj Blom,et al.  Phospho.ELM: A database of experimentally verified phosphorylation sites in eukaryotic proteins , 2004, BMC Bioinformatics.

[32]  H. Tsukamoto,et al.  Adipogenic Transcriptional Regulation of Hepatic Stellate Cells* , 2005, Journal of Biological Chemistry.

[33]  Avi Ma'ayan,et al.  GATE: software for the analysis and visualization of high-dimensional time series expression data , 2009, Bioinform..

[34]  A. Huang,et al.  Correlation of microarray-based breast cancer molecular subtypes and clinical outcomes: implications for treatment optimization , 2011, BMC Cancer.

[35]  H. Tsukamoto,et al.  Wnt antagonism inhibits hepatic stellate cell activation and liver fibrosis. , 2008, American journal of physiology. Gastrointestinal and liver physiology.

[36]  H. Enslen,et al.  Differential activation of CREB by Ca2+/calmodulin-dependent protein kinases type II and type IV involves phosphorylation of a site that negatively regulates activity. , 1994, Genes & development.

[37]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[39]  L. Wakefield,et al.  Reduction in Smad2/3 signaling enhances tumorigenesis but suppresses metastasis of breast cancer cell lines. , 2003, Cancer research.

[40]  Wei He,et al.  Breast cancer bone metastasis mediated by the Smad tumor suppressor pathway. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Tony Pawson,et al.  NetworKIN: a resource for exploring cellular phosphorylation networks , 2007, Nucleic Acids Res..

[42]  R. Schwabe,et al.  Gene expression profiles during hepatic stellate cell activation in culture and in vivo. , 2007, Gastroenterology.

[43]  Avi Ma'ayan,et al.  KEA: kinase enrichment analysis , 2009, Bioinform..

[44]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[45]  Jill P. Mesirov,et al.  GSEA-P: a desktop application for Gene Set Enrichment Analysis , 2007, Bioinform..

[46]  F. Raaphorst Deregulated expression of Polycomb-group oncogenes in human malignant lymphomas and epithelial tumors. , 2005, Human molecular genetics.

[47]  Avi Ma'ayan,et al.  ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments , 2010, Bioinform..

[48]  Hilmar Bading,et al.  Decoding NMDA Receptor Signaling: Identification of Genomic Programs Specifying Neuronal Survival and Death , 2007, Neuron.

[49]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[50]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[51]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[52]  Peng Qiu,et al.  Discovering Biological Progression Underlying Microarray Samples , 2011, PLoS Comput. Biol..