Network-based inference of protein activity helps functionalize the genetic landscape of cancer

Identifying the multiple dysregulated oncoproteins that contribute to tumorigenesis in a given patient is crucial for developing personalized treatment plans. However, accurate inference of aberrant protein activity in biological samples remains challenging because genetic alterations are only partially predictive and direct measurements of protein activity are generally not feasible. To address this problem, we introduce and experimentally validate a new algorithm, virtual inference of protein activity by enriched regulon analysis (VIPER), for accurate assessment of protein activity from gene expression data. We used VIPER to evaluate the functional relevance of genetic alterations in regulatory proteins across all samples in The Cancer Genome Atlas (TCGA). In addition to accurately inferring aberrant protein activity induced by established mutations, we identified a fraction of tumors with aberrant activity of druggable oncoproteins despite a lack of mutations, and vice versa. In vitro assays confirmed that VIPER-inferred protein activity outperformed mutational analysis in predicting sensitivity to targeted inhibitors.
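The general idea of scoring a regulator's activity from the expression of its targets can be illustrated with a deliberately simplified sketch. This is not the VIPER algorithm itself, whose statistical details are not given in the abstract; it swaps in a Stouffer-style combined z-score. The function name `regulon_activity`, the gene labels, and the toy values are all hypothetical, and the sketch assumes a per-gene z-score signature plus a regulon whose targets are marked as activated (+1) or repressed (-1).

```python
import math


def regulon_activity(signature, regulon):
    """Illustrative activity score (assumption, not VIPER's actual statistic).

    signature: dict mapping gene -> differential expression z-score.
    regulon:   dict mapping target gene -> mode of regulation
               (+1 if activated by the regulator, -1 if repressed).
    Returns a Stouffer-style combined z-score over the regulon's targets.
    """
    # Flip the sign of repressed targets so that consistent regulation
    # always contributes positively to the score.
    scores = [mode * signature[g] for g, mode in regulon.items() if g in signature]
    if not scores:
        return 0.0  # no measured targets: no evidence either way
    # Sum of n independent standard normals, scaled back to unit variance.
    return sum(scores) / math.sqrt(len(scores))


# Hypothetical toy data: G5 is a target with no measurement and is skipped.
signature = {"G1": 2.0, "G2": 1.0, "G3": -1.5, "G4": 0.5}
regulon = {"G1": 1, "G2": 1, "G3": -1, "G5": 1}
activity = regulon_activity(signature, regulon)
print(f"inferred activity score: {activity:.3f}")
```

A positive score suggests that activated targets are up and repressed targets are down, which is consistent with increased regulator activity; a negative score suggests the opposite. A realistic method would also need to account for target confidence and correlation between targets, which this sketch ignores.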
