Identifying Network Perturbation in Cancer

We present a computational framework, called DISCERN (DIfferential SparsE Regulatory Network), to identify informative topological changes in gene-regulator dependence networks inferred from mRNA expression datasets collected in distinct biological states. DISCERN takes two expression datasets as input: one from diseased tissues of patients with a disease of interest and one from matching normal tissues. For each gene, DISCERN estimates the extent to which the gene is perturbed, that is, the extent to which its regulator connectivity in the inferred gene-regulator dependencies differs between the disease and normal conditions. This approach has two advantages over existing methods. First, DISCERN infers conditional dependencies between candidate regulators and genes; conditional dependence distinguishes direct from indirect interactions more precisely than pairwise correlation does. Second, DISCERN uses a new likelihood-based scoring function to alleviate concerns about the accuracy of the specific edges inferred in any particular network. On synthetic data, DISCERN identifies perturbed genes more accurately than existing methods for detecting perturbed genes between distinct states. In expression datasets from patients with acute myeloid leukemia (AML), breast cancer and lung cancer, genes with high DISCERN scores in each cancer are enriched for known tumor drivers, for genes associated with biological processes known to be important in that disease, and for genes associated with patient prognosis in that cancer. Finally, using epigenomic data available from the ENCODE project, we show that DISCERN can uncover potential mechanisms underlying network perturbation: it explains observed epigenomic activity patterns in cancer and normal tissue types more accurately than alternative methods.
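
The idea of scoring a gene by how poorly its regulator dependencies transfer between conditions can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so everything below is an assumption made for illustration and not the authors' implementation: a lasso regression stands in for the sparse conditional-dependence model, a ratio of cross-condition to within-condition squared error stands in for the likelihood-based score (under a Gaussian model, squared error is proportional to the negative log-likelihood), and all function names, the penalty `lam`, and the synthetic data are hypothetical. Expression values are assumed to be standardized, so no intercept is fitted.

```python
import random

def soft_threshold(z, t):
    """Shrink z toward zero by t (the lasso proximal step)."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0

def lasso_fit(X, y, lam, n_iter=100):
    """Coordinate descent for (1/2n)*||y - Xw||^2 + lam*||w||_1."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    resid = list(y)  # residual y - Xw; equals y while w is all zeros
    col_sq = [sum(row[j] ** 2 for row in X) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of regulator j with the residual that excludes j.
            rho = sum(X[i][j] * (resid[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_w = soft_threshold(rho, lam) / col_sq[j]
            delta = new_w - w[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                w[j] = new_w
    return w

def mse(X, y, w):
    """Mean squared prediction error of the linear model w on (X, y)."""
    n = len(X)
    return sum((y[i] - sum(a * b for a, b in zip(X[i], w))) ** 2 for i in range(n)) / n

def perturbation_score(X_dis, y_dis, X_norm, y_norm, lam=0.05):
    """Hypothetical score: cross-condition error over within-condition error."""
    w_dis = lasso_fit(X_dis, y_dis, lam)
    w_norm = lasso_fit(X_norm, y_norm, lam)
    cross = mse(X_dis, y_dis, w_norm) + mse(X_norm, y_norm, w_dis)
    own = mse(X_dis, y_dis, w_dis) + mse(X_norm, y_norm, w_norm)
    return cross / own

# Synthetic data (assumed): 3 candidate regulators, 100 samples per condition.
rng = random.Random(0)

def regulators(n, p=3):
    return [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]

def gene(X, active):
    """Gene expression driven by one regulator plus small Gaussian noise."""
    return [row[active] + rng.gauss(0, 0.1) for row in X]

X_norm, X_dis = regulators(100), regulators(100)
scores = {
    # gene_A keeps regulator 0 in both conditions (not perturbed).
    "gene_A": perturbation_score(X_dis, gene(X_dis, 0), X_norm, gene(X_norm, 0)),
    # gene_B switches from regulator 0 (normal) to regulator 1 (disease).
    "gene_B": perturbation_score(X_dis, gene(X_dis, 1), X_norm, gene(X_norm, 0)),
}
print(scores)
```

In this toy setting the rewired gene_B should score far higher than gene_A, whose score should stay close to 1. Because the score compares prediction errors rather than the fitted edge sets directly, it does not depend on any single coefficient being exactly zero or non-zero, which is one way to read the abstract's point about edge-level accuracy.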
