Large-scale integrative network-based analysis identifies common pathways disrupted by copy number alterations across cancers

BackgroundMany large-scale studies analyzed high-throughput genomic data to identify altered pathways essential to the development and progression of specific types of cancer. However, no previous study has been extended to provide a comprehensive analysis of pathways disrupted by copy number alterations across different human cancers. Towards this goal, we propose a network-based method to integrate copy number alteration data with human protein-protein interaction networks and pathway databases to identify pathways that are commonly disrupted in many different types of cancer.ResultsWe applied our approach to a data set of 2,172 cancer patients across 16 different types of cancers, and discovered a set of commonly disrupted pathways, which are likely essential for tumor formation in majority of the cancers. We also identified pathways that are only disrupted in specific cancer types, providing molecular markers for different human cancers. Analysis with independent microarray gene expression datasets confirms that the commonly disrupted pathways can be used to identify patient subgroups with significantly different survival outcomes. We also provide a network view of disrupted pathways to explain how copy number alterations affect pathways that regulate cell growth, cycle, and differentiation for tumorigenesis.ConclusionsIn this work, we demonstrated that the network-based integrative analysis can help to identify pathways disrupted by copy number alterations across 16 types of human cancers, which are not readily identifiable by conventional overrepresentation-based and other pathway-based methods. All the results and source code are available at http://compbio.cs.umn.edu/NetPathID/.

[1]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[2]  T. Jacks,et al.  The cell cycle and cancer. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[3]  C. Sherr,et al.  Tumor surveillance via the ARF-p53 pathway. , 1998, Genes & development.

[4]  P. Donahoe,et al.  The type I serine/threonine kinase receptor ActRIA (ALK2) is required for gastrulation of the mouse embryo. , 1999, Development.

[5]  D. Hanahan,et al.  The Hallmarks of Cancer , 2000, Cell.

[6]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[7]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[8]  D. Albertson,et al.  Chromosome aberrations in solid tumors , 2003, Nature Genetics.

[9]  S. Elledge,et al.  Multiple Tumor Suppressor Pathways Negatively Regulate Telomerase , 2003, Cell.

[10]  M. A. Goldman The role of telomeres and telomerase in cancer. , 2003, Drug discovery today.

[11]  K. Azuma,et al.  Ran, a Small GTPase Gene, Encodes Cytotoxic T Lymphocyte (CTL) Epitopes Capable of Inducing HLA-A33–restricted and Tumor-Reactive CTLs in Cancer Patients , 2004, Clinical Cancer Research.

[12]  A. Danchin,et al.  Bmc Genomics , 2004 .

[13]  Roded Sharan,et al.  PathBLAST: a tool for alignment of protein interaction networks , 2004, Nucleic Acids Res..

[14]  David Martin,et al.  GOToolBox: functional analysis of gene datasets based on Gene Ontology , 2004, Genome Biology.

[15]  Jiangqin Zhao,et al.  MDM2 negatively regulates the human telomerase RNA gene promoter , 2005, BMC Cancer.

[16]  Cunji Gao,et al.  Mechanisms of PECAM-1-mediated cytoprotection and implications for cancer cell survival , 2005, Leukemia & lymphoma.

[17]  Christian V. Forst,et al.  Differential network expression during drug and stress response , 2005, Bioinform..

[18]  Pankaj Agarwal,et al.  Inferring pathways from gene lists using a literature-derived network of biological relationships , 2005, Bioinform..

[19]  J. Foekens,et al.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer , 2005, The Lancet.

[20]  D. Loukinov,et al.  CTCF binds the proximal exonic region of hTERT and inhibits its transcription , 2005, Nucleic acids research.

[21]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[23]  E. Lander,et al.  Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma , 2007, Proceedings of the National Academy of Sciences.

[24]  A. Sparks,et al.  The Genomic Landscapes of Human Breast and Colorectal Cancers , 2007, Science.

[25]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[26]  R. Tothill,et al.  Novel Molecular Subtypes of Serous and Endometrioid Ovarian Cancer Linked to Clinical Outcome , 2008, Clinical Cancer Research.

[27]  D. Busam,et al.  An Integrated Genomic Analysis of Human Glioblastoma Multiforme , 2008, Science.

[28]  David Warde-Farley,et al.  GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function , 2008, Genome Biology.

[29]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[30]  J. Massagué,et al.  TGFβ in Cancer , 2008, Cell.

[31]  Igor Jurisica,et al.  Gene expression–based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study , 2008, Nature Medicine.

[32]  Michael Q. Zhang,et al.  Network-based global inference of human disease genes , 2008, Molecular systems biology.

[33]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[34]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[35]  Vipin Kumar,et al.  Robust and efficient identification of biomarkers by classifying features on graphs , 2008, Bioinform..

[36]  Doheon Lee,et al.  Inferring Pathway Activity toward Precise Disease Classification , 2008, PLoS Comput. Biol..

[37]  R. Bernards,et al.  Enabling personalized cancer medicine through analysis of gene-expression patterns , 2008, Nature.

[38]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[39]  Ron Shamir,et al.  Identifying functional modules using expression profiles and confidence-scored protein interactions , 2009, Bioinform..

[40]  R. Toillon,et al.  TrkA overexpression enhances growth and metastasis of breast cancer cells , 2009, Oncogene.

[41]  A. Shlien,et al.  Copy number variations and cancer , 2009, Genome Medicine.

[42]  TaeHyun Hwang,et al.  A hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge , 2009, Bioinform..

[43]  D. Jacqmin,et al.  The sonic hedgehog signaling pathway is reactivated in human renal cell carcinoma and plays orchestral role in tumor growth , 2009, Molecular Cancer.

[44]  P. Donnelly,et al.  Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region , 2010 .

[45]  Jing Zhu,et al.  Edge-based scoring and searching method for identifying condition-responsive protein-protein interaction sub-network , 2007, Bioinform..

[46]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[47]  A. Sparks,et al.  The mutation spectrum revealed by paired genome sequences from a lung cancer patient , 2010, Nature.

[48]  Joel Dudley,et al.  Network-Based Elucidation of Human Disease Similarities Reveals Common Functional Modules Enriched for Pluripotent Drug Targets , 2010, PLoS Comput. Biol..

[49]  Derek Y. Chiang,et al.  The landscape of somatic copy-number alteration across human cancers , 2010, Nature.

[50]  TaeHyun Hwang,et al.  A Heterogeneous Label Propagation Algorithm for Disease Gene Discovery , 2010, SDM.

[51]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[52]  Takuro Nakamura,et al.  Trib1 links the MEK1/ERK pathway in myeloid leukemogenesis. , 2010, Blood.

[53]  Yong Liu,et al.  Vascular endothelial platelet endothelial cell adhesion molecule 1 (PECAM-1) regulates advanced metastatic progression , 2010, Proceedings of the National Academy of Sciences.

[54]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[55]  Abhijit Bhat,et al.  Targeting the ANGPT–TIE2 pathway in malignancy , 2010, Nature Reviews Cancer.

[56]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[57]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.

[58]  TaeHyun Hwang,et al.  Inferring disease and gene set associations with rank coherence in networks , 2011, Bioinform..

[59]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[60]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[61]  D. Liggitt,et al.  Vascular endothelial platelet endothelial cell adhesion molecule 1 (PECAM-1) regulates advanced metastatic progression (Proceedings of the National Academy Sciences of the United States of America (2010) 107, 43 (18616-18621)) , 2011 .

[62]  Zev A. Binder,et al.  The Genetic Landscape of the Childhood Cancer Medulloblastoma , 2011, Science.

[63]  A. McKenna,et al.  The Mutational Landscape of Head and Neck Squamous Cell Carcinoma , 2011, Science.

[64]  A. Sonabend,et al.  Inhibition of Sonic Hedgehog and Notch Pathways Enhances Sensitivity of CD133+ Glioma Stem Cells to Temozolomide Therapy , 2011, Molecular medicine.

[65]  J. Shay,et al.  Role of telomeres and telomerase in cancer. , 2011, Seminars in cancer biology.

[66]  Steven J. M. Jones,et al.  Comprehensive genomic characterization of squamous cell lung cancers , 2012, Nature.

[67]  A. Bitton,et al.  ASSOCIATION BETWEEN GENETIC VARIANTS IN THE HNF4A GENE AND CHILDHOOD-ONSET CROHN’S DISEASE , 2012, Genes and Immunity.

[68]  K. Stoeber,et al.  The cell cycle and cancer , 2012, The Journal of pathology.

[69]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[70]  Steven J. M. Jones,et al.  Integrated genomic characterization of endometrial carcinoma , 2013, Nature.