Empowering biologists with multi-omics data: colorectal cancer as a paradigm

Motivation: Recent completion of the global proteomic characterization of The Cancer Genome Atlas (TCGA) colorectal cancer (CRC) cohort resulted in the first tumor dataset with complete molecular measurements at DNA, RNA and protein levels. Using CRC as a paradigm, we describe the application of the NetGestalt framework to provide easy access and interpretation of multi-omics data. Results: The NetGestalt CRC portal includes genomic, epigenomic, transcriptomic, proteomic and clinical data for the TCGA CRC cohort, data from other CRC tumor cohorts and cell lines, and existing knowledge on pathways and networks, giving a total of more than 17 million data points. The portal provides features for data query, upload, visualization and integration. These features can be flexibly combined to serve various needs of the users, maximizing the synergy among omics data, human visualization and quantitative analysis. Using three case studies, we demonstrate that the portal not only provides user-friendly data query and visualization but also enables efficient data integration within a single omics data type, across multiple omics data types, and over biological networks. Availability and implementation: The NetGestalt CRC portal can be freely accessed at http://www.netgestalt.org. Contact: bing.zhang@vanderbilt.edu Supplementary Information: Supplementary data are available at Bioinformatics online.

[1]  Manuela Gariboldi,et al.  Integrative approach for prioritizing cancer genes in sporadic colon cancer , 2009, Genes, chromosomes & cancer.

[2]  P. Karp,et al.  Computational prediction of human metabolic pathways from the complete human genome , 2004, Genome Biology.

[3]  S. Baylin,et al.  Promoter methylation of genes in and around the candidate lung cancer susceptibility locus 6q23-25. , 2008, Cancer research.

[4]  David Haussler,et al.  The UCSC Interaction Browser: multidimensional data views in pathway context , 2013, Nucleic Acids Res..

[5]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[6]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[7]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[8]  Tetsuo Ito,et al.  Hypermethylation of the AKAP12 Promoter is a Biomarker of Barrett's-Associated Esophageal Neoplastic Progression , 2008, Cancer Epidemiology Biomarkers & Prevention.

[9]  Matthew A. Hibbs,et al.  Visualization of omics data for systems biology , 2010, Nature Methods.

[10]  Leonhard Müllauer,et al.  Genome-wide transcriptional response to 5-aza-2'-deoxycytidine and trichostatin a in multiple myeloma cells. , 2008, Cancer research.

[11]  Yuanfang Guan,et al.  Tissue-Specific Functional Networks for Prioritizing Phenotype and Disease Genes , 2012, PLoS Comput. Biol..

[12]  Bing Zhang,et al.  NetGestalt: integrating multidimensional omics data over biological networks , 2013, Nature Methods.

[13]  T. Ørntoft,et al.  Metastasis-Associated Gene Expression Changes Predict Poor Outcomes in Patients with Dukes Stage B and C Colorectal Cancer , 2009, Clinical Cancer Research.

[14]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[15]  Georgios A. Pavlopoulos,et al.  Arena3D: visualization of biological networks in 3D , 2008, BMC Systems Biology.

[16]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[17]  Ting Wang,et al.  The UCSC Cancer Genomics Browser , 2009, Nature Methods.

[18]  Christian Pilarsky,et al.  An expression module of WIPF1-coexpressed genes identifies patients with favorable prognosis in three tumor types , 2009, Journal of Molecular Medicine.

[19]  Roded Sharan,et al.  Enhancing the Prioritization of Disease-Causing Genes through Tissue Specific Protein Interaction Networks , 2012, PLoS Comput. Biol..

[20]  Tetsuo Ito,et al.  A genome-wide search identifies epigenetic silencing of somatostatin, tachykinin-1, and 5 other genes in colon cancer. , 2006, Gastroenterology.

[21]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of human colon and rectal cancer , 2012, Nature.

[22]  T. Yeatman,et al.  Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer. , 2010, Gastroenterology.

[23]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Tae-You Kim,et al.  AKAP12/Gravin is inactivated by epigenetic mechanism in human gastric carcinoma and shows growth suppressor activity , 2004, Oncogene.

[25]  I. Dubchak,et al.  Visualizing genomes: techniques and challenges , 2010, Nature Methods.

[26]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[27]  Bing Su,et al.  Quantitative assessment of AKAP12 promoter methylation in colorectal cancer using methylation-sensitive high resolution melting: Correlation with Duke's stage , 2010, Cancer biology & therapy.

[28]  P. Massion,et al.  Integrative Genomics Analysis Identifies Candidate Drivers at 3q26-29 Amplicon in Squamous Cell Carcinoma of the Lung , 2013, Clinical Cancer Research.

[29]  J. Mesirov,et al.  Systematic investigation of genetic vulnerabilities across cancer cell lines reveals lineage-specific dependencies in ovarian cancer , 2011, Proceedings of the National Academy of Sciences.

[30]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[31]  Jeffrey R. Whiteaker,et al.  Proteogenomic characterization of human colon and rectal cancer , 2014, Nature.

[32]  M. Tyers,et al.  Osprey: a network visualization system , 2003, Genome Biology.

[33]  Tamara Munzner,et al.  Cerebral: Visualizing Multiple Experimental Conditions on a Graph with Biological Context , 2008, IEEE Transactions on Visualization and Computer Graphics.

[34]  Ian M. Donaldson,et al.  iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence , 2010, Database J. Biol. Databases Curation.

[35]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[36]  C. Niemeyer,et al.  AKAP12, a gene with tumour suppressor properties, is a target of promoter DNA methylation in childhood myeloid malignancies , 2007, British journal of haematology.

[37]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[38]  Yan Wang,et al.  VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology , 2009, Nucleic Acids Res..