iGPSe: A visual analytic system for integrative genomic based cancer patient stratification

Background: Cancers are highly heterogeneous and comprise different subtypes. These subtypes often carry different genetic variants, present different pathological phenotypes and, most importantly, show different clinical outcomes, including prognosis, response to treatment, and likelihood of recurrence and metastasis. Integrative genomics (or panomics) approaches have recently been widely adopted to combine multiple types of omics data and identify integrative biomarkers that stratify patients into groups with different clinical outcomes.

Results: In this paper we present a visual analytic system called Interactive Genomics Patient Stratification explorer (iGPSe), which significantly reduces the computing burden for biomedical researchers exploring complicated integrative genomics data. Our system integrates unsupervised clustering with graph and parallel sets visualization and allows direct comparison of clinical outcomes via survival analysis. Using a breast cancer dataset obtained from The Cancer Genome Atlas (TCGA) project, we are able to quickly explore different combinations of gene expression (mRNA) and microRNA features and identify potential combined markers for survival prediction.

Conclusions: Visualization plays an important role in stratifying a given patient population. Visual tools allowed the selection of possible features across various datasets for that population. We essentially made a case for visualization in addressing a very important problem in translational informatics.
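The workflow described in the Results (cluster patients on each data type, combine the cluster assignments, then compare survival between the resulting groups) can be illustrated with a minimal sketch. This is not the iGPSe implementation. It assumes the cluster labels have already been computed by some unsupervised method, uses the Kaplan-Meier product-limit estimator as one common choice for the survival step, and runs on toy patient identifiers and follow-up values that are entirely hypothetical. The function names `intersect_groups` and `kaplan_meier` are illustrative only.

```python
from collections import defaultdict


def intersect_groups(mrna_labels, mirna_labels):
    """Group patients by their (mRNA cluster, miRNA cluster) pair.

    Only patients present in both label dictionaries are kept.
    """
    groups = defaultdict(list)
    for patient, mrna_cluster in mrna_labels.items():
        if patient in mirna_labels:
            groups[(mrna_cluster, mirna_labels[patient])].append(patient)
    return dict(groups)


def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times  : follow-up time per patient
    events : 1 if the event (e.g. death) was observed, 0 if censored
    Returns a list of (time, survival probability) at each event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        observed = 0  # events at time t
        leaving = 0   # patients leaving the risk set at time t
        while i < len(data) and data[i][0] == t:
            observed += data[i][1]
            leaving += 1
            i += 1
        if observed:
            survival *= 1.0 - observed / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Hypothetical toy data: cluster labels per data type and
# (follow-up time, event flag) per patient.
mrna = {"p1": 0, "p2": 0, "p3": 0, "p4": 1, "p5": 1, "p6": 1}
mirna = {"p1": 0, "p2": 0, "p3": 1, "p4": 1, "p5": 1, "p6": 1}
follow_up = {
    "p1": (5, 1), "p2": (8, 0), "p3": (3, 1),
    "p4": (2, 1), "p5": (4, 1), "p6": (6, 0),
}

for pair, patients in sorted(intersect_groups(mrna, mirna).items()):
    times = [follow_up[p][0] for p in patients]
    events = [follow_up[p][1] for p in patients]
    print(pair, patients, kaplan_meier(times, events))
```

In an interactive setting, the combined groups correspond to the ribbons of a parallel sets view, and the per-group curves are what a user would compare when judging whether a combination of mRNA and microRNA features separates patients by outcome. A formal comparison would additionally need a significance test such as the log-rank test, which this sketch omits.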
