Pathway-based subnetworks enable cross-disease biomarker discovery

Biomarkers lie at the heart of precision medicine. Surprisingly, while rapid genomic profiling is becoming ubiquitous, the development of biomarkers usually involves the application of bespoke techniques that cannot be directly applied to other datasets. There is an urgent need for a systematic methodology to create biologically-interpretable molecular models that robustly predict key phenotypes. Here we present SIMMS (Subnetwork Integration for Multi-Modal Signatures): an algorithm that fragments pathways into functional modules and uses these to predict phenotypes. We apply SIMMS to multiple data types across five diseases, and in each it reproducibly identifies known and novel subtypes, and makes superior predictions to the best bespoke approaches. To demonstrate its ability on a new dataset, we profile 33 genes/nodes of the PI3K pathway in 1734 FFPE breast tumors and create a four-subnetwork prediction model. This model out-performs a clinically-validated molecular test in an independent cohort of 1742 patients. SIMMS is generic and enables systematic data integration for robust biomarker discovery.Accurate and actionable biomarkers that integrate diverse molecular, functional and clinical information hold great promise in precision medicine. Here, the authors develop SIMMS, a method for pathway-based cross-disease biomarker discovery.

[1]  J. Veltman,et al.  De novo mutations in human genetic disease , 2012, Nature Reviews Genetics.

[2]  Gur Pines,et al.  The ERBB network: at last, cancer therapy meets systems biology , 2012, Nature Reviews Cancer.

[3]  John M S Bartlett,et al.  Estrogen receptor and progesterone receptor as predictive biomarkers of response to endocrine therapy: a prospectively powered pathology study in the Tamoxifen and Exemestane Adjuvant Multinational trial. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[4]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[5]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[6]  Biao He,et al.  A practical molecular assay to predict survival in resected non-squamous, non-small-cell lung cancer: development and international validation studies , 2012, The Lancet.

[7]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of human colon and rectal cancer , 2012, Nature.

[8]  Tatsuhiko Tsunoda,et al.  High-Risk Ovarian Cancer Based on 126-Gene Expression Signature Is Uniquely Characterized by Downregulation of Antigen Presentation Pathway , 2012, Clinical Cancer Research.

[9]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[10]  Paul A. Bates,et al.  Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis , 2006, BMC Bioinformatics.

[11]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[12]  Jianxin Wang,et al.  BPG: Seamless, automated and interactive visualization of scientific data , 2017, BMC Bioinformatics.

[13]  Hein Putter,et al.  Adjuvant tamoxifen and exemestane in early breast cancer (TEAM): a randomised phase 3 trial , 2011, The Lancet.

[14]  Aleix Prat Aparicio Comprehensive molecular portraits of human breast tumours , 2012 .

[15]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[16]  Lincoln Stein,et al.  Reactome: a database of reactions, pathways and biological processes , 2010, Nucleic Acids Res..

[17]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[18]  Carlos Caldas,et al.  Patterns of Immune Infiltration in Breast Cancer and Their Clinical Implications: A Gene-Expression-Based Retrospective Study , 2016, PLoS medicine.

[19]  Chris Sander,et al.  Using MEMo to Discover Mutual Exclusivity Modules in Cancer , 2013, Current protocols in bioinformatics.

[20]  Muriel Médard,et al.  Network deconvolution as a general method to distinguish direct dependencies in networks , 2013, Nature Biotechnology.

[21]  Doheon Lee,et al.  Inferring Pathway Activity toward Precise Disease Classification , 2008, PLoS Comput. Biol..

[22]  W. Sauerbrei,et al.  Reporting recommendations for tumor marker prognostic studies (REMARK). , 2005, Journal of the National Cancer Institute.

[23]  Thomas D. Wu,et al.  Comprehensive genomic analysis of malignant pleural mesothelioma identifies recurrent mutations, gene fusions and splicing alterations , 2016, Nature Genetics.

[24]  J. Cuzick,et al.  Prognostic value of a combined estrogen receptor, progesterone receptor, Ki-67, and human epidermal growth factor receptor 2 immunohistochemical score and comparison with the Genomic Health recurrence score in early breast cancer. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[25]  Paul C. Boutros,et al.  NanoStringNorm: an extensible R package for the pre-processing of NanoString mRNA and miRNA data , 2012, Bioinform..

[26]  Ronan M. T. Fleming,et al.  A community-driven global reconstruction of human metabolism , 2013, Nature Biotechnology.

[27]  M. King,et al.  Genetic Heterogeneity in Human Disease , 2010, Cell.

[28]  Michael L. Gatza,et al.  A pathway-based classification of human breast cancer , 2010, Proceedings of the National Academy of Sciences.

[29]  M. Barnes,et al.  Analysis of gene expression in two large schizophrenia cohorts identifies multiple changes associated with nerve terminal function , 2009, Molecular Psychiatry.

[30]  J. R. Reeves,et al.  Expression of the HER1–4 family of receptor tyrosine kinases in breast cancer , 2003, The Journal of pathology.

[31]  Nikolina Radulovich,et al.  Prognostic gene-expression signature of carcinoma-associated fibroblasts in non-small cell lung cancer , 2011, Proceedings of the National Academy of Sciences.

[32]  Guanming Wu,et al.  A network module-based method for identifying cancer prognostic signatures , 2012, Genome Biology.

[33]  The Cancer Genome Atlas Research Network COMPREHENSIVE MOLECULAR CHARACTERIZATION OF CLEAR CELL RENAL CELL CARCINOMA , 2013, Nature.

[34]  John M S Bartlett,et al.  Biomarkers and patient selection for PI3K/Akt/mTOR targeted therapies: current status and future directions. , 2010, Clinical breast cancer.

[35]  Qing Wang,et al.  Towards precise classification of cancers based on robust gene functional expression profiles , 2005, BMC Bioinformatics.

[36]  Roded Sharan,et al.  Network-Based Integration of Disparate Omic Data To Identify "Silent Players" in Cancer , 2015, PLoS Comput. Biol..

[37]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[38]  Chris Sander,et al.  Emerging landscape of oncogenic signatures across human cancers , 2013, Nature Genetics.

[39]  Michal Sheffer,et al.  Pathway-based personalized analysis of cancer , 2013, Proceedings of the National Academy of Sciences.

[40]  Arno Lukas,et al.  Characterization of protein-interaction networks in tumors , 2007, BMC Bioinformatics.

[41]  Holger Fröhlich,et al.  Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics , 2013, PloS one.

[42]  Benjamin J. Raphael,et al.  The evolutionary history of 2,658 cancers , 2017, Nature.

[43]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Mira Ayadi,et al.  Gene Expression Classification of Colon Cancer into Molecular Subtypes: Characterization, Validation, and Prognostic Value , 2013, PLoS medicine.

[45]  F. Markowetz,et al.  The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups , 2012, Nature.

[46]  Baolin Wu,et al.  Network-based Survival Analysis Reveals Subnetwork Signatures for Predicting Outcomes of Ovarian Cancer Treatment , 2013, PLoS Comput. Biol..

[47]  John P A Ioannidis,et al.  Beyond genome-wide association studies: genetic heterogeneity and individual predisposition to cancer. , 2010, Trends in genetics : TIG.

[48]  Syed Haider,et al.  BPG: Seamless, Automated and Interactive Visualization of Scientific Data , 2017 .

[49]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[50]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[51]  David Warde-Farley,et al.  Dynamic modularity in protein interaction networks predicts breast cancer outcome , 2009, Nature Biotechnology.

[52]  Richard Simon,et al.  Gene expression-based prognostic signatures in lung cancer: ready for clinical use? , 2010, Journal of the National Cancer Institute.

[53]  R. Myers,et al.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data , 2005, Nucleic acids research.

[54]  John M S Bartlett,et al.  Validation of the IHC4 Breast Cancer Prognostic Algorithm Using Multiple Approaches on the Multinational TEAM Clinical Trial. , 2016, Archives of pathology & laboratory medicine.

[55]  Marylyn D. Ritchie,et al.  Using knowledge-driven genomic interactions for multi-omics data analysis: metadimensional models for predicting clinical outcomes in ovarian carcinoma , 2017, J. Am. Medical Informatics Assoc..

[56]  Alan Ashworth,et al.  Translating cancer research into targeted therapeutics , 2010, Nature.

[57]  K. Gunsalus,et al.  Network modeling links breast cancer susceptibility and centrosome dysfunction. , 2007, Nature genetics.

[58]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[59]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[60]  A. Nobel,et al.  Supervised risk predictor of breast cancer based on intrinsic subtypes. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[61]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[62]  A. Butte,et al.  Systematic pan-cancer analysis of tumour purity , 2015, Nature Communications.

[63]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of clear cell renal cell carcinoma , 2013, Nature.

[64]  Andrea Ciliberto,et al.  Low duplicability and network fragility of cancer genes. , 2008, Trends in genetics : TIG.

[65]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.