Network-based Survival Analysis Reveals Subnetwork Signatures for Predicting Outcomes of Ovarian Cancer Treatment

Cox regression is commonly used to predict the outcome by the time to an event of interest and in addition, identify relevant features for survival analysis in cancer genomics. Due to the high-dimensionality of high-throughput genomic data, existing Cox models trained on any particular dataset usually generalize poorly to other independent datasets. In this paper, we propose a network-based Cox regression model called Net-Cox and applied Net-Cox for a large-scale survival analysis across multiple ovarian cancer datasets. Net-Cox integrates gene network information into the Cox's proportional hazard model to explore the co-expression or functional relation among high-dimensional gene expression features in the gene network. Net-Cox was applied to analyze three independent gene expression datasets including the TCGA ovarian cancer dataset and two other public ovarian cancer datasets. Net-Cox with the network information from gene co-expression or functional relations identified highly consistent signature genes across the three datasets, and because of the better generalization across the datasets, Net-Cox also consistently improved the accuracy of survival prediction over the Cox models regularized by or . This study focused on analyzing the death and recurrence outcomes in the treatment of ovarian carcinoma to identify signature genes that can more reliably predict the events. The signature genes comprise dense protein-protein interaction subnetworks, enriched by extracellular matrix receptors and modulators or by nuclear signaling components downstream of extracellular signal-regulated kinases. In the laboratory validation of the signature genes, a tumor array experiment by protein staining on an independent patient cohort from Mayo Clinic showed that the protein expression of the signature gene FBN1 is a biomarker significantly associated with the early recurrence after 12 months of the treatment in the ovarian cancer patients who are initially sensitive to chemotherapy. Net-Cox toolbox is available at http://compbio.cs.umn.edu/Net-Cox/.

[1]  Norman Breslow,et al.  Discussion of Professor Cox''s paper , 1974 .

[2]  Dong Xie,et al.  Ovarian Carcinomas: CCN Genes Are Aberrantly Expressed and CCN1 Promotes Proliferation of these Cells , 2005, Clinical Cancer Research.

[3]  W. Wong,et al.  A gene signature predictive for outcome in advanced ovarian cancer identifies a survival factor: microfibril-associated glycoprotein 2. , 2009, Cancer cell.

[4]  Yang Jing L1 Regularization Path Algorithm for Generalized Linear Models , 2008 .

[5]  Michel Tenenhaus,et al.  PLS generalised linear regression , 2005, Comput. Stat. Data Anal..

[6]  Vipin Kumar,et al.  Robust and efficient identification of biomarkers by classifying features on graphs , 2008, Bioinform..

[7]  Hongzhe Li,et al.  Dimension reduction methods for microarrays with application to censored survival data , 2004, Bioinform..

[8]  Hongzhe Li,et al.  Kernel Cox Regression Models for Linking Gene Expression Profiles to Censored Survival Data , 2002, Pacific Symposium on Biocomputing.

[9]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[10]  Ronald D. Alvarez,et al.  Lipoxygenase Pathway Receptor Expression in Ovarian Cancer , 2008, Reproductive Sciences.

[11]  Rebecca Sutphen,et al.  Lysophospholipids are potential biomarkers of ovarian cancer. , 2004, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[12]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[13]  Ross S Berkowitz,et al.  Up-regulation of stromal versican expression in advanced stage serous ovarian cancer. , 2010, Gynecologic oncology.

[14]  R. Tibshirani,et al.  Prediction by Supervised Principal Components , 2006 .

[15]  Torsten Hothorn,et al.  Bagging survival trees , 2002, Statistics in medicine.

[16]  L. V. van't Veer,et al.  Cross‐validated Cox regression on microarray gene expression data , 2006, Statistics in medicine.

[17]  Philippe Bastien,et al.  PLS-Cox model: Application to gene expression data , 2004 .

[18]  N. Mantel Evaluation of survival data and two new rank order statistics arising in its consideration. , 1966, Cancer chemotherapy reports.

[19]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[20]  Anne-Laure Boulesteix,et al.  Partial least squares: a versatile tool for the analysis of high-dimensional genomic data , 2006, Briefings Bioinform..

[21]  Jiang Gui,et al.  Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data , 2005, Bioinform..

[22]  Yan Xu,et al.  Lysophosphatidic acid downregulates tissue inhibitor of metalloproteinases, which are negatively involved in lysophosphatidic acid-induced cell invasion , 2007, Oncogene.

[23]  Insuk Sohn,et al.  Gradient lasso for Cox proportional hazards model , 2009, Bioinform..

[24]  Donald R Schwartz,et al.  Remodeling of the extracellular matrix through overexpression of collagen VI contributes to cisplatin resistance in ovarian cancer cells. , 2003, Cancer cell.

[25]  TaeHyun Hwang,et al.  A hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge , 2009, Bioinform..

[26]  Laurent Ozbun,et al.  A gene signature predicting for survival in suboptimally debulked patients with ovarian cancer. , 2008, Cancer research.

[27]  Anil K Sood,et al.  Biological significance of focal adhesion kinase in ovarian cancer: role in migration and invasion. , 2004, The American journal of pathology.

[28]  H. Mason,et al.  Linkage of regulators of TGF‐β activity in the fetal ovary to polycystic ovary syndrome , 2011, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[29]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[30]  Hongzhe Li,et al.  In Response to Comment on "Network-constrained regularization and variable selection for analysis of genomic data" , 2008, Bioinform..

[31]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[32]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[33]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[34]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[35]  T. Lumley,et al.  Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker , 2000, Biometrics.

[36]  M. Segal Microarray gene expression data with linked survival phenotypes: diffuse large-B-cell lymphoma revisited. , 2006, Biostatistics.

[37]  P. Bühlmann,et al.  Survival ensembles. , 2006, Biostatistics.

[38]  Meland,et al.  THE USE OF MOLECULAR PROFILING TO PREDICT SURVIVAL AFTER CHEMOTHERAPY FOR DIFFUSE LARGE-B-CELL LYMPHOMA , 2002 .

[39]  Arnoldo Frigessi,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm305 Gene expression Predicting survival from microarray data—a comparative study , 2022 .

[40]  R. Tothill,et al.  Novel Molecular Subtypes of Serous and Endometrioid Ovarian Cancer Linked to Clinical Outcome , 2008, Clinical Cancer Research.

[41]  D.,et al.  Regression Models and Life-Tables , 2022 .

[42]  Jiang Gui,et al.  Partial Cox regression analysis for high-dimensional microarray gene expression data , 2004, ISMB/ECCB.

[43]  S. Ochsner,et al.  Novel signaling pathways that control ovarian follicular development, ovulation, and luteinization. , 2002, Recent progress in hormone research.

[44]  Jian Huang,et al.  BMC Bioinformatics BioMed Central Methodology article Supervised group Lasso with applications to microarray data , 2007 .

[45]  M Szymanski,et al.  Microvessel density and CpG island methylation of the THBS2 gene in malignant ovarian tumors. , 2008, Journal of physiology and pharmacology : an official journal of the Polish Physiological Society.

[46]  J. Cavanaugh Biostatistics , 2005, Definitions.

[47]  B. Nan,et al.  Survival Analysis with High-Dimensional Covariates , 2010 .

[48]  Uc San Francisco,et al.  Microarray Gene Expression Data with Linked Survival Phenotypes: Diffuse Large-B-Cell Lymphoma Revisited , 2005 .

[49]  Ashu Sharma,et al.  Role of galaptin in ovarian carcinoma adhesion to extracellular matrix in vitro , 1990, Journal of cellular biochemistry.

[50]  Teresa M. Przytycka,et al.  Identifying Causal Genes and Dysregulated Pathways in Complex Diseases , 2011, PLoS Comput. Biol..

[51]  R. Tibshirani,et al.  Efficient quadratic regularization for expression arrays. , 2004, Biostatistics.

[52]  J. Reubi,et al.  Neuropeptide Y receptor expression in human primary ovarian neoplasms , 2004, Laboratory Investigation.

[53]  Chris Sander,et al.  CancerGenes: a gene selection resource for cancer genome projects , 2006, Nucleic Acids Res..

[54]  Yanling Hu,et al.  ADIPOQ gene polymorphisms and susceptibility to polycystic ovary syndrome: a HuGE survey and meta-analysis. , 2012, European journal of obstetrics, gynecology, and reproductive biology.

[55]  Hanno Steen,et al.  Development of human protein reference database as an initial platform for approaching systems biology in humans. , 2003, Genome research.

[56]  R L Stouffer,et al.  Decorin is a part of the ovarian extracellular matrix in primates and may act as a signaling molecule. , 2012, Human reproduction.

[57]  Ekaterina Voronina,et al.  Cell-type-selective induction of c-jun by TAF4b directs ovarian-specific transcription networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[58]  Jeffrey R. Marks,et al.  Role of Eotaxin-1 Signaling in Ovarian Cancer , 2009, Clinical Cancer Research.

[59]  Cherie Blenkiron,et al.  Identification of clinically relevant genes on chromosome 11 in a functional model of ovarian cancer tumor suppression. , 2003, Cancer research.

[60]  Wen Su,et al.  Production and binding of endothelin-2 (EDN2) in the rat ovary: endothelin receptor subtype A (EDNRA)-mediated contraction. , 2010, Reproduction, fertility, and development.

[61]  Danh V. Nguyen,et al.  Partial least squares proportional hazard regression for application to DNA microarray survival data , 2002, Bioinform..

[62]  Noah Simon,et al.  A Sparse-Group Lasso , 2013 .

[63]  Anne-Laure Boulesteix,et al.  Survival prediction using gene expression data: A review and comparison , 2009, Comput. Stat. Data Anal..

[64]  Y Pawitan,et al.  Gene expression profiling for prognosis using Cox regression , 2004, Statistics in medicine.

[65]  Wei Tang,et al.  Palmitoylation supports assembly and function of integrin–tetraspanin complexes , 2004, The Journal of cell biology.

[66]  Matthew A. Hibbs,et al.  Exploring the human genome with functional maps. , 2009, Genome research.

[67]  Alfonso Baldi,et al.  Serine protease HtrA1 modulates chemotherapy-induced cytotoxicity. , 2006, The Journal of clinical investigation.

[68]  R. Berkowitz,et al.  SPARC (secreted protein acidic and rich in cysteine) induces apoptosis in ovarian cancer cells. , 2001, The American journal of pathology.

[69]  Lieve Moons,et al.  CXCL12 and vascular endothelial growth factor synergistically induce neoangiogenesis in human ovarian cancers. , 2005, Cancer research.

[70]  Srinivasan Parthasarathy,et al.  Construction of a reference gene association network from multiple profiling data: application to data analysis , 2007, Bioinform..

[71]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.

[72]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[73]  H. Hollema,et al.  Survival-Related Profile, Pathways, and Transcription Factors in Ovarian Cancer , 2009, PLoS medicine.

[74]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.