Identification of Coordinately Dysregulated Subnetworks in Complex Phenotypes

In the study of complex phenotypes, single gene markers provide only limited insight into the manifestation of the phenotype. Protein-protein interaction (PPI) networks help address this limitation by enabling the identification of multiple interacting markers. Recent studies show that, when considered together, many proteins that are connected via physical and functional interactions exhibit significant differential expression with respect to various complex phenotypes, including cancers. Compared to single gene markers, these "coordinately dysregulated subnetworks" significantly improve the diagnosis and prognosis of cancer and offer novel insights into the network dynamics of the phenotype. However, identifying coordinately dysregulated subnetworks presents significant algorithmic challenges. Existing approaches use heuristics that greedily maximize information-theoretic class-separability measures; however, by the very definition of "coordinate" dysregulation, such greedy algorithms are not well suited to this problem. In this paper, we formulate coordinate dysregulation in the context of the well-known set-cover problem, with a view to capturing the coordination between multiple genes at a sample-specific resolution. Based on this formulation, we adapt state-of-the-art approximation algorithms for set-cover to the identification of coordinately dysregulated subnetworks. Comprehensive experimental results on human colorectal cancer (CRC) show that, compared to existing algorithms, the proposed algorithm, NETCOVER, significantly improves the diagnosis of cancer and the prediction of metastasis. Our results also demonstrate that subnetworks in the neighborhood of known CRC driver genes exhibit significant coordinate dysregulation, indicating that the notion of coordinate dysregulation may indeed be useful in understanding the network dynamics of complex phenotypes.
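To make the set-cover view concrete, the following is a minimal sketch of the classic greedy approximation for set cover, in which each gene is treated as "covering" the set of samples where it is dysregulated. It is not the NETCOVER algorithm itself: the abstract does not specify how coverage is defined or how the PPI network constrains the search, so the function name `greedy_set_cover`, the gene and sample labels, and the coverage sets are all hypothetical, and the connectivity requirement on the subnetwork is omitted.

```python
from typing import Dict, List, Set, Tuple


def greedy_set_cover(
    universe: Set[str], candidates: Dict[str, Set[str]]
) -> Tuple[List[str], Set[str]]:
    """Classic greedy set cover (illustrative only, not NETCOVER).

    universe:   samples that should be covered (hypothetical labels).
    candidates: maps each gene to the samples it covers (assumed input).
    Returns the chosen genes in selection order and any samples that
    no candidate can cover.
    """
    uncovered = set(universe)
    chosen: List[str] = []
    while uncovered:
        # Pick the gene covering the most still-uncovered samples;
        # iterating in sorted order makes tie-breaking deterministic.
        best = max(sorted(candidates), key=lambda g: len(candidates[g] & uncovered))
        gain = candidates[best] & uncovered
        if not gain:
            break  # remaining samples cannot be covered by any gene
        chosen.append(best)
        uncovered -= gain
    return chosen, uncovered


# Hypothetical toy data: which samples each gene is dysregulated in.
samples = {"s1", "s2", "s3", "s4", "s5"}
covers = {
    "geneA": {"s1", "s2", "s3"},
    "geneB": {"s3", "s4"},
    "geneC": {"s4", "s5"},
    "geneD": {"s1"},
}
chosen, uncovered = greedy_set_cover(samples, covers)
print(chosen, uncovered)
```

In this toy example no single gene covers every sample, but two genes together do, which is the sample-level complementarity that the set-cover formulation is meant to capture. A network-aware variant would presumably restrict each step to genes adjacent to those already chosen; that restriction is an assumption here and is not shown.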
