Graph Theoretic and Pearson Correlation-Based Discovery of Network Biomarkers for Cancer

Two graph theoretic concepts—clique and bipartite graphs—are explored to identify the network biomarkers for cancer at the gene network level. The rationale is that a group of genes work together by forming a cluster or a clique-like structures to initiate a cancer. After initiation, the disease signal goes to the next group of genes related to the second stage of a cancer, which can be represented as a bipartite graph. In other words, bipartite graphs represent the cross-talk among the genes between two disease stages. To prove this hypothesis, gene expression values for three cancers— breast invasive carcinoma (BRCA), colorectal adenocarcinoma (COAD) and glioblastoma multiforme (GBM)—are used for analysis. First, a co-expression gene network is generated with highly correlated gene pairs with a Pearson correlation coefficient ≥ 0.9. Second, clique structures of all sizes are isolated from the co-expression network. Then combining these cliques, three different biomarker modules are developed—maximal clique-like modules, 2-clique-1-bipartite modules, and 3-clique-2-bipartite modules. The list of biomarker genes discovered from these network modules are validated as the essential genes for causing a cancer in terms of network properties and survival analysis. This list of biomarker genes will help biologists to design wet lab experiments for further elucidating the complex mechanism of cancer.

[1]  Alexandra Maertens,et al.  Weighted Gene Correlation Network Analysis (WGCNA) Reveals Novel Transcription Factors Associated With Bisphenol A Dose-Response , 2018, Front. Genet..

[2]  Ling Wang,et al.  Prognostic genes of breast cancer revealed by gene co-expression network analysis. , 2017, Oncology letters.

[3]  Jianjun Hu,et al.  Scored Protein-Protein Interaction to Predict Subcellular Localizations for Yeast Using Diffusion Kernel , 2013, PReMI.

[4]  Tijana Milenkovic,et al.  Dynamic networks reveal key players in aging , 2014, BCB.

[5]  Jianjun Hu,et al.  Network based prediction of protein localisation using diffusion Kernel , 2014, Int. J. Data Min. Bioinform..

[6]  J. Bader,et al.  Finding friends and enemies in an enemies-only network: a graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic interactions. , 2008, Genome research.

[7]  Cong Zhang 张 丛,et al.  Weighted gene co-expression network analysis of gene modules for the prognosis of esophageal cancer , 2017, Journal of Huazhong University of Science and Technology [Medical Sciences].

[8]  Jianjun Hu,et al.  Protein Localization by Integrating Multiple Protein Correlation Networks , 2012 .

[9]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Martin Kuiper,et al.  BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks , 2005, Bioinform..

[11]  Michael A. Langston,et al.  Threshold selection in gene co-expression networks using spectral graph theory techniques , 2009, BMC Bioinformatics.

[12]  M. Mondal Ananda,et al.  Graph Theoretic Concepts as the Building Blocks for Disease Initiation and Progression at Protein Network Level: Identification and Challenges , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[13]  Wei Zhang,et al.  Associating transcriptional modules with colon cancer survival through weighted gene co-expression network analysis , 2017, BMC Genomics.

[14]  R. Wolfe,et al.  Transient effects in the Cox proportional hazards regression model. , 1995, Statistics in medicine.

[15]  Jing Wang,et al.  LinkedOmics: analyzing multi-omics data within and across 32 cancer types , 2017, Nucleic Acids Res..

[16]  I. Jurisica,et al.  Immune-enrichment of non-small cell lung cancer baseline biopsies for multiplex profiling define prognostic immune checkpoint combinations for patient stratification , 2019, Journal of Immunotherapy for Cancer.

[17]  A. Hu,et al.  Identification of Key Gene Modules in Human Osteosarcoma by Co‐Expression Analysis Weighted Gene Co‐Expression Network Analysis (WGCNA) , 2017, Journal of cellular biochemistry.

[18]  Eric T. Dawson,et al.  ReactomeFIViz: the Reactome FI Cytoscape app for pathway and network-based data analysis. , 2014, F1000Research.

[19]  R. Chapkin,et al.  Functional link between plasma membrane spatiotemporal dynamics, cancer biology, and dietary membrane-altering agents , 2018, Cancer and Metastasis Reviews.

[20]  Dong-qing Zhang,et al.  Identification of hub genes and pathways associated with bladder cancer based on co-expression network analysis. , 2017, Oncology letters.

[21]  Hongyu Liu,et al.  A prognostic prediction system for hepatocellular carcinoma based on gene co-expression network , 2019, Experimental and therapeutic medicine.

[22]  Jianjun Hu,et al.  NetLoc: Network based protein localization prediction using protein-protein interaction and co-expression networks , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[23]  Sourav Bandyopadhyay,et al.  Rewiring of Genetic Networks in Response to DNA Damage , 2010, Science.

[24]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[25]  S. Horvath,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[26]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008 .

[27]  Bing Zhang,et al.  Co-expression module analysis reveals biological processes, genomic gain, and regulatory mechanisms associated with breast cancer progression , 2010, BMC Systems Biology.

[28]  T. Ideker,et al.  Integrating phenotypic and expression profiles to map arsenic-response networks , 2004, Genome Biology.

[29]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[30]  M. J. van de Vijver,et al.  Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. , 2006, Journal of the National Cancer Institute.

[31]  Jianjun Hu,et al.  Minimalist ensemble algorithms for genome-wide protein localization prediction , 2012, BMC Bioinformatics.

[32]  M. Mondal Ananda,et al.  STRING PPI Score to Characterize Protein Subnetwork Biomarkers for Human Diseases and Pathways , 2014, 2014 IEEE International Conference on Bioinformatics and Bioengineering.

[33]  B. Tiwary,et al.  Identification of Molecular Biomarkers for Ovarian Cancer using Computational Approaches. , 2019, Carcinogenesis.

[34]  Chung-Yen Lin,et al.  cytoHubba: identifying hub objects and sub-networks from complex interactome , 2014, BMC Systems Biology.

[35]  Jianjun Hu,et al.  Network based subcellular localization prediction for multi-label proteins , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW).

[36]  Atul J. Butte,et al.  Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks , 2005, BMC Bioinformatics.

[37]  Zhi-Ping Liu,et al.  Detecting pathway biomarkers of diabetic progression with differential entropy , 2018, J. Biomed. Informatics.

[38]  Jiangbo Ren,et al.  Overexpression of ASPM, CDC20, and TTK Confer a Poorer Prognosis in Breast Cancer Identified by Gene Co-expression Network Analysis , 2019, Front. Oncol..

[39]  Ting Chen,et al.  Diffusion kernel-based logistic regression models for protein function prediction. , 2006, Omics : a journal of integrative biology.