LPRP: A Gene–Gene Interaction Network Construction Algorithm and Its Application in Breast Cancer Data Analysis

The importance of constructing gene–gene interaction (GGI) networks for better understanding breast cancer has previously been highlighted. In this study, we propose a novel GGI network construction method called linear and probabilistic relations prediction (LPRP) and use it to gain system-level insight into breast cancer mechanisms. We construct separate genome-wide GGI networks for tumor and normal breast samples by applying LPRP to their gene expression datasets profiled by The Cancer Genome Atlas. Our analysis shows a large loss of gene interactions in the tumor GGI network (7436; 88.7% reduction), which also contains fewer functional genes (4757; 32% reduction) than the normal network. The tumor GGI network is characterized by a larger network diameter and a longer characteristic path length, but a smaller clustering coefficient and much sparser network connections. In addition, many known cancer pathways, especially immune response pathways, are enriched among the genes in the tumor GGI network. Furthermore, this study screens for potential cancer genes, which may serve as drug target genes. These findings will allow for a better understanding of breast cancer mechanisms.
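The topological measures named above (network diameter, characteristic path length, clustering coefficient) can be illustrated with a minimal sketch. It does not implement LPRP and uses no data from the study: the two toy edge lists, the gene labels `g1` to `g4`, and the helper names are hypothetical, and the graphs are assumed to be undirected and connected.

```python
from collections import deque
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map from a list of (u, v) edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def bfs_distances(adj, source):
    """Shortest-path length (in edges) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def topology_summary(edges):
    """Diameter, characteristic path length, and mean clustering coefficient.

    Assumes the graph is connected; otherwise unreachable pairs are ignored.
    """
    adj = build_adjacency(edges)

    # Collect shortest-path lengths over all ordered node pairs.
    lengths = []
    for source in adj:
        dist = bfs_distances(adj, source)
        lengths.extend(d for target, d in dist.items() if target != source)

    # Local clustering: fraction of a node's neighbor pairs that are linked.
    coeffs = []
    for node, nbrs in adj.items():
        k = len(nbrs)
        if k < 2:
            coeffs.append(0.0)
            continue
        links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        coeffs.append(2 * links / (k * (k - 1)))

    return {
        "diameter": max(lengths),
        "char_path_length": sum(lengths) / len(lengths),
        "clustering": sum(coeffs) / len(coeffs),
    }


# Hypothetical toy networks: a fully connected one and a chain-like one.
dense_edges = [("g1", "g2"), ("g1", "g3"), ("g1", "g4"),
               ("g2", "g3"), ("g2", "g4"), ("g3", "g4")]
sparse_edges = [("g1", "g2"), ("g2", "g3"), ("g3", "g4")]

dense = topology_summary(dense_edges)
sparse = topology_summary(sparse_edges)
print("dense :", dense)
print("sparse:", sparse)
```

In this toy comparison, removing edges from the fully connected network lengthens the paths between nodes and removes the triangles that the clustering coefficient counts, which is the same direction of change the abstract reports for the tumor network relative to the normal one.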
