Functional clusters analysis and research based on differential coexpression networks

ABSTRACT Differential coexpression analysis has gradually become an important approach to improve the conventional method of analysing differentially expressed genes. With this approach, it is possible to discover disease mechanisms and underlying regulatory dynamics which remain obscure in differential expression analysis. The detection of differential coexpression links and functional clusters between different disease states is a demanding task. Nevertheless, there is no gold standard for detecting differential coexpression links and functional clusters. Consequently, we developed a novel fusion algorithm FDvDe (Fusion of differential vertex and differential edge) to detect differential coexpression links by aggregating the set of ‘differential vertex’ and ‘differential edge.’ Then, we constructed differential coexpression networks between normal and tumour states by integrating the differential coexpression links. With this approach, we identified 1823 genes and 29370 links. Then, we developed the algorithms GTHC (GO term hierarchical clusters) to identify functional modules. The distance matrix used in the hierarchical process was formed by the GO semantic similarity. Furthermore, we aggregated the densities among clusters describing the connectivity among clusters and topological property analysis to discover the hub genes and hub pathways which play an important role in disease mechanism. In this paper, we showed that our approach worked well on a data set of breast cancer samples (68 tumour samples) and normal samples (73 normal samples), and revealed the crucial role and biological significance of the modules and hub genes found in this approach.

[1]  S. Kwak,et al.  miR-181b-3p promotes epithelial-mesenchymal transition in breast cancer cells through Snail stabilization by directly targeting YWHAG. , 2016, Biochimica et biophysica acta.

[2]  K. Choromanski,et al.  Scale-Free Graph with Preferential Attachment and Evolving Internal Vertex Structure , 2013 .

[3]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[4]  J. Weinstein,et al.  Activation of aminoflavone (NSC 686288) by a sulfotransferase is required for the antiproliferative effect of the drug and for induction of histone gamma-H2AX. , 2006, Cancer research.

[5]  Thomas Lengauer,et al.  A new measure for functional similarity of gene products based on Gene Ontology , 2006, BMC Bioinformatics.

[6]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[7]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[8]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[9]  Hsuan-Cheng Huang,et al.  Functional Analysis and Characterization of Differential Coexpression Networks , 2015, Scientific Reports.

[10]  Antonio Reverter,et al.  Simultaneous identification of differential gene expression and connectivity in inflammation, adipogenesis and cancer , 2006, Bioinform..

[11]  W. Gerald,et al.  An estrogen receptor-negative breast cancer subset characterized by a hormonally regulated transcriptional program and response to androgen , 2006, Oncogene.

[12]  H. Yokozaki,et al.  Gene expression and pathway analysis of CTNNB1 in cancer and stem cells , 2016, World journal of stem cells.

[13]  S. Rodenhuis,et al.  SERPINA6, BEX1, AGTR1, SLC26A3, and LAPTM4B are markers of resistance to neoadjuvant chemotherapy in HER2-negative breast cancer , 2012, Breast Cancer Research and Treatment.

[14]  A. Spurdle,et al.  Mutation analysis of FANCD2, BRIP1/BACH1, LMO4 and SFN in familial breast cancer , 2005, Breast Cancer Research.

[15]  R. Linding,et al.  Personalized Network‐Based Treatments in Oncology , 2013, Clinical pharmacology and therapeutics.

[16]  R. Hammer,et al.  Acetate Dependence of Tumors , 2014, Cell.

[17]  Carsten Denkert,et al.  New network topology approaches reveal differential correlation patterns in breast cancer , 2013, BMC Systems Biology.

[18]  T. Ideker,et al.  Systematic interpretation of genetic interactions using protein networks , 2005, Nature Biotechnology.

[19]  A. G. de la Fuente From 'differential expression' to 'differential networking' - identification of dysfunctional regulatory networks in diseases. , 2010, Trends in genetics : TIG.

[20]  D. Noh,et al.  Differential profiling of breast cancer plasma proteome by isotope-coded affinity tagging method reveals biotinidase as a breast cancer biomarker , 2010, BMC Cancer.

[21]  Hui Yu,et al.  Link-based quantitative methods to identify differentially coexpressed genes and gene Pairs , 2011, BMC Bioinformatics.

[22]  Eric E Schadt,et al.  Elucidating the role of gonadal hormones in sexually dimorphic gene coexpression networks. , 2009, Endocrinology.

[23]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[24]  Li Li,et al.  High‐resolution genomic and expression analyses of copy number alterations in breast tumors , 2008, Genes, chromosomes & cancer.

[25]  Brian S. Roberts,et al.  Recurrent read-through fusion transcripts in breast cancer , 2014, Breast Cancer Research and Treatment.

[26]  R. Kodet,et al.  Identification of potential human oncogenes by mapping the common viral integration sites in avian nephroblastoma. , 2006, Cancer research.

[27]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[28]  M. Ramanathan,et al.  Network‐Based Approaches in Drug Discovery and Early Development , 2013, Clinical pharmacology and therapeutics.

[29]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[30]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[31]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[32]  Robert Clarke,et al.  Differential dependency network analysis to identify condition-specific topological changes in biological networks , 2009, Bioinform..

[33]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[34]  T. Ideker,et al.  Differential network biology , 2012, Molecular systems biology.

[35]  Michael Costanzo,et al.  Charting the genetic interaction map of a cell. , 2011, Current opinion in biotechnology.

[36]  A. Fuente,et al.  From ‘differential expression’ to ‘differential networking’ – identification of dysfunctional regulatory networks in diseases , 2010 .

[37]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[38]  A. Zwinderman,et al.  Transcriptome sequencing (RNA-Seq) of human endobronchial biopsies: asthma versus controls , 2013, European Respiratory Journal.

[39]  Lucinda K. Southworth,et al.  Aging Mice Show a Decreasing Correlation of Gene Expression within Genetic Modules , 2009, PLoS genetics.

[40]  Stanley Fields,et al.  Lethal combinations , 2003, Nature Genetics.