CEMiTool: a Bioconductor package for performing comprehensive modular co-expression analyses

BackgroundThe analysis of modular gene co-expression networks is a well-established method commonly used for discovering the systems-level functionality of genes. In addition, these studies provide a basis for the discovery of clinically relevant molecular pathways underlying different diseases and conditions.ResultsIn this paper, we present a fast and easy-to-use Bioconductor package named CEMiTool that unifies the discovery and the analysis of co-expression modules. Using the same real datasets, we demonstrate that CEMiTool outperforms existing tools, and provides unique results in a user-friendly html report with high quality graphs. Among its features, our tool evaluates whether modules contain genes that are over-represented by specific pathways or that are altered in a specific sample group, as well as it integrates transcriptomic data with interactome information, identifying the potential hubs on each network. We successfully applied CEMiTool to over 1000 transcriptome datasets, and to a new RNA-seq dataset of patients infected with Leishmania, revealing novel insights of the disease’s physiopathology.ConclusionThe CEMiTool R package provides users with an easy-to-use method to automatically implement gene co-expression network analyses, obtain key information about the discovered gene modules using additional downstream analyses and retrieve publication-ready results via a high-quality interactive report.

[1]  Steve Horvath,et al.  Weighted Network Analysis , 2011 .

[2]  Derek S. Chiu,et al.  diceR: an R package for class discovery using an ensemble driven approach , 2018, BMC Bioinformatics.

[3]  Gennady Korotkevich,et al.  Fast gene set enrichment analysis , 2021 .

[4]  Jing Liu,et al.  Weighted gene co-expression network analysis identifies specific modules and hub genes related to coronary artery disease , 2016, BMC Cardiovascular Disorders.

[5]  Trai-Ming Yeh,et al.  MCP-1, a highly expressed chemokine in dengue haemorrhagic fever/dengue shock syndrome patients, may cause permeability change, possibly through reduced tight junctions of vascular endothelium cells. , 2006, The Journal of general virology.

[6]  Guan Yu,et al.  Variance stabilizing transformations of Poisson, binomial and negative binomial distributions , 2009 .

[7]  Shankar Subramaniam,et al.  Gene-expression measurement: variance-modeling considerations for robust data analysis , 2012, Nature Immunology.

[8]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[9]  Byung-Soo Kim,et al.  Extensive Psoriasis Induced by Pegylated Interferon Alfa-2a and Ribavirin in the Treatment of Chronic Hepatitis C , 2013, Annals of dermatology.

[10]  S. Horvath,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[11]  Tom C. Freeman,et al.  Transcriptome-Based Network Analysis Reveals a Spectrum Model of Human Macrophage Activation , 2014, Immunity.

[12]  Lincoln Stein,et al.  Reactome: a database of reactions, pathways and biological processes , 2010, Nucleic Acids Res..

[13]  Bin Zhang,et al.  Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R , 2008, Bioinform..

[14]  R. Gallo,et al.  Induction and exacerbation of psoriasis with Interferon‐alpha therapy for hepatitis C: A review and analysis of 36 cases , 2013, Journal of the European Academy of Dermatology and Venereology : JEADV.

[15]  Gary D. Bader,et al.  The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function , 2010, Nucleic Acids Res..

[16]  Sara Ballouz,et al.  Guidance for RNA-seq co-expression network construction and analysis: safety in numbers , 2015, Bioinform..

[17]  Shano Naseem,et al.  Hematologic Changes in Visceral Leishmaniasis/Kala Azar , 2010, Indian journal of hematology & blood transfusion : an official journal of Indian Society of Hematology and Blood Transfusion.

[18]  Zhining Wang,et al.  Sequential Waves of Gene Expression in Patients with Clinically Defined Dengue Illnesses Reveal Subtle Disease Phases and Predict Disease Severity , 2013, PLoS neglected tropical diseases.

[19]  Rainer Breitling,et al.  DiffCoEx: a simple and sensitive method to find differentially coexpressed gene modules , 2010, BMC Bioinformatics.

[20]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[21]  Hideyuki Suzuki,et al.  CoP: a database for characterizing co-expressed gene modules with biological information in plants , 2010, Bioinform..

[22]  Kim-Anh Do,et al.  DINGO: differential network analysis in genomics , 2015, Bioinform..

[23]  Richard Simon,et al.  A random variance model for detection of differential gene expression in small microarray experiments , 2003, Bioinform..

[24]  Kristi Abram,et al.  Transcriptional landscape of psoriasis identifies the involvement of IL36 and IL36RN , 2015, BMC Genomics.

[25]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[26]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[27]  Alexey Sergushichev,et al.  An algorithm for fast preranked gene set enrichment analysis using cumulative statistic calculation , 2016 .

[28]  Sasha Silva-Barrios,et al.  Protozoan Parasites and Type I IFNs , 2017, Front. Immunol..

[29]  S. Horvath,et al.  Conservation and evolution of gene coexpression networks in human and chimpanzee brains , 2006, Proceedings of the National Academy of Sciences.

[30]  Frederick C. Harris,et al.  petal: Co-expression network modelling in R , 2016, BMC Systems Biology.

[31]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[32]  Oscar Meruvia-Pastor,et al.  GeNET: a web application to explore and share Gene Co-expression Network Analysis data , 2017, PeerJ.

[33]  Michael Watson,et al.  CoXpress: differential co-expression in gene expression data , 2006, BMC Bioinformatics.

[34]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[35]  James T. Elder,et al.  Analysis of long non-coding RNAs highlights tissue-specific expression patterns and epigenetic profiles in normal and psoriatic skin , 2015, Genome Biology.

[36]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[37]  S. Horvath Weighted Network Analysis: Applications in Genomics and Systems Biology , 2011 .