Biostatistical approaches for the reconstruction of gene co-expression networks based on transcriptomic data.

Techniques in molecular biology have permitted the gathering of an extremely large amount of information relating organisms and their genes. The current challenge is assigning a putative function to thousands of genes that have been detected in different organisms. One of the most informative types of genomic data to achieve a better knowledge of protein function is gene expression data. Based on gene expression data and assuming that genes involved in the same function should have a similar or correlated expression pattern, a function can be attributed to those genes with unknown functions when they appear to be linked in a gene co-expression network (GCN). Several tools for the construction of GCNs have been proposed and applied to plant gene expression data. Here, we review recent methodologies used for plant gene expression data and compare the results, advantages and disadvantages in order to help researchers in their choice of a method for the construction of GCNs.

[1]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[2]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[3]  Carsten O. Daub,et al.  The mutual information: Detecting and evaluating dependencies between variables , 2002, ECCB.

[4]  Hiroyuki Toh,et al.  Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling , 2002, Bioinform..

[5]  Kalpathi R. Subramanian,et al.  Interactive Analysis of Gene Interactions Using Graphical gaussian model , 2003, BIOKDD.

[6]  Carsten O. Daub,et al.  Estimating mutual information using B-spline functions – an improved similarity measure for analysing gene expression data , 2004, BMC Bioinformatics.

[7]  Joshua M. Stuart,et al.  A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules , 2003, Science.

[8]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[9]  Paul M. Magwene,et al.  Estimating genomic coexpression networks using first-order conditional independence , 2004, Genome Biology.

[10]  Paul P. Wang,et al.  Advances to Bayesian network inference for generating causal networks from observational biological data , 2004, Bioinform..

[11]  Satoru Miyano,et al.  Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks , 2004, J. Bioinform. Comput. Biol..

[12]  P. Bühlmann,et al.  Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana , 2004, Genome Biology.

[13]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[14]  Min Zou,et al.  A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data , 2005, Bioinform..

[15]  M. Hirai,et al.  Elucidation of Gene-to-Gene and Metabolite-to-Gene Networks in Arabidopsis by Integration of Metabolomics and Transcriptomics* , 2005, Journal of Biological Chemistry.

[16]  K. Strimmer,et al.  Statistical Applications in Genetics and Molecular Biology A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics , 2011 .

[17]  C. Morcia,et al.  From Single Genes to Co-Expression Networks: Extracting Knowledge from Barley Functional Genomics , 2005, Plant Molecular Biology.

[18]  Korbinian Strimmer,et al.  An empirical Bayes approach to inferring large-scale gene association networks , 2005, Bioinform..

[19]  Réka Albert,et al.  Elucidation of directionality for co-expressed genes: predicting intra-operon termination sites , 2006, Bioinform..

[20]  Kevin Murphy,et al.  Modelling Gene Expression Data using Dynamic Bayesian Networks , 2006 .

[21]  Jens Timmer,et al.  Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge , 2007, BMC Systems Biology.

[22]  R. Steuer,et al.  Metabolomic networks in plants: Transitions from pattern recognition to biological interpretation. , 2006, Bio Systems.

[23]  Marco Grzegorczyk,et al.  Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks , 2006, Bioinform..

[24]  Terence P. Speed,et al.  Genome analysis A genotype calling algorithm for affymetrix SNP arrays , 2005 .

[25]  Qingqiu Gong,et al.  An Arabidopsis gene network based on the graphical Gaussian model. , 2007, Genome research.

[26]  D. Husmeier,et al.  Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge , 2007, Statistical applications in genetics and molecular biology.

[27]  R. Albert,et al.  Systems-level insights into cellular regulation: inferring, analysing, and modelling intracellular networks. , 2007, IET systems biology.

[28]  Claudio Altafini,et al.  Comparing association network algorithms for reverse engineering of large-scale gene regulatory networks: synthetic versus real data , 2007, Bioinform..

[29]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[30]  Eve Syrkin Wurtele,et al.  Articulation of three core metabolic processes in Arabidopsis: Fatty acid biosynthesis, leucine catabolism and starch metabolism , 2008, BMC Plant Biology.

[31]  Dirk Husmeier,et al.  Gene Regulatory Network Reconstruction by Bayesian Integration of Prior Knowledge and/or Different Experimental Conditions , 2008, J. Bioinform. Comput. Biol..

[32]  Heribert Hirt,et al.  Arabidopsis MAPKs: a complex signalling network involved in multiple biological processes. , 2008, The Biochemical journal.

[33]  Staffan Persson,et al.  GeneCAT—novel webtools that combine BLAST and co-expression analyses , 2008, Nucleic Acids Res..

[34]  Marco Grzegorczyk,et al.  Modelling non-stationary gene regulatory processes with a non-homogeneous Bayesian network and the allocation sampler , 2008, Bioinform..

[35]  M. Hirai,et al.  Decoding genes with coexpression networks and metabolomics - 'majority report by precogs'. , 2008, Trends in plant science.

[36]  Hadar Less,et al.  Coordinations between gene modules control the operation of plant amino acid metabolic networks , 2009, BMC Systems Biology.

[37]  Grier P. Page,et al.  Bioinformatic Tools for Inferring Functional Information from Plant Microarray Data II: Analysis Beyond Single Gene , 2008, International journal of plant genomics.

[38]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[39]  H. Bohnert,et al.  Gene Q 1 networks in Arabidopsis thaliana for metabolic and environmental functions , 2007 .

[40]  Oliver Ebenhöh,et al.  Measuring correlations in metabolomic networks with mutual information. , 2008, Genome informatics. International Conference on Genome Informatics.

[41]  Scott T. Weiss,et al.  A graphical model approach for inferring large-scale networks integrating gene expression and genetic polymorphism , 2009, BMC Systems Biology.

[42]  Julie A. Dickerson,et al.  Arabidopsis gene co-expression network and its functional modules , 2009, BMC Bioinformatics.

[43]  Andrew J. Bulpitt,et al.  From gene expression to gene regulatory networks in Arabidopsis thaliana , 2009, BMC Systems Biology.

[44]  Kengo Kinoshita,et al.  ATTED-II provides coexpressed gene networks for Arabidopsis , 2008, Nucleic Acids Res..

[45]  Bonnie Berger,et al.  IsoRankN: spectral methods for global alignment of multiple protein networks , 2009, Bioinform..

[46]  Weixiong Zhang,et al.  A general co-expression network-based approach to gene expression analysis: comparison and applications , 2010, BMC Systems Biology.

[47]  Aureliano Bombarely,et al.  TobEA: an atlas of tobacco gene expression from seed to senescence , 2010, BMC Genomics.

[48]  K. Yamaguchi-Shinozaki,et al.  Transcriptional Regulatory Networks in Response to Abiotic Stresses in Arabidopsis and Grasses1 , 2009, Plant Physiology.

[49]  A. Loraine,et al.  Assembly of an Interactive Correlation Network for the Arabidopsis Genome Using a Novel Heuristic Clustering Algorithm1[W] , 2009, Plant Physiology.

[50]  Daniel A. Chamovitz,et al.  Large-scale analysis of Arabidopsis transcription reveals a basal co-regulation network , 2009, BMC Systems Biology.

[51]  Anne-Laure Boulesteix,et al.  Regularized estimation of large-scale gene association networks using graphical Gaussian models , 2009, BMC Bioinformatics.

[52]  Dirk Inzé,et al.  CORNET: A User-Friendly Tool for Data Mining and Integration1[W] , 2010, Plant Physiology.

[53]  Karthik Raman,et al.  Construction and analysis of protein–protein interaction networks , 2010, Automated experimentation.

[54]  Fabian J. Theis,et al.  Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data , 2011, BMC Systems Biology.

[55]  Bor-Sen Chen,et al.  Constructing gene regulatory networks for long term photosynthetic light acclimation in Arabidopsis thaliana , 2011, BMC Bioinformatics.

[56]  Olga G. Troyanskaya,et al.  Integrated functional networks of process, tissue, and developmental stage specific interactions in Arabidopsis thaliana , 2010, BMC Systems Biology.

[57]  E. Marcotte,et al.  Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana , 2010, Nature Biotechnology.

[58]  F. Feltus,et al.  Gene Coexpression Network Alignment and Conservation of Gene Modules between Two Grass Species: Maize and Rice[C][W][OA] , 2011, Plant Physiology.

[59]  B. Usadel,et al.  PlaNet: Combined Sequence and Expression Comparisons across Plant Networks Derived from Seven Species[W][OA] , 2011, Plant Cell.

[60]  E. Marcotte,et al.  Systematic prediction of gene function in Arabidopsis thaliana using a probabilistic functional gene network , 2011, Nature Protocols.

[61]  Sanghamitra Bandyopadhyay,et al.  A Biologically Inspired Measure for Coexpression Analysis , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[62]  Jonathan D. G. Jones,et al.  Evidence for Network Evolution in an Arabidopsis Interactome Map , 2011, Science.

[63]  Kevin L. Childs,et al.  Gene Coexpression Network Analysis as a Source of Functional Annotation for Rice Genes , 2011, PloS one.

[64]  M. S. Mukhtar,et al.  Independently Evolved Virulence Effectors Converge onto Hubs in a Plant Immune System Network , 2011, Science.

[65]  Y. Ouyang,et al.  Genomic survey, expression profile and co-expression network analysis of OsWD40 family in rice , 2012, BMC Genomics.

[66]  Insuk Lee,et al.  Towards Establishment of a Rice Stress Response Interactome , 2011, PLoS genetics.

[67]  D. Husmeier,et al.  Dynamic Bayesian networks in molecular plant science: inferring gene regulatory networks from multiple gene expression time series , 2012, Euphytica.

[68]  Lonnie R. Welch,et al.  AGRIS: the Arabidopsis Gene Regulatory Information Server, an update , 2010, Nucleic Acids Res..

[69]  Kazuo Shinozaki,et al.  Global Landscape of a Co-Expressed Gene Network in Barley and its Application to Gene Discovery in Triticeae Crops , 2011, Plant & cell physiology.

[70]  E. Marcotte,et al.  Genetic dissection of the biotic stress response using a genome-scale gene network for rice , 2011, Proceedings of the National Academy of Sciences.

[71]  S. Seno,et al.  An estimation method for inference of gene regulatory net-work using Bayesian network with uniting of partial problems , 2012, BMC Genomics.

[72]  Sakarindr Bhumiratana,et al.  Inferring transcriptional gene regulation network of starch metabolism in Arabidopsis thaliana leaves using graphical Gaussian model , 2012, BMC Systems Biology.

[73]  Min Chen,et al.  Comparing Statistical Methods for Constructing Large Scale Gene Networks , 2012, PloS one.

[74]  A. Fukushima,et al.  Exploring Tomato Gene Functions Based on Coexpression Modules Using Graph Clustering and Differential Coexpression Approaches1[C][W][OA] , 2012, Plant Physiology.