The Association of Multiple Interacting Genes with Specific Phenotypes in Rice Using Gene Coexpression Networks1[C][W][OA]

Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes.

[1]  M. Daly,et al.  Guilt by association , 2000, Nature Genetics.

[2]  Audrey Kauffmann,et al.  Bioinformatics Applications Note Arrayqualitymetrics—a Bioconductor Package for Quality Assessment of Microarray Data , 2022 .

[3]  Joaquín Dopazo,et al.  The role of the environment in Parkinson's disease. , 1996, Nucleic Acids Res..

[4]  Paramvir S. Dehal,et al.  Snapshot of iron response in Shewanella oneidensis by gene network reconstruction , 2009, BMC Genomics.

[5]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[6]  T. Pham,et al.  RiceArrayNet: A Database for Correlating Gene Expression from Transcriptome Profiling, and Its Application to the Analysis of Coexpressed Genes in Rice1[C][W][OA] , 2009, Plant Physiology.

[7]  Atul J. Butte,et al.  Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks , 2005, BMC Bioinformatics.

[8]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[9]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[10]  Eve Syrkin Wurtele,et al.  Articulation of three core metabolic processes in Arabidopsis: Fatty acid biosynthesis, leucine catabolism and starch metabolism , 2008, BMC Plant Biology.

[11]  Peter Widmayer,et al.  Genevestigator V3: A Reference Expression Database for the Meta-Analysis of Transcriptomes , 2008, Adv. Bioinformatics.

[12]  Fidel Ramírez,et al.  Computing topological parameters of biological networks , 2008, Bioinform..

[13]  Hailin Chen,et al.  STARNET 2: a web-based tool for accelerating discovery of gene regulatory networks using microarray co-expression data , 2009, BMC Bioinformatics.

[14]  Daniel A. Chamovitz,et al.  Large-scale analysis of Arabidopsis transcription reveals a basal co-regulation network , 2009, BMC Systems Biology.

[15]  M. Robles,et al.  University of Birmingham High throughput functional annotation and data mining with the Blast2GO suite , 2022 .

[16]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[17]  Yoshiyuki Ogata,et al.  A database for poplar gene co-expression analysis for systematic understanding of biological processes, including stress responses , 2009, Journal of Wood Science.

[18]  E. Marcotte,et al.  Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana , 2010, Nature Biotechnology.

[19]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[20]  Julie A. Dickerson,et al.  Arabidopsis gene co-expression network and its functional modules , 2009, BMC Bioinformatics.

[21]  Homin K. Lee,et al.  Coexpression analysis of human genes across many microarray data sets. , 2004, Genome research.

[22]  A. Miyao,et al.  Target Site Specificity of the Tos17 Retrotransposon Shows a Preference for Insertion within Genes and against Insertion in Retrotransposon-Rich Regions of the Genome Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.012559. , 2003, The Plant Cell Online.

[23]  C. Morcia,et al.  From Single Genes to Co-Expression Networks: Extracting Knowledge from Barley Functional Genomics , 2005, Plant Molecular Biology.

[24]  Edward R B McCabe,et al.  Weighted gene co-expression network analysis identifies biomarkers in glycerol kinase deficient mice. , 2009, Molecular genetics and metabolism.

[25]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[26]  Aureliano Bombarely,et al.  TobEA: an atlas of tobacco gene expression from seed to senescence , 2010, BMC Genomics.

[27]  Akiyasu C. Yoshizawa,et al.  KAAS: an automatic genome annotation and pathway reconstruction server , 2007, Environmental health perspectives.

[28]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[29]  A. Loraine,et al.  Transcriptional Coordination of the Metabolic Network in Arabidopsis1[W][OA] , 2006, Plant Physiology.

[30]  Staffan Persson,et al.  Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Douglas A. Hosack,et al.  Identifying biological themes within lists of genes with EASE , 2003, Genome Biology.

[32]  S. Horvath,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[33]  Yoshiyuki Ogata,et al.  Approaches for extracting practical information from gene co-expression networks in plant biology. , 2007, Plant & cell physiology.

[34]  Hideyuki Suzuki,et al.  CoP: a database for characterizing co-expressed gene modules with biological information in plants , 2010, Bioinform..

[35]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[36]  A. Loraine,et al.  Assembly of an Interactive Correlation Network for the Arabidopsis Genome Using a Novel Heuristic Clustering Algorithm1[W] , 2009, Plant Physiology.

[37]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[38]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[39]  Alex Bateman,et al.  The InterPro database, an integrated documentation resource for protein families, domains and functional sites , 2001, Nucleic Acids Res..

[40]  Feng Luo,et al.  Constructing gene co-expression networks and predicting functions of unknown genes by random matrix theory , 2007, BMC Bioinformatics.

[41]  Kengo Kinoshita,et al.  ATTED-II provides coexpressed gene networks for Arabidopsis , 2008, Nucleic Acids Res..

[42]  John W. Pinney,et al.  Arabidopsis Co-expression Tool (ACT): web server tools for microarray-based gene expression analysis , 2006, Nucleic Acids Res..

[43]  John A. Hamilton,et al.  The TIGR Rice Genome Annotation Resource: improvements and new features , 2006, Nucleic Acids Res..

[44]  H. Hirochika,et al.  Retrotransposons of rice involved in mutations induced by tissue culture. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Zongli Hu,et al.  Function Annotation of an SBP-box Gene in Arabidopsis Based on Analysis of Co-expression Networks and Promoters , 2009, International journal of molecular sciences.

[46]  Joshua M. Stuart,et al.  A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules , 2003, Science.

[47]  D. Landsman,et al.  Identification of cis-regulatory elements in gene co-expression networks using A-GLAM. , 2009, Methods in molecular biology.