COXPRESdb v7: a gene coexpression database for 11 animal species supported by 23 coexpression platforms for technical evaluation and evolutionary inference

Abstract The advent of RNA-sequencing and microarray technologies has led to rapid growth of transcriptome data generated for a wide range of organisms, under various cellular, organ and individual conditions. Since the number of possible combinations of intercellular and extracellular conditions is almost unlimited, cataloging all transcriptome conditions would be an immeasurable challenge. Gene coexpression refers to the similarity of gene expression patterns under various conditions, such as disease states, tissue types, and developmental stages. Since the quality of gene coexpression data depends on the quality and quantity of transcriptome data, timely usage of the growing data is key to promoting individual research in molecular biology. COXPRESdb (http://coxpresdb.jp) is a database providing coexpression information for 11 animal species. One characteristic feature of COXPRESdb is its ability to compare multiple coexpression data derived from different transcriptomics technologies and different species, which strongly reduces false positive relationships in individual gene coexpression data. Here, we summarized the current version of this database, including 23 coexpression platforms with the highest-level quality till date. Using various functionalities in COXPRESdb, the new coexpression data would support a broader area of research from molecular biology to medical sciences.

[1]  K. Kinoshita,et al.  Rank of Correlation Coefficient as a Comparable Measure for Biological Significance of Gene Coexpression , 2009, DNA research : an international journal for rapid publication of reports on genes and genomes.

[2]  Sara Ballouz,et al.  Guidance for RNA-seq co-expression network construction and analysis: safety in numbers , 2015, Bioinform..

[3]  Kengo Kinoshita,et al.  COXPRESdb in 2015: coexpression database for animal species by DNA-microarray and RNAseq-based expression data with multiple quality assessment systems , 2014, Nucleic Acids Res..

[4]  Staffan Persson,et al.  Co-expression tools for plant biology: opportunities for hypothesis generation and caveats. , 2009, Plant, cell & environment.

[5]  Kengo Kinoshita,et al.  Multi-dimensional correlations for gene coexpression and application to the large-scale data of Arabidopsis , 2009, Bioinform..

[6]  Kengo Kinoshita,et al.  COXPRESdb: a database of coexpressed gene networks in mammals , 2007, Nucleic Acids Res..

[7]  Kengo Kinoshita,et al.  ATTED-II in 2016: A Plant Coexpression Database Towards Lineage-Specific Coexpression , 2015, Plant & cell physiology.

[8]  Kengo Kinoshita,et al.  ATTED-II in 2018: A Plant Coexpression Database Based on Investigation of the Statistical Property of the Mutual Rank Index , 2018, Plant & cell physiology.

[9]  Kengo Kinoshita,et al.  Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways , 2010, Journal of Plant Research.

[10]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[11]  S. Rhee,et al.  Towards revealing the functions of all genes in plants. , 2014, Trends in plant science.

[12]  Melissa J. Landrum,et al.  RefSeq: an update on mammalian reference sequences , 2013, Nucleic Acids Res..

[13]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[14]  A. Brazma,et al.  Reuse of public genome-wide gene expression data , 2012, Nature Reviews Genetics.

[15]  Kengo Kinoshita,et al.  COXPRESdb: a database to compare gene coexpression in seven model animals , 2010, Nucleic Acids Res..

[16]  Robert Petryszak,et al.  ArrayExpress update—simplifying data submissions , 2014, Nucleic Acids Res..

[17]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[18]  Kengo Kinoshita,et al.  Matataki: an ultrafast mRNA quantification method for large-scale reanalysis of RNA-Seq data , 2018, BMC Bioinformatics.

[19]  Kengo Kinoshita,et al.  ATTED-II in 2018: A Plant Coexpression Database Based on Investigation of the Statistical Property of the Mutual Rank Index , 2017, Plant & cell physiology.

[20]  Toshihisa Takagi,et al.  DNA Data Bank of Japan: 30th anniversary , 2017, Nucleic Acids Res..

[21]  Kengo Kinoshita,et al.  COXPRESdb: a database of comparative gene coexpression networks of eleven species for mammals , 2012, Nucleic Acids Res..

[22]  Staffan Persson,et al.  Beyond Genomics: Studying Evolution with Gene Coexpression Networks. , 2017, Trends in plant science.

[23]  Yoshiyuki Ogata,et al.  Approaches for extracting practical information from gene co-expression networks in plant biology. , 2007, Plant & cell physiology.