A web-based tool for the prediction of rice transcription factor function

Abstract Transcription factors (TFs) are an important class of regulatory molecules. Despite their importance, only a small number of genes encoding TFs have been characterized in Oryza sativa (rice), often because gene duplication and functional redundancy complicate their analysis. To address this challenge, we developed a web-based tool, the Rice Transcription Factor Phylogenomics Database (RTFDB), and demonstrate its application for predicting TF function. The RTFDB hosts transcriptome and co-expression analyses drawn from high-throughput oligonucleotide microarray data (Affymetrix and Agilent) and RNA-Seq-based expression profiles. We used the RTFDB to identify tissue-specific and stress-related gene expression. Meta-expression analysis identified 273 genes preferentially expressed in specific tissues or organs, 455 genes differentially expressed in response to four abiotic stresses, 179 genes responsive to infection by various pathogens, and 512 genes showing differential accumulation in response to various hormone treatments. Pairwise Pearson correlation coefficients between paralogous genes in a phylogenetic tree were used to assess their expression collinearity and thereby indicate possible genetic redundancy. Integrating transcriptome data with gene evolutionary information reveals possible functional redundancy or dominance among paralogous genes in a highly duplicated genome such as that of rice. With this method, we estimated a predominant role for 83.3% (65/78) of the TF or transcriptional regulator genes that had been characterized via loss-of-function studies. The proposed method should therefore be applicable to functional studies of other plant species with annotated genomes.
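The pairwise Pearson correlation step described in the abstract can be illustrated with a minimal sketch. This is not the RTFDB implementation: the function names (`pearson`, `paralog_correlations`), the gene identifiers, and the expression values are all hypothetical, and the sketch assumes each gene has one expression value per sample, in the same sample order.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # A flat profile has no defined correlation.
        return float("nan")
    return cov / sqrt(var_x * var_y)


def paralog_correlations(expression, clade):
    """Correlate every pair of paralogs in one clade of the tree.

    expression: dict mapping gene ID -> list of expression values
    clade: iterable of gene IDs taken to be paralogs (hypothetical input)
    """
    return {
        (g1, g2): pearson(expression[g1], expression[g2])
        for g1, g2 in combinations(clade, 2)
    }


# Hypothetical expression profiles across four samples.
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 3.0, 2.0, 1.0],
}
result = paralog_correlations(expression, ["geneA", "geneB", "geneC"])
```

A coefficient near 1 for a pair of paralogs would point to similar expression patterns and thus possible redundancy, while a low or negative value would suggest diverged expression. Any cutoff used to separate these cases is a choice the abstract does not specify.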
