Discovering structural cis-regulatory elements by modeling the behaviors of mRNAs

Gene expression is regulated at each step from chromatin remodeling through translation and degradation. Several known RNA‐binding regulatory proteins interact with specific RNA secondary structures in addition to specific nucleotides. To provide a more comprehensive understanding of the regulation of gene expression, we developed an integrative computational approach that leverages functional genomics data and nucleotide sequences to discover RNA secondary structure‐defined cis‐regulatory elements (SCREs). We applied our structural cis‐regulatory element detector (StructRED) to microarray and mRNA sequence data from Saccharomyces cerevisiae, Drosophila melanogaster, and Homo sapiens. We recovered the known specificities of Vts1p in yeast and Smaug in flies. In addition, we discovered six putative SCREs in flies and three in humans. We characterized the SCREs based on their condition‐specific regulatory influences, the annotation of the transcripts that contain them, and their locations within transcripts. Overall, we show that modeling functional genomics data in terms of combined RNA structure and sequence motifs is an effective method for discovering the specificities and regulatory roles of RNA‐binding proteins.

[1]  Joaquín Moreno,et al.  Genomics and gene transcription kinetics in yeast. , 2007, Trends in genetics : TIG.

[2]  Cécile Robard,et al.  Phosphorylation status of the Kep1 protein alters its affinity for its protein binding partner alternative splicing factor ASF/SF2. , 2006, The Biochemical journal.

[3]  Amos Tanay,et al.  Extensive low-affinity transcriptional interactions in the yeast genome. , 2006, Genome research.

[4]  C. Smibert,et al.  Smaug Recruits the CCR4/POP2/NOT Deadenylase Complex to Trigger Maternal Transcript Localization in the Early Drosophila Embryo , 2005, Current Biology.

[5]  Eran Segal,et al.  Computational prediction of RNA structural motifs involved in posttranscriptional regulatory processes , 2008, Proceedings of the National Academy of Sciences.

[6]  Daniel Herschlag,et al.  Genome-wide identification of mRNAs associated with the translational regulator PUMILIO in Drosophila melanogaster. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[7]  F. E. Grubbs Procedures for Detecting Outlying Observations in Samples , 1969 .

[8]  M. Hattori,et al.  A large-scale full-length cDNA analysis to explore the budding yeast transcriptome , 2006, Proceedings of the National Academy of Sciences.

[9]  H. Bussemaker,et al.  Regulatory element detection using correlation with expression , 2001, Nature Genetics.

[10]  S. Richard,et al.  kep1 interacts genetically with dredd/Caspase-8, and kep1 mutants alter the balance of dredd isoforms , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Tzvi Aviv,et al.  Sequence-specific recognition of RNA hairpins by the SAM domain of Vts1p , 2006, Nature Structural &Molecular Biology.

[12]  M. Gorospe,et al.  Global analysis of stress-regulated mRNA turnover by using cDNA arrays , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[13]  E. Groisman,et al.  The intricate world of riboswitches. , 2007, Current opinion in microbiology.

[14]  P. Silver,et al.  Genome-wide identification of functionally distinct subsets of cellular mRNAs associated with two nucleocytoplasmic-shuttling mammalian splicing factors , 2006, Genome Biology.

[15]  Barrett C. Foat,et al.  Predictive modeling of genome-wide mRNA expression: from modules to molecules. , 2007, Annual review of biophysics and biomolecular structure.

[16]  Timothy R Hughes,et al.  SMAUG is a major regulator of maternal mRNA destabilization in Drosophila and its translation is activated by the PAN GU kinase. , 2007, Developmental cell.

[17]  G. Rubin,et al.  Global analyses of mRNA translational control during early Drosophila embryogenesis , 2007, Genome Biology.

[18]  J. Keene RNA regulons: coordination of post-transcriptional events , 2007, Nature Reviews Genetics.

[19]  Raffaele Fronza,et al.  Global alterations in mRNA polysomal recruitment in a cell model of colorectal cancer progression to metastasis. , 2006, Carcinogenesis.

[20]  Xin Wang,et al.  Solution structure of the Vts1 SAM domain in the presence of RNA. , 2006, Journal of molecular biology.

[21]  L. Vardy,et al.  Regulating translation of maternal messages: multiple repression mechanisms. , 2007, Trends in cell biology.

[22]  T. D. Schneider,et al.  Quantitative analysis of the relationship between nucleotide sequence and functional activity. , 1986, Nucleic acids research.

[23]  O. Ohara,et al.  Post‐transcriptional effects of phorbol 12‐myristate 13‐acetate on transcriptome of U937 cells , 2004, FEBS letters.

[24]  G. Boccaccio,et al.  Mammalian Smaug Is a Translational Repressor That Forms Cytoplasmic Foci Similar to Stress Granules* , 2005, Journal of Biological Chemistry.

[25]  Barrett C. Foat,et al.  Profiling condition-specific, genome-wide regulation of mRNA stability in yeast. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[26]  D. Sankoff Simultaneous Solution of the RNA Folding, Alignment and Protosequence Problems , 1985 .

[27]  Wolfgang Huber,et al.  A high-resolution map of transcription in the yeast genome. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[28]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[29]  L. Furic,et al.  A genome-wide approach identifies distinct but overlapping subsets of cellular mRNAs associated with Staufen1- and Staufen2-containing ribonucleoprotein complexes. , 2007, RNA.

[30]  Thomas Lecuit,et al.  Developmental control of nuclear morphogenesis and anchoring by charleston, identified in a functional genomic screen of Drosophila cellularisation , 2006, Development.

[31]  Joshua L. Goodman,et al.  FlyBase: integration and improvements to query tools , 2007, Nucleic Acids Res..

[32]  Alexandre V. Morozov,et al.  Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE , 2006, ISMB.

[33]  C. Smibert,et al.  Smaug, a novel and conserved protein, contributes to repression of nanos mRNA translation in vitro. , 1999, RNA.

[34]  Stanley Fields,et al.  A conserved RNA-binding protein that regulates sexual fates in the C. elegans hermaphrodite germ line , 1997, Nature.

[35]  M. Ashburner,et al.  Systematic determination of patterns of gene expression during Drosophila embryogenesis , 2002, Genome Biology.

[36]  O. Elemento,et al.  Unmasking Activation of the Zygotic Genome Using Chromosomal Deletions in the Drosophila Embryo , 2007, PLoS Biology.

[37]  Philip E. Johnson,et al.  RNA recognition by the Vts1p SAM domain , 2006, Nature Structural &Molecular Biology.

[38]  C. Smibert,et al.  S. cerevisiae Vts1p induces deadenylation-dependent transcript degradation and interacts with the Ccr4p-Pop2p-Not deadenylase complex. , 2008, RNA.

[39]  Dennis B. Troup,et al.  NCBI GEO: mining tens of millions of expression profiles—database and tools update , 2006, Nucleic Acids Res..

[40]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[41]  Tzvi Aviv,et al.  The NMR and X-ray structures of the Saccharomyces cerevisiae Vts1 SAM domain define a surface for the recognition of RNA hairpins. , 2006, Journal of molecular biology.

[42]  Rainer Breitling,et al.  Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments , 2004, FEBS letters.

[43]  Robert F. Tate,et al.  Correlation Between a Discrete and a Continuous Variable. Point-Biserial Correlation , 1954 .

[44]  C. Smibert,et al.  Drosophila Maternal Hsp83 mRNA Destabilization Is Directed by Multiple SMAUG Recognition Elements in the Open Reading Frame , 2008, Molecular and Cellular Biology.

[45]  S. Cohen,et al.  microRNA functions. , 2007, Annual review of cell and developmental biology.

[46]  David Botstein,et al.  SGD: Saccharomyces Genome Database , 1998, Nucleic Acids Res..

[47]  O. Larsson,et al.  Eukaryotic translation initiation factor 4E induced progression of primary human mammary epithelial cells along the cancer pathway is associated with targeted translational deregulation of oncogenic drivers and inhibitors. , 2007, Cancer research.

[48]  R. Wharton,et al.  Smaug, a novel RNA-binding protein that operates a translational switch in Drosophila. , 1999, Molecular cell.

[49]  Tzvi Aviv,et al.  The RNA-binding SAM domain of Smaug defines a new family of post-transcriptional regulators , 2003, Nature Structural Biology.

[50]  Joseph Lev,et al.  The Point Biserial Coefficient of Correlation , 1949 .

[51]  M. Gorospe,et al.  Control of gene expression during T cell activation: alternate regulation of mRNA transcription and mRNA stability , 2005, BMC Genomics.

[52]  Florian C. Oberstrass,et al.  Shape-specific recognition in the structure of the Vts1p SAM domain with RNA , 2006, Nature Structural &Molecular Biology.