Identification of putative regulatory upstream ORFs in the yeast genome using heuristics and evolutionary conservation

BackgroundThe translational efficiency of an mRNA can be modulated by upstream open reading frames (uORFs) present in certain genes. A uORF can attenuate translation of the main ORF by interfering with translational reinitiation at the main start codon. uORFs also occur by chance in the genome, in which case they do not have a regulatory role. Since the sequence determinants for functional uORFs are not understood, it is difficult to discriminate functional from spurious uORFs by sequence analysis.ResultsWe have used comparative genomics to identify novel uORFs in yeast with a high likelihood of having a translational regulatory role. We examined uORFs, previously shown to play a role in regulation of translation in Saccharomyces cerevisiae, for evolutionary conservation within seven Saccharomyces species. Inspection of the set of conserved uORFs yielded the following three characteristics useful for discrimination of functional from spurious uORFs: a length between 4 and 6 codons, a distance from the start of the main ORF between 50 and 150 nucleotides, and finally a lack of overlap with, and clear separation from, neighbouring uORFs. These derived rules are inherently associated with uORFs with properties similar to the GCN4 locus, and may not detect most uORFs of other types. uORFs with high scores based on these rules showed a much higher evolutionary conservation than randomly selected uORFs. In a genome-wide scan in S. cerevisiae, we found 34 conserved uORFs from 32 genes that we predict to be functional; subsequent analysis showed the majority of these to be located within transcripts. A total of 252 genes were found containing conserved uORFs with properties indicative of a functional role; all but 7 are novel. Functional content analysis of this set identified an overrepresentation of genes involved in transcriptional control and development.ConclusionEvolutionary conservation of uORFs in yeasts can be traced up to 100 million years of separation. The conserved uORFs have certain characteristics with respect to length, distance from each other and from the main start codon, and folding energy of the sequence. These newly found characteristics can be used to facilitate detection of other conserved uORFs.

[1]  F. Dietrich,et al.  Identification and characterization of upstream open reading frames (uORF) in the 5′ untranslated regions (UTR) of genes in Saccharomyces cerevisiae , 2005, Current Genetics.

[2]  T. Fox,et al.  Pet111p, an Inner Membrane-bound Translational Activator That Limits Expression of the Saccharomyces cerevisiaeMitochondrial Gene COX2 * , 2001, The Journal of Biological Chemistry.

[3]  Mark L Crowe,et al.  Evidence for conservation and selection of upstream open reading frames suggests probable encoding of bioactive peptides , 2006, BMC Genomics.

[4]  Tapash Chandra Ghosh,et al.  Shannon's uncertainty principle and gene expression levels , 2004 .

[5]  M. Wright,et al.  Amino acid substitutions in membrane-spanning domains of Hol1, a member of the major facilitator superfamily of transporters, confer nonselective cation uptake in Saccharomyces cerevisiae , 1996, Journal of bacteriology.

[6]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[7]  Graziano Pesole,et al.  UTRdb: a specialized database of 5'- and 3'-untranslated regions of eukaryotic mRNAs , 1998, Nucleic Acids Res..

[8]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[9]  C. Rodrigues-Pousada,et al.  The yeast transcription factor genes YAP1 and YAP2 are subject to differential control at the levels of both translation and mRNA stability. , 1998, Nucleic acids research.

[10]  T. Graves,et al.  Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. , 2001, Genome research.

[11]  D. Morris,et al.  Upstream Open Reading Frames as Regulators of mRNA Translation , 2000, Molecular and Cellular Biology.

[12]  J. McCarthy,et al.  Regulation of fungal gene expression via short open reading frames in the mRNA 5′untranslated region , 2003, Molecular microbiology.

[13]  A. Hinnebusch,et al.  7 Translational Control of GCN4: Gene-specific Regulation by Phosphorylation of elF2 , 1996 .

[14]  Bonnie Berger,et al.  Methods in Comparative Genomics: Genome Correspondence, Gene Identification and Regulatory Motif Discovery , 2004, J. Comput. Biol..

[15]  H. Miyasaka,et al.  The positive relationship between codon usage bias and translation initiation AUG context in Saccharomyces cerevisiae , 1999, Yeast.

[16]  Per Sunnerhagen,et al.  Rck2 is required for reprogramming of ribosomes during oxidative stress. , 2005, Molecular biology of the cell.

[17]  Edward H. Shortliffe,et al.  A model of inexact reasoning in medicine , 1990 .

[18]  Per Sunnehagen,et al.  Comparative genomics : using fungi as models , 2006 .

[19]  Markus Ringnér,et al.  Folding Free Energies of 5′-UTRs Impact Post-Transcriptional Regulation on a Genomic Scale in Yeast , 2005, PLoS Comput. Biol..

[20]  C. Ball,et al.  Saccharomyces Genome Database. , 2002, Methods in enzymology.

[21]  M. Polymenis,et al.  Coupling of cell division to cell growth by translational control of the G1 cyclin CLN3 in yeast. , 1997, Genes & development.

[22]  F. Dietrich,et al.  Mapping of transcription start sites in Saccharomyces cerevisiae using 5′ SAGE , 2005, Nucleic acids research.

[23]  M. Kozak,et al.  Pushing the limits of the scanning mechanism for initiation of translation , 2002, Gene.

[24]  G. Rödel,et al.  AUG codons in the RNA leader sequences of the yeast PET genes CBS1 and SCO1 have no influence on translation efficiency , 1991, Current Genetics.

[25]  Michael Hampsey,et al.  Molecular Genetics of the RNA Polymerase II General Transcriptional Machinery , 1998, Microbiology and Molecular Biology Reviews.

[26]  B. G. Luukkonen,et al.  Efficiency of reinitiation of translation on human immunodeficiency virus type 1 mRNAs is determined by the length of the upstream open reading frame and by intercistronic distance , 1995, Journal of virology.

[27]  John D. Storey,et al.  Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  G M Edelman,et al.  Transcript leader regions of two Saccharomyces cerevisiae mRNAs contain internal ribosome entry sites that function in living cells. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  L. Fulton,et al.  Finding Functional Features in Saccharomyces Genomes by Phylogenetic Footprinting , 2003, Science.

[30]  Wolfgang Huber,et al.  A high-resolution map of transcription in the yeast genome. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[31]  P Sarnow,et al.  Cap-dependent and cap-independent translation by internal initiation of mRNAs in cell extracts prepared from Saccharomyces cerevisiae , 1994, Molecular and cellular biology.

[32]  C. Rodrigues-Pousada,et al.  Post‐termination ribosome interactions with the 5′UTR modulate yeast mRNA stability , 1999, The EMBO journal.

[33]  B. M. Jackson,et al.  Suppression of ribosomal reinitiation at upstream open reading frames in amino acid-starved cells forms the basis for GCN4 translational control , 1991, Molecular and cellular biology.

[34]  P. Sharp,et al.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. , 1987, Nucleic acids research.

[35]  Christina A. Cuomo,et al.  Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae , 2005, Nature.

[36]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[37]  H. Meijer,et al.  Control of eukaryotic protein synthesis by upstream open reading frames in the 5'-untranslated region of an mRNA. , 2002, The Biochemical journal.

[38]  Fatima Sanchez-Cabo,et al.  Global Gene Expression Profiling Reveals Widespread yet Distinctive Translational Responses to Different Eukaryotic Translation Initiation Factor 2B-Targeting Stress Pathways , 2005, Molecular and Cellular Biology.

[39]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[40]  Cletus P. Kurtzman,et al.  Taxonomy and phylogenetic diversity among the yeasts , 2006 .

[41]  F. Messenguy,et al.  A segment of mRNA encoding the leader peptide of the CPA1 gene confers repression by arginine on a heterologous yeast gene transcript , 1994, Molecular and cellular biology.

[42]  Allan Jacobson,et al.  Ribosome occupancy of the yeast CPA1 upstream open reading frame termination codon modulates nonsense-mediated mRNA decay. , 2005, Molecular cell.