Evolutionary conservation suggests a regulatory function of AUG triplets in 5′-UTRs of eukaryotic genes

By comparing sequences of human, mouse and rat orthologous genes, we show that in 5′-untranslated regions (5′-UTRs) of mammalian cDNAs but not in 3′-UTRs or coding sequences, AUG is conserved to a significantly greater extent than any of the other 63 nt triplets. This effect is likely to reflect, primarily, bona fide evolutionary conservation, rather than cDNA annotation artifacts, because the excess of conserved upstream AUGs (uAUGs) is seen in 5′-UTRs containing stop codons in-frame with the start AUG and many of the conserved AUGs are found in different frames, consistent with the location in authentic non-coding sequences. Altogether, conserved uAUGs are present in at least 20–30% of mammalian genes. Qualitatively similar results were obtained by comparison of orthologous genes from different species of the yeast genus Saccharomyces. Together with the observation that mammalian and yeast 5′-UTRs are significantly depleted in overall AUG content, these findings suggest that AUG triplets in 5′-UTRs are subject to the pressure of purifying selection in two opposite directions: the uAUGs that have no specific function tend to be deleterious and get eliminated during evolution, whereas those uAUGs that do serve a function are conserved. Most probably, the principal role of the conserved uAUGs is attenuation of translation at the initiation stage, which is often additionally regulated by alternative splicing in the mammalian 5′-UTRs. Consistent with this hypothesis, we found that open reading frames starting from conserved uAUGs are significantly shorter than those starting from non-conserved uAUGs, possibly, owing to selection for optimization of the level of attenuation.

[1]  A E Willis,et al.  Translational control of growth factor and proto-oncogene expression. , 1999, The international journal of biochemistry & cell biology.

[2]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[3]  J. McCarthy,et al.  Posttranscriptional Control of Gene Expression in Yeast , 1998, Microbiology and Molecular Biology Reviews.

[4]  M. Hentze,et al.  Molecular mechanisms of translational control , 2004, Nature Reviews Molecular Cell Biology.

[5]  J. Fütterer,et al.  Translation in plants-rules and exceptions , 1996, Plant Molecular Biology.

[6]  M. Kozak,et al.  A short leader sequence impairs the fidelity of initiation by eukaryotic ribosomes. , 1991, Gene expression.

[7]  G. Pesole,et al.  Structural and compositional features of untranslated regions of eukaryotic mRNAs. , 1997, Gene.

[8]  M. Kozak,et al.  Constraints on reinitiation of translation in mammals. , 2001, Nucleic acids research.

[9]  M. Kozak Alternative ways to think about mRNA sequences and proteins that appear to promote internal initiation of translation. , 2003, Gene.

[10]  G. Edelman,et al.  Biochemical and functional analysis of a 9-nt RNA sequence that affects translation efficiency in eukaryotic cells. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Anatoly S. Frolov,et al.  Prediction of eukaryotic mRNA translational properties , 1999, Bioinform..

[12]  A Suyama,et al.  Statistical analysis of the 5' untranslated region of human mRNA using "Oligo-Capped" cDNA libraries. , 2000, Genomics.

[13]  M. Kozak,et al.  Do the 5'untranslated domains of human cDNAs challenge the rules for initiation of translation (or is it vice versa)? , 2000, Genomics.

[14]  D J Lipman,et al.  Lineage-specific loss and divergence of functionally linked genes in eukaryotes. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[15]  E. Krebs,et al.  Translational activation of the lck proto-oncogene , 1988, Nature.

[16]  Michael Ruogu Zhang,et al.  CART classification of human 5' UTR sequences. , 2000, Genome research.

[17]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[18]  C. Watanabe,et al.  Compilation and comparison of the sequence context around the AUG startcodons in Saccharomyces cerevisiae mRNAs. , 1987, Nucleic acids research.

[19]  Luciano Milanesi,et al.  Presence of ATG triplets in 5' untranslated regions of eukaryotic cDNAs correlates with a 'weak' context of the start codon , 2001, Bioinform..

[20]  M. Kozak,et al.  Pushing the limits of the scanning mechanism for initiation of translation , 2002, Gene.

[21]  Elena Rivas,et al.  Noncoding RNA gene detection using comparative sequence analysis , 2001, BMC Bioinformatics.

[22]  E. Koonin Orthologs, paralogs, and evolutionary genomics. , 2005, Annual review of genetics.

[23]  M. Kozak An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs. , 1987, Nucleic acids research.

[24]  L. Vitale,et al.  mRNA 5' region sequence incompleteness: a potential source of systematic errors in translation initiation codon assignment in human mRNAs. , 2003, Gene.

[25]  M. Hentze,et al.  New Ways of Initiating Translation in Eukaryotes? , 2001, Molecular and Cellular Biology.

[26]  F Sherman,et al.  mRNA sequences influencing translation and the selection of AUG initiator codons in the yeast Saccharomyces cerevisiae , 1996, Molecular microbiology.

[27]  N. Gray,et al.  Regulation of mRNA translation by 5 0-and 3 0-UTR-binding factors q , 2003 .

[28]  Luciano Milanesi,et al.  Prediction and Phylogenetic Analysis of Mammalian Short Interspersed Elements (SINEs) , 2000, Briefings Bioinform..

[29]  M. Gelfand,et al.  Frequent alternative splicing of human genes. , 1999, Genome research.

[30]  E. J. de la Rosa,et al.  Upstream AUGs in embryonic proinsulin mRNA control its low translation level , 2003, The EMBO journal.

[31]  C Saccone,et al.  Sequence analysis and compositional properties of untranslated regions of human mRNAs. , 1994, Gene.

[32]  D. Morris,et al.  Initiation codons within 5'-leaders of mRNAs as regulators of translation. , 1994, Trends in biochemical sciences.

[33]  M. Kozak,et al.  Recognition of AUG and alternative initiator codons is augmented by G in position +4 but is not generally affected by the nucleotides in positions +5 and +6 , 1997, The EMBO journal.

[34]  L Milanesi,et al.  Protein-coding regions prediction combining similarity searches and conservative evolutionary properties of protein-coding sequences. , 1999, Gene.

[35]  N A Kolchanov,et al.  Eukaryotic mRNAs encoding abundant and scarce proteins are statistically dissimilar in many structural features , 1998, FEBS letters.

[36]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[37]  N. Gray,et al.  Regulation of mRNA translation by 5'- and 3'-UTR-binding factors. , 2003, Trends in biochemical sciences.

[38]  Y. Dorokhov,et al.  Polypurine (A)-rich sequences promote cross-kingdom conservation of internal ribosome entry , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[39]  H. Meijer,et al.  Control of eukaryotic protein synthesis by upstream open reading frames in the 5'-untranslated region of an mRNA. , 2002, The Biochemical journal.

[40]  A. Sachs Cell Cycle–Dependent Translation Initiation IRES Elements Prevail , 2000, Cell.

[41]  C Saccone,et al.  Analysis of oligonucleotide AUG start codon context in eukariotic mRNAs. , 2000, Gene.

[42]  D. Cavener,et al.  Ultrabithorax and Antennapedia 5' untranslated regions promote developmentally regulated internal translation initiation , 1997, Molecular and cellular biology.

[43]  A. Kochetov AUG codons at the beginning of protein coding sequences are frequent in eukaryotic mRNAs with a suboptimal start codon context , 2005, Bioinform..

[44]  Nikolay A. Kolchanov,et al.  The role of alternative translation start sites in the generation of human protein diversity , 2005, Molecular Genetics and Genomics.

[45]  E. Koonin Orthologs, Paralogs, and Evolutionary Genomics 1 , 2005 .

[46]  A. Cigan,et al.  Mutational analysis of the HIS4 translational initiator region in Saccharomyces cerevisiae , 1988, Molecular and cellular biology.

[47]  A. Willis,et al.  Cellular internal ribosome entry segments: structures, trans-acting factors and regulation of gene expression , 2004, Oncogene.

[48]  D. Cavener,et al.  Eukaryotic start and stop translation sites. , 1991, Nucleic acids research.

[49]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[50]  Xue-Qing Wang,et al.  5'-untranslated regions with multiple upstream AUG codons can support low-level translation via leaky scanning and reinitiation. , 2004, Nucleic acids research.

[51]  Vincent L. Chiang,et al.  Context sequences of translation initiation codon in plants , 1997, Plant Molecular Biology.

[52]  A. W. van der Velden,et al.  The role of the 5' untranslated region of an mRNA in translation regulation during development. , 1999, The international journal of biochemistry & cell biology.

[53]  Chankyu Park,et al.  A splice variant acquiring an extra transcript leader region decreases the translation of glutamine synthetase gene. , 2003, The Biochemical journal.

[54]  M. Kozak The scanning model for translation: an update , 1989, The Journal of cell biology.

[55]  Graziano Pesole,et al.  uAUG and uORFs in human and rodent 5'untranslated mRNAs. , 2005, Gene.

[56]  J P Reboud,et al.  [Initiation of protein synthesis in eukaryotic cells]. , 1969, Comptes rendus hebdomadaires des seances de l'Academie des sciences. Serie D: Sciences naturelles.

[57]  Eugene V Koonin,et al.  Comparative analysis of orthologous eukaryotic mRNAs: potential hidden functional signals. , 2004, Nucleic acids research.

[58]  G. Glazko,et al.  [Context organization of mRNA 5'-untranslated regions of higher plants]. , 2002, Molekuliarnaia biologiia.

[59]  M. Kozak Determinants of translational fidelity and efficiency in vertebrate mRNAs , 1994, Biochimie.

[60]  Graziano Pesole,et al.  Evolutionary Dynamics of Mammalian MRNA Untranslated Regions by Comparative Analysis of Orthologous Human, Artiodactyl and Rodent Gene Pairs , 2002, Comput. Chem..

[61]  C P Joshi,et al.  An inspection of the domain between putative TATA box and translation start site in 79 plant genes. , 1987, Nucleic acids research.

[62]  S Y Le,et al.  A common RNA structural motif involved in the internal initiation of translation of cellular mRNAs. , 1997, Nucleic acids research.

[63]  D. Morris,et al.  Upstream Open Reading Frames as Regulators of mRNA Translation , 2000, Molecular and Cellular Biology.

[64]  P. Sarnow,et al.  Internal ribosome entry sites in eukaryotic mRNA molecules. , 2001, Genes & development.

[65]  V. Agol,et al.  Molecular mechanisms of translation initiation in eukaryotes , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[66]  P. Sarnow,et al.  Internal initiation of translation mediated by the 5′ leader of a cellular mRNA , 1991, Nature.