The role of alternative translation start sites in the generation of human protein diversity

According to the scanning model, 40S ribosomal subunits initiate translation at the first (5′-proximal) AUG codon they encounter. If that AUG lies in a suboptimal sequence context, however, it may not be recognized, and translation can then initiate at downstream AUG(s). In this way, a single mRNA can yield several protein variants. Earlier experiments suggested that some of these additional variants might be functionally important. We analysed human mRNAs that carry AUG triplets in their 5′ untranslated regions and mRNAs whose annotated translational start codon lies in a suboptimal context. We found that 3% of human mRNAs have the potential to encode N-terminally extended variants of the annotated proteins and 12% could encode N-terminally truncated variants. Comparing the predicted subcellular localizations of these variants showed that 31% of the N-extended proteins and 30% of the N-truncated proteins were predicted to localize to subcellular compartments different from those targeted by the annotated protein forms. These results suggest that additional AUGs may frequently be exploited to synthesize proteins with novel functional properties.
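The two classes of variant described above can be illustrated with a minimal Python sketch. It is not the pipeline used in this study. The context rule is a simplified assumption based on the commonly cited Kozak positions (a purine at -3 and a G at +4), and the function names `context_strength`, `n_extended_starts` and `n_truncated_start`, as well as the toy sequence, are hypothetical.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def context_strength(mrna, pos):
    """Classify the context of the AUG starting at index pos.

    Assumed simplified rule: 'strong' if there is a purine at -3 and a G
    at +4, 'adequate' if only one of the two is present, 'weak' otherwise.
    """
    minus3 = mrna[pos - 3] if pos >= 3 else ""
    plus4 = mrna[pos + 3] if pos + 3 < len(mrna) else ""
    has_purine = minus3 in ("A", "G")
    has_g = plus4 == "G"
    if has_purine and has_g:
        return "strong"
    if has_purine or has_g:
        return "adequate"
    return "weak"


def n_extended_starts(mrna, annotated_start):
    """Upstream AUGs in frame with the annotated start and not separated
    from it by an in-frame stop codon (candidate N-extended variants)."""
    starts = []
    pos = annotated_start - 3
    while pos >= 0:
        codon = mrna[pos:pos + 3]
        if codon in STOP_CODONS:
            break  # an in-frame stop ends the possible extension
        if codon == "AUG":
            starts.append(pos)
        pos -= 3
    return sorted(starts)


def n_truncated_start(mrna, annotated_start):
    """First downstream in-frame AUG, reported only when the annotated AUG
    is not in a strong context (candidate N-truncated variant)."""
    if context_strength(mrna, annotated_start) == "strong":
        return None
    pos = annotated_start + 3
    while pos + 3 <= len(mrna):
        codon = mrna[pos:pos + 3]
        if codon in STOP_CODONS:
            return None  # reading frame ends before another AUG
        if codon == "AUG":
            return pos
        pos += 3
    return None


# Toy mRNA (hypothetical): annotated start codon at index 12.
toy_mrna = "GCC" "AUG" "GCU" "UUU" "AUG" "CCC" "AUG" "GGG" "UAA"
annotated = 12

print(context_strength(toy_mrna, annotated))
print(n_extended_starts(toy_mrna, annotated))
print(n_truncated_start(toy_mrna, annotated))
```

In this toy sequence the annotated AUG has a pyrimidine at -3 and no G at +4, so the sketch reports both an upstream in-frame AUG (a possible N-extended form) and a downstream in-frame AUG (a possible N-truncated form). The resulting protein sequences could then be passed to a localization predictor to compare targeting between variants.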
