Discovery of noncanonical translation initiation sites through mass spectrometric analysis of protein N termini

Translation initiation generally occurs at AUG codons in eukaryotes, although it has been shown that non-AUG or noncanonical translation initiation can also occur. However, the evidence for noncanonical translation initiation sites (TISs) is largely indirect and based on ribosome profiling (Ribo-seq) studies. Here, using a strategy specifically designed to enrich N termini of proteins, we demonstrate that many human proteins are translated at noncanonical TISs. The large majority of TISs that mapped to 5′ untranslated regions were noncanonical and led to N-terminal extension of annotated proteins or translation of upstream small open reading frames (uORF). It has been controversial whether the amino acid corresponding to the start codon is incorporated at the TIS or methionine is still incorporated. We found that methionine was incorporated at almost all noncanonical TISs identified in this study. Comparison of the TISs determined through mass spectrometry with ribosome profiling data revealed that about two-thirds of the novel annotations were indeed supported by the available ribosome profiling data. Sequence conservation across species and a higher abundance of noncanonical TISs than canonical ones in some cases suggests that the noncanonical TISs can have biological functions. Overall, this study provides evidence of protein translation initiation at noncanonical TISs and argues that further studies are required for elucidation of functional implications of such noncanonical translation initiation.

[1]  G. Menschaert,et al.  eIF1 modulates the recognition of suboptimal translation initiation sites and steers gene expression via uORFs , 2017, Nucleic Acids Research.

[2]  P. Willems,et al.  N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana , 2017, Molecular & Cellular Proteomics.

[3]  Joshua G. Dunn,et al.  Translation from unconventional 5′ start sites drives tumour initiation , 2017, Nature.

[4]  Joshua G. Dunn,et al.  Plastid: nucleotide-resolution analysis of next-generation sequencing and genomics data , 2016, BMC Genomics.

[5]  W. Van Criekinge,et al.  PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration , 2014, Nucleic acids research.

[6]  Shu-Bing Qian,et al.  Quantitative profiling of initiating ribosomes in vivo , 2014, Nature Methods.

[7]  T. Arnesen,et al.  Molecular, cellular, and physiological significance of N-terminal acetylation. , 2015, International review of cell and molecular biology.

[8]  Nicholas T Ingolia,et al.  Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes. , 2014, Cell reports.

[9]  Alan G Hinnebusch,et al.  The scanning mechanism of eukaryotic translation initiation. , 2014, Annual review of biochemistry.

[10]  Gary D Bader,et al.  A draft map of the human proteome , 2014, Nature.

[11]  W. Van Criekinge,et al.  N-terminal Proteomics and Ribosome Profiling Provide a Comprehensive View of the Alternative Translation Initiation Landscape in Mice and Men* , 2014, Molecular & Cellular Proteomics.

[12]  Philipp F. Lange,et al.  Annotating N Termini for the Human Proteome Project: N Termini and Nα-Acetylation Status Differentiate Stable Cleaved Protein Species from Degradation Remnants in the Human Erythrocyte Proteome , 2014, Journal of proteome research.

[13]  Ji Wan,et al.  TISdb: a database for alternative translation initiation in mammalian cells , 2013, Nucleic Acids Res..

[14]  Desmond G. Higgins,et al.  GWIPS-viz: development of a ribo-seq genome browser , 2013, Nucleic Acids Res..

[15]  Tamir Tuller,et al.  New Universal Rules of Eukaryotic Translation Initiation Fidelity , 2013, PLoS Comput. Biol..

[16]  真田 昌 骨髄異形成症候群のgenome-wide analysis , 2013 .

[17]  K. Gevaert,et al.  Deep Proteome Coverage Based on Ribosome Profiling Aids Mass Spectrometry-based Protein and Peptide Discovery and Provides Evidence of Alternative Translation Products and Near-cognate Translation Initiation Events* , 2013, Molecular & Cellular Proteomics.

[18]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[19]  B. Shen,et al.  Global mapping of translation initiation sites in mammalian cells at single-nucleotide resolution , 2012, Proceedings of the National Academy of Sciences.

[20]  N. Shastri,et al.  Leucine-tRNA Initiates at CUG Start Codons for Protein Synthesis and Presentation by MHC Class I , 2012, Science.

[21]  Nicholas T. Ingolia,et al.  Ribosome Profiling of Mouse Embryonic Stem Cells Reveals the Complexity and Dynamics of Mammalian Proteomes , 2011, Cell.

[22]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[23]  Christopher M Overall,et al.  Identifying and quantifying proteolytic events and the natural N terminome by terminal amine isotopic labeling of substrates , 2011, Nature Protocols.

[24]  Feiran Zhang,et al.  Protein N-terminal processing: substrate specificity of Escherichia coli and human methionine aminopeptidases. , 2010, Biochemistry.

[25]  L. Foster,et al.  Isotopic labeling of terminal amines in complex samples identifies protein N-termini and protease cleavage products , 2010, Nature Biotechnology.

[26]  Gerard Cagney,et al.  An Overview of Label-Free Quantitation Methods in Proteomics by Mass Spectrometry , 2010, Proteome Bioinformatics.

[27]  V. Gladyshev,et al.  CUG Start Codon Generates Thioredoxin/Glutathione Reductase Isoforms in Mouse Testes*♦ , 2009, Journal of Biological Chemistry.

[28]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[29]  Yvan Saeys,et al.  Translation initiation site prediction on a genomic scale: beauty in simplicity , 2007, ISMB/ECCB.

[30]  Thierry Meinnel,et al.  The Proteomics of N-terminal Methionine Cleavage*S , 2006, Molecular & Cellular Proteomics.

[31]  Doree Sitkoff,et al.  models homology modeling : From sequence alignments to structural A comparative study of available software for high-accuracy , 2005 .

[32]  N. Shastri,et al.  Unanticipated Antigens: Translation Initiation at CUG with Leucine , 2004, PLoS biology.

[33]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.