Annotation of the non-canonical translatome reveals that CHO cell microproteins are a new class of mAb drug product impurity

Chinese hamster ovary (CHO) cells are used to produce almost 90% of therapeutic monoclonal antibodies (mAbs). The annotation of non-canonical translation events in these cellular factories remains incomplete, limiting our ability not only to study CHO cell biology but also to detect host cell protein (HCP) contaminants in the final mAb drug product. We utilised ribosome footprint profiling (Ribo-seq) to identify novel open reading frames (ORFs), including N-terminal extensions and thousands of short ORFs (sORFs) predicted to encode microproteins. Mass spectrometry-based HCP analysis of four commercial mAb drug products using the extended protein sequence database revealed the presence of microprotein impurities for the first time. We also show that microprotein abundance varies with growth phase and can be affected by the cell culture environment. In addition, our work provides a vital resource to facilitate future studies of non-canonical translation and of the regulation of protein synthesis in CHO cell lines.
