A comprehensive CHO SWATH-MS spectral library for robust quantitative profiling of 10,000 proteins

Sequential window acquisition of all theoretical fragment-ion spectra (SWATH) is a data-independent acquisition (DIA) strategy that requires a specific spectral library to generate unbiased and consistent quantitative data matrices of all peptides. SWATH-MS is a promising approach for in-depth proteomic profiling of Chinese hamster Ovary (CHO) cell lines, improving mechanistic understanding of process optimization, and real-time monitoring of process parameters in biologics R&D and manufacturing. However, no spectral library for CHO cells is publicly available. Here we present a comprehensive CHO global spectral library to measure the abundance of more than 10,000 proteins consisting of 199,102 identified peptides from a CHO-K1 cell proteome. The robustness, accuracy and consistency of the spectral library were validated for high confidence in protein identification and reproducible quantification in different CHO-derived cell lines, instrumental setups and downstream processing samples. The availability of a comprehensive SWATH CHO global spectral library will facilitate detailed characterization of upstream and downstream processes, as well as quality by design (QbD) in biomanufacturing. The data have been deposited to ProteomeXchange (PXD016047). Measurement(s) Proteomic Profile • database type spectral library Technology Type(s) mass spectrometry • SWATH MS protein profiling assay • computational modeling technique Sample Characteristic - Organism Cricetulus griseus Measurement(s) Proteomic Profile • database type spectral library Technology Type(s) mass spectrometry • SWATH MS protein profiling assay • computational modeling technique Sample Characteristic - Organism Cricetulus griseus Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.12480386

[1]  Martin Eisenacher,et al.  The PRIDE database and related tools and resources in 2019: improving support for quantification data , 2018, Nucleic Acids Res..

[2]  Kai Stühler,et al.  Retention time alignment algorithms for LC/MS data must consider non-linear shifts , 2009, Bioinform..

[3]  Yoon Pin Lim,et al.  Identification of autocrine growth factors secreted by CHO cells for applications in single-cell cloning media. , 2013, Journal of proteome research.

[4]  Lisa Urquhart,et al.  Top drugs and companies by sales in 2018 , 2019, Nature Reviews Drug Discovery.

[5]  Nathan E Lewis,et al.  Elucidation of the CHO Super-Ome (CHO-SO) by Proteoinformatics. , 2015, Journal of proteome research.

[6]  Robert C. Wolpert,et al.  A Review of the , 1985 .

[7]  Lisa Urquhart,et al.  Market watch: Top drugs and companies by sales in 2017 , 2018, Nature Reviews Drug Discovery.

[8]  William Stafford Noble,et al.  Technical advances in proteomics: new developments in data-independent acquisition , 2016, F1000Research.

[9]  Ann L. Oberg,et al.  Statistical methods for quantitative mass spectrometry proteomic experiments with labeling , 2012, BMC Bioinformatics.

[10]  Abraham M Lenhoff,et al.  Identification and characterization of host cell protein product‐associated impurities in monoclonal antibody bioprocessing , 2014, Biotechnology and bioengineering.

[11]  Ian R. Lewis,et al.  Raman spectroscopy as a process analytical technology for pharmaceutical manufacturing and bioprocessing , 2016, Analytical and Bioanalytical Chemistry.

[12]  Emilio Medina-Rivero,et al.  Optimization of a recombinant human growth hormone purification process using quality by design , 2016, Preparative biochemistry & biotechnology.

[13]  Thomas Waerner,et al.  Experience with host cell protein impurities in biopharmaceuticals , 2018, Biotechnology progress.

[14]  Christie Hunter,et al.  Simplifying the Use of Ion Libraries During Data Processing of SWATH ® Acquisition Proteomics Data Ion Library Merging with Auto Retention Time Calibration in OneOmics TM Project , 2018 .

[15]  Donald E Walker,et al.  A modular and adaptive mass spectrometry-based platform for support of bioprocess development toward optimal host cell protein clearance , 2017, mAbs.

[16]  B. Manadas,et al.  A reference library of peripheral blood mononuclear cells for SWATH‐MS analysis , 2016, Proteomics. Clinical applications.

[17]  Michael C. Borys,et al.  The use of ‘Omics technology to rationally improve industrial mammalian cell line performance , 2016, Biotechnology and bioengineering.

[18]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[19]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[20]  Dana Pascovici,et al.  SWATH Mass Spectrometry Performance Using Extended Peptide MS/MS Assay Libraries* , 2016, Molecular & Cellular Proteomics.

[21]  M. Gorenstein,et al.  The detection, correlation, and comparison of peptide precursor and product ions from data independent LC‐MS with data dependant LC‐MS/MS , 2009, Proteomics.

[22]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[23]  Alexandre Zougman,et al.  Suspension trapping (STrap) sample preparation method for bottom‐up proteomics analysis , 2014, Proteomics.

[24]  Christine Carapito,et al.  Dual Data-Independent Acquisition Approach Combining Global HCP Profiling and Absolute Quantification of Key Impurities during Bioprocess Development. , 2018, Analytical chemistry.

[25]  Eric W. Deutsch,et al.  A repository of assays to quantify 10,000 human proteins by SWATH-MS , 2014, Scientific Data.

[26]  Kai Pong Law,et al.  Recent advances in mass spectrometry: data independent analysis and hyper reaction monitoring , 2013, Expert review of proteomics.

[27]  Michael J. Betenbaugh,et al.  Lessons from the Hamster: Cricetulus griseus Tissue and CHO Cell Line Proteome Comparison. , 2017, Journal of proteome research.

[28]  Nathan E Lewis,et al.  The emerging CHO systems biology era: harnessing the 'omics revolution for biotechnology. , 2013, Current opinion in biotechnology.

[29]  Abraham M. Lenhoff,et al.  Expression of difficult‐to‐remove host cell protein impurities during extended Chinese hamster ovary cell culture and their impact on continuous bioprocessing , 2015, Biotechnology and bioengineering.

[30]  Peter Willett,et al.  What is a tutorial , 2013 .

[31]  Lufang Zhou,et al.  Comparative Proteomic Analysis of Three Chinese Hamster Ovary (CHO) Host Cells. , 2017, Biochemical engineering journal.

[32]  Abraham M Lenhoff,et al.  Host cell protein impurities in chromatographic polishing steps for monoclonal antibody purification , 2016, Biotechnology and bioengineering.

[33]  Marion Kee,et al.  Analysis , 2004, Machine Translation.

[34]  Ian W Marison,et al.  Application of multi-omics techniques for bioprocess design and optimization in chinese hamster ovary cells. , 2014, Journal of proteome research.

[35]  Jaykaran Charan,et al.  Monoclonal Antibodies: A Review. , 2017, Current clinical pharmacology.

[36]  J. Bones,et al.  Identification and tracking of problematic host cell proteins removed by a synthetic, highly functionalized nonwoven media in downstream bioprocessing of monoclonal antibodies. , 2019, Journal of chromatography. A.

[37]  Steven P Gygi,et al.  Quantitative mass spectrometry-based multiplexing compares the abundance of 5000 S. cerevisiae proteins across 10 carbon sources. , 2016, Journal of proteomics.

[38]  Nathan E. Lewis,et al.  Multiplex secretome engineering enhances recombinant protein production and purity , 2019, bioRxiv.

[39]  Ruedi Aebersold,et al.  Descriptor : Generation of a zebra fi sh SWATH-MS spectral library to quantify 10 , 000 proteins , 2019 .

[40]  Ludovic C. Gillet,et al.  Data‐independent acquisition‐based SWATH‐MS for quantitative proteomics: a tutorial , 2018, Molecular systems biology.

[41]  Juan Antonio Vizcaíno,et al.  Using the PRIDE Database and ProteomeXchange for Submitting and Accessing Public Proteomics Datasets , 2017, Current protocols in bioinformatics.

[42]  Mark Brower,et al.  On-Line Ion Exchange Liquid Chromatography as a Process Analytical Technology for Monoclonal Antibody Characterization in Continuous Bioprocessing. , 2017, Analytical chemistry.

[43]  Anurag S Rathore,et al.  Optimization of a refolding step for a therapeutic fusion protein in the quality by design (QbD) paradigm. , 2012, Journal of separation science.

[44]  I. Xenarios,et al.  UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View. , 2016, Methods in molecular biology.

[45]  Amy P Abernethy,et al.  A comparison of FDA and EMA drug approval: implications for drug development and cost of care. , 2013, Oncology.

[46]  Tudor I. Oprea,et al.  A comprehensive map of molecular drug targets , 2016, Nature Reviews Drug Discovery.