An optimized approach to the rapid assessment and detection of sequence variants in recombinant protein products

AbstractThe development of sensitive techniques to detect sequence variants (SVs), which naturally arise due to DNA mutations and errors in transcription/translation (amino acid misincorporations), has resulted in increased attention to their potential presence in protein-based biologic drugs in recent years. Often, these SVs may be below 0.1 %, adding challenges for consistent and accurate detection. Furthermore, the presence of false-positive (FP) signals, a hallmark of SV analysis, requires time-consuming analyst inspection of the data to sort true from erroneous signal. Consequently, gaps in information about the prevalence, type, and impact of SVs in marketed and in-development products are significant. Here, we report the results of a simple, straightforward, and sensitive approach to sequence variant analysis. This strategy employs mixing of two samples of an antibody or protein with the same amino acid sequence in a dilution series followed by subsequent sequence variant analysis. Using automated peptide map analysis software, a quantitative assessment of the levels of SVs in each sample can be made based on the signal derived from the mass spectrometric data. We used this strategy to rapidly detect differences in sequence variants in a monoclonal antibody after a change in process scale, and in a comparison of three mAbs as part of a biosimilar program. This approach is powerful, as true signals can be readily distinguished from FP signal, even at a level well below 0.1 %, by using a simple linear regression analysis across the data set with none to minimal inspection of the MS/MS data. Additionally, the data produced from these studies can also be used to make a quantitative assessment of relative levels of product quality attributes. The information provided here extends the published knowledge about SVs and provides context for the discussion around the potential impact of these SVs on product heterogeneity and immunogenicity. Graphical AbstractHistogram of 97 sequence variants detected in pilot and final scale mixing study of Antibody A. All Y>F adn Y>H sequence variants had an R2 value above 0.95, while several other sequence variants confirmed in both sample types had lower R2 values. The majority of false positive and unclassified sequence variants had R2 values below 0.3

[1]  A. Balland,et al.  Rapid identification of an antibody DNA construct rearrangement sequence variant by mass spectrometry , 2014, mAbs.

[2]  T. Arora,et al.  Development of a human antibody tolerant mouse model to assess the immunogenicity risk due to aggregated biotherapeutics. , 2013, Journal of pharmaceutical sciences.

[3]  M. Sliwkowski,et al.  Assessing Genetic Heterogeneity in Production Cell Lines: Detection by Peptide Mapping of a Low Level Tyr to Gln Sequence Variant in a Recombinant Antibody , 1993, Nature Biotechnology.

[4]  Zhongqi Zhang,et al.  Large-scale identification and quantification of covalent modifications in therapeutic proteins. , 2009, Analytical chemistry.

[5]  Zhongqi Zhang,et al.  G/U and certain wobble position mismatches as possible main causes of amino acid misincorporations. , 2013, Biochemistry.

[6]  C. Wilke,et al.  The evolutionary consequences of erroneous protein synthesis , 2009, Nature Reviews Genetics.

[7]  Hélène Charreire,et al.  Associations of Supermarket Characteristics with Weight Status and Body Fat: A Multilevel Analysis of Individuals within Supermarkets (RECORD Study) , 2012, PloS one.

[8]  Rafael Ponce,et al.  Immunogenicity of biologically-derived therapeutics: assessment and interpretation of nonclinical safety studies. , 2009, Regulatory toxicology and pharmacology : RTP.

[9]  Reb J. Russell,et al.  Glycine to glutamic acid misincorporation observed in a recombinant protein expressed by Escherichia coli cells , 2012, Protein science : a publication of the Protein Society.

[10]  Paul Chen,et al.  Control of misincorporation of serine for asparagine during antibody production using CHO cells , 2010, Biotechnology and bioengineering.

[11]  Zhongqi Zhang,et al.  Rearrangement of terminal amino acid residues in peptides by protease-catalyzed intramolecular transpeptidation. , 2006, Analytical biochemistry.

[12]  J. Zhu-Shimoni,et al.  Identification of a single base-pair mutation of TAA (Stop codon) → GAA (Glu) that causes light chain extension in a CHO cell derived IgG1 , 2012, mAbs.

[13]  A. Balland,et al.  Characterization of antibody charge heterogeneity resolved by preparative immobilized pH gradients. , 2010, Analytical chemistry.

[14]  Yong J. Kil,et al.  Byonic: Advanced Peptide and Protein Identification Software , 2012, Current protocols in bioinformatics.

[15]  Amy Shen,et al.  Eliminating tyrosine sequence variants in CHO cell lines producing recombinant monoclonal antibodies , 2013, Biotechnology and bioengineering.

[16]  Ingo H Gorr,et al.  Low Level Sequence Variant Analysis of Recombinant Proteins: An Optimized Approach , 2012, PloS one.

[17]  V. Katta,et al.  Detecting low level sequence variants in recombinant monoclonal antibodies , 2010, mAbs.