Proteomic Identification of Monoclonal Antibodies from Serum

Characterizing the in vivo dynamics of the polyclonal antibody repertoire in serum, such as that which might arise in response to stimulation with an antigen, is difficult due to the presence of many highly similar immunoglobulin proteins, each specified by distinct B lymphocytes. These challenges have precluded the use of conventional mass spectrometry for antibody identification based on peptide mass spectral matches to a genomic reference database. Recently, progress has been made using bottom-up analysis of serum antibodies by nanoflow liquid chromatography/high-resolution tandem mass spectrometry combined with a sample-specific antibody sequence database generated by high-throughput sequencing of individual B cell immunoglobulin variable domains (V genes). Here, we describe how intrinsic features of antibody primary structure, most notably the interspersed segments of variable and conserved amino acid sequences, generate recurring patterns in the corresponding peptide mass spectra of V gene peptides, greatly complicating the assignment of correct sequences to mass spectral data. We show that the standard method of decoy-based error modeling fails to account for the error introduced by these highly similar sequences, leading to a significant underestimation of the false discovery rate. Because of these effects, antibody-derived peptide mass spectra require increased stringency in their interpretation. The use of filters based on the mean precursor ion mass accuracy of peptide-spectrum matches is shown to be particularly effective in distinguishing between “true” and “false” identifications. These findings highlight important caveats associated with the use of standard database search and error-modeling methods with nonstandard data sets and custom sequence databases.

[1]  Olga V. Britanova,et al.  Age-Related Decrease in TCR Repertoire Diversity Measured with Deep and Normalized Sequence Profiling , 2014, The Journal of Immunology.

[2]  Sean A Beausoleil,et al.  A proteomics approach for the identification and cloning of monoclonal antibodies from serum , 2012, Nature Biotechnology.

[3]  Andrew D. Ellington,et al.  Identification and characterization of the constituent human serum antibodies elicited by vaccination , 2014, Proceedings of the National Academy of Sciences.

[4]  D. Tarlinton,et al.  Diversity Among Memory B Cells: Origin, Consequences, and Utility , 2013, Science.

[5]  A. Nesvizhskii A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics. , 2010, Journal of proteomics.

[6]  R. V. van Klaveren,et al.  Sequencing and quantifying IgG fragments and antigen-binding regions by mass spectrometry. , 2010, Journal of proteome research.

[7]  William Stafford Noble,et al.  Semi-supervised learning for peptide identification from shotgun proteomics datasets , 2007, Nature Methods.

[8]  George Georgiou,et al.  High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire , 2013, Nature Biotechnology.

[9]  P. S. Andersen,et al.  Kinetic, Affinity, and Diversity Limits of Human Polyclonal Antibody Responses against Tetanus Toxoid , 2007, The Journal of Immunology.

[10]  Sean A Beausoleil,et al.  Proteomics-directed cloning of circulating antiviral human monoclonal antibodies , 2012, Nature Biotechnology.

[11]  T. Luider,et al.  An antibody-based biomarker discovery method by mass spectrometry sequencing of complementarity determining regions , 2010, Analytical and bioanalytical chemistry.

[12]  Andrew D. Ellington,et al.  Molecular deconvolution of the monoclonal antibodies that comprise the polyclonal serum response , 2013, Proceedings of the National Academy of Sciences.

[13]  Jan Berka,et al.  Precise determination of the diversity of a combinatorial antibody library gives insight into the human immunoglobulin repertoire , 2009, Proceedings of the National Academy of Sciences.

[14]  Jinsam You,et al.  A simplified procedure for the reduction and alkylation of cysteine residues in proteins prior to proteolytic digestion and mass spectral analysis. , 2004, Analytical biochemistry.

[15]  P. Pevzner,et al.  Automated de novo protein sequencing of monoclonal antibodies , 2008, Nature Biotechnology.

[16]  Stephen R. Quake,et al.  Genetic measurement of memory B-cell recall using antibody repertoire sequencing , 2013, Proceedings of the National Academy of Sciences.

[17]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[18]  R. White,et al.  High-Throughput Sequencing of the Zebrafish Antibody Repertoire , 2009, Science.

[19]  M. Mann,et al.  Software Lock Mass by Two-Dimensional Minimization of Peptide Mass Errors , 2011, Journal of the American Society for Mass Spectrometry.

[20]  B. Briney,et al.  Secondary mechanisms of diversification in the human antibody repertoire , 2013, Front. Immunol..

[21]  Edward M Marcotte,et al.  How do shotgun proteomics algorithms identify proteins? , 2007, Nature Biotechnology.

[22]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[23]  Andrew R. Jones,et al.  ProteomeXchange provides globally co-ordinated proteomics data submission and dissemination , 2014, Nature Biotechnology.

[24]  Seung Hyun Kang,et al.  Monoclonal antibodies isolated without screening by analyzing the variable-gene repertoire of plasma cells , 2010, Nature Biotechnology.