Synthetic Standards Combined With Error and Bias Correction Improve the Accuracy and Quantitative Resolution of Antibody Repertoire Sequencing in Human Naïve and Memory B Cells

High-throughput sequencing of immunoglobulin (Ig) repertoires (Ig-seq) is a powerful method for quantitatively interrogating B cell receptor sequence diversity. When applied to human repertoires, Ig-seq provides insight into fundamental immunological questions and can be implemented in diagnostic and drug discovery projects. However, a major challenge in Ig-seq is ensuring accuracy, as library preparation protocols and sequencing platforms can introduce substantial errors and bias that compromise immunological interpretation. Here, we have established an approach for performing highly accurate human Ig-seq by combining synthetic standards with a comprehensive error and bias correction pipeline. First, we designed a set of 85 synthetic antibody heavy-chain standards (in vitro transcribed RNA) to assess correction workflow fidelity. Next, we adapted a library preparation protocol that incorporates unique molecular identifiers (UIDs) for error and bias correction; when applied to the synthetic standards, this protocol resulted in highly accurate data. Finally, we performed Ig-seq on purified human circulating B cell subsets (naïve and memory), combined with a cellular replicate sampling strategy. This strategy enabled robust and reliable estimation of key repertoire features such as clonotype diversity, germline segment and isotype subclass usage, and somatic hypermutation. We anticipate that our standards and error and bias correction pipeline will become a valuable tool for researchers to validate and improve accuracy in human Ig-seq studies, potentially leading to new insights and applications in human antibody repertoire profiling.
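The general idea behind UID-based correction can be illustrated with a minimal sketch. This is not the pipeline described here; it only shows the common principle that reads sharing a UID derive from one original molecule, so a per-UID consensus suppresses sequencing errors and counting UIDs instead of reads reduces amplification bias. The function names, the `min_reads` threshold, and the toy reads are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def consensus(seqs):
    """Majority-vote consensus over equal-length reads (error correction)."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*seqs))


def correct_by_uid(reads, min_reads=2):
    """Collapse (uid, sequence) pairs into molecule counts per sequence.

    Returns {consensus_sequence: number_of_distinct_uids}.
    min_reads is a hypothetical threshold: UID groups with fewer reads
    are discarded because no consensus can be formed from them.
    """
    groups = defaultdict(list)
    for uid, seq in reads:
        groups[uid].append(seq)

    counts = Counter()
    for seqs in groups.values():
        if len(seqs) < min_reads:
            continue
        # Keep only reads of the most common length so columns align.
        length = Counter(len(s) for s in seqs).most_common(1)[0][0]
        aligned = [s for s in seqs if len(s) == length]
        # One UID = one original molecule, regardless of read depth.
        counts[consensus(aligned)] += 1
    return dict(counts)


# Toy data: (UID, read sequence). One read under UID "AAT" carries an error.
reads = [
    ("AAT", "ACGT"), ("AAT", "ACGT"), ("AAT", "ACCT"),
    ("GGC", "ACGT"), ("GGC", "ACGT"),
    ("TTA", "TTGA"), ("TTA", "TTGA"), ("TTA", "TTGA"), ("TTA", "TTGA"),
    ("CCC", "GGGG"),
]
molecule_counts = correct_by_uid(reads)
```

In this toy input, the erroneous read is outvoted within its UID group, and the sequence `TTGA`, although read four times, is counted as a single molecule because all four reads share one UID.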
