Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting

A new experimental-bioinformatic method was developed for error and bias correction in high-throughput antibody sequencing. High-throughput antibody repertoire sequencing (Ig-seq) provides quantitative molecular information on humoral immunity. However, Ig-seq is compromised by biases and errors introduced during library preparation and sequencing. By using synthetic antibody spike-in genes, we determined that primer bias from multiplex polymerase chain reaction (PCR) library preparation resulted in antibody frequencies with only 42 to 62% accuracy. Additionally, Ig-seq errors resulted in antibody diversity measurements being overestimated by up to 5000-fold. To rectify this, we developed molecular amplification fingerprinting (MAF), which uses unique molecular identifier (UID) tagging before and during multiplex PCR amplification, which enabled tagging of transcripts while accounting for PCR efficiency. Combined with a bioinformatic pipeline, MAF bias correction led to measurements of antibody frequencies with up to 99% accuracy. We also used MAF to correct PCR and sequencing errors, resulting in enhanced accuracy of full-length antibody diversity measurements, achieving 98 to 100% error correction. Using murine MAF-corrected data, we established a quantitative metric of recent clonal expansion—the intraclonal diversity index—which measures the number of unique transcripts associated with an antibody clone. We used this intraclonal diversity index along with antibody frequencies and somatic hypermutation to build a logistic regression model for prediction of the immunological status of clones. The model was able to predict clonal status with high confidence but only when using MAF error and bias corrected Ig-seq data. Improved accuracy by MAF provides the potential to greatly advance Ig-seq and its utility in immunology and biotechnology.

[1]  Chaim A. Schramm,et al.  Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus , 2013, Nature.

[2]  Enkelejda Miho,et al.  Bioinformatic and Statistical Analysis of Adaptive Immune Repertoires. , 2015, Trends in immunology.

[3]  Mohamed Uduman,et al.  Integrating B Cell Lineage Information into Statistical Tests for Detecting Selection in Ig Sequences , 2014, The Journal of Immunology.

[4]  John Shawe-Taylor,et al.  Computational analysis of stochastic heterogeneity in PCR amplification efficiency revealed by single molecule barcoding , 2015, Scientific Reports.

[5]  Claus V. Hallwirth,et al.  Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence , 2014, Nucleic acids research.

[6]  Andrew D. Ellington,et al.  Corrigendum: Facile Discovery of a Diverse Panel of Anti-Ebola Virus Antibodies by Immune Repertoire Mining , 2016, Scientific Reports.

[7]  Daphne Koller,et al.  The Effects of Somatic Hypermutation on Neutralization and Binding in the PGT121 Family of Broadly Neutralizing HIV Antibodies , 2013, PLoS pathogens.

[8]  T. Kepler,et al.  Analysis of immunoglobulin transcripts and hypermutation following SHIVAD8 infection and protein-plus-adjuvant immunization , 2015, Nature Communications.

[9]  Bhuvan Unhelkar,et al.  Strategies and Applications , 2011 .

[10]  H. Kettenberger,et al.  Developability assessment during the selection of novel therapeutic antibodies. , 2015, Journal of pharmaceutical sciences.

[11]  Anne E. Magurran,et al.  Biological Diversity: Frontiers in Measurement and Assessment , 2011 .

[12]  Robert A Holt,et al.  Sequence analysis of T-cell repertoires in health and disease , 2013, Genome Medicine.

[13]  Dennis R. Burton,et al.  Toward a more accurate view of human B-cell repertoire by next-generation sequencing, unbiased repertoire capture and single-molecule barcoding , 2014, Scientific Reports.

[14]  Mark M. Davis,et al.  Lineage Structure of the Human Antibody Repertoire in Response to Influenza Vaccination , 2013, Science Translational Medicine.

[15]  J. Calis,et al.  Characterizing immune repertoires by high throughput sequencing: strategies and applications. , 2014, Trends in immunology.

[16]  S. Linnarsson,et al.  Counting absolute numbers of molecules using unique molecular identifiers , 2011, Nature Methods.

[17]  Stephen R. Quake,et al.  Genetic measurement of memory B-cell recall using antibody repertoire sequencing , 2013, Proceedings of the National Academy of Sciences.

[18]  A. Plückthun,et al.  Reliable cloning of functional antibody variable domains from hybridomas and spleen cell repertoires employing a reengineered phage display system. , 1997, Journal of immunological methods.

[19]  L. Penland,et al.  Determinism and stochasticity during maturation of the zebrafish antibody repertoire , 2011, Proceedings of the National Academy of Sciences.

[20]  George Georgiou,et al.  In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire , 2014, Nature Medicine.

[21]  R. Neher,et al.  Challenges with Using Primer IDs to Improve Accuracy of Next Generation Sequencing , 2015, PloS one.

[22]  Andrew D. Ellington,et al.  Identification and characterization of the constituent human serum antibodies elicited by vaccination , 2014, Proceedings of the National Academy of Sciences.

[23]  Mikhail Shugay,et al.  Towards error-free profiling of immune repertoires , 2014, Nature Methods.

[24]  Jan Berka,et al.  Precise determination of the diversity of a combinatorial antibody library gives insight into the human immunoglobulin repertoire , 2009, Proceedings of the National Academy of Sciences.

[25]  Tony Z. Jia,et al.  Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes , 2012, Proceedings of the National Academy of Sciences.

[26]  Irina Czogiel,et al.  Single‐cell based high‐throughput sequencing of full‐length immunoglobulin heavy and light chain genes , 2014, European journal of immunology.

[27]  S. Quake,et al.  The promise and challenge of high-throughput sequencing of the antibody repertoire , 2014, Nature Biotechnology.

[28]  Mark M. Davis,et al.  Human responses to influenza vaccination show seroconversion signatures and convergent antibody rearrangements. , 2014, Cell host & microbe.

[29]  Piero Carninci,et al.  Suppression of artifacts and barcode bias in high-throughput transcriptome analyses utilizing template switching , 2012, Nucleic acids research.

[30]  R. Emerson,et al.  High-throughput pairing of T cell receptor α and β sequences , 2015, Science Translational Medicine.

[31]  Baoshan Zhang,et al.  Mining the antibodyome for HIV-1–neutralizing antibodies with next-generation sequencing and phylogenetic pairing of heavy/light chains , 2013, Proceedings of the National Academy of Sciences.

[32]  George Georgiou,et al.  High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire , 2013, Nature Biotechnology.

[33]  Mikhail Shugay,et al.  Quantitative Profiling of Immune Repertoires for Minor Lymphocyte Counts Using Unique Molecular Identifiers , 2015, The Journal of Immunology.

[34]  Mikhail Shugay,et al.  Pairing of T‐cell receptor chains via emulsion PCR , 2013, European journal of immunology.

[35]  David Fenyö,et al.  A robust pipeline for rapid production of versatile nanobody repertoires , 2014, Nature Methods.

[36]  R. Emerson,et al.  Using synthetic templates to design an unbiased multiplex PCR assay , 2013, Nature Communications.

[37]  M. Salit,et al.  Synthetic Spike-in Standards for Rna-seq Experiments Material Supplemental Open Access License Commons Creative , 2022 .

[38]  Andrew D. Ellington,et al.  Molecular deconvolution of the monoclonal antibodies that comprise the polyclonal serum response , 2013, Proceedings of the National Academy of Sciences.

[39]  Joseph Kaplinsky,et al.  Antibody repertoire deep sequencing reveals antigen-independent selection in maturing B cells , 2014, Proceedings of the National Academy of Sciences.

[40]  V. Greiff,et al.  A bioinformatic framework for immune repertoire diversity profiling enables detection of immunological status , 2015, Genome Medicine.

[41]  Scott D Boyd,et al.  Convergent antibody signatures in human dengue. , 2013, Cell host & microbe.

[42]  Jérôme Lane,et al.  IMGT®, the international ImMunoGeneTics information system® , 2004, Nucleic Acids Res..

[43]  Kevin M. Clarke,et al.  Estimating Species Richness , 2005 .

[44]  J. Galson,et al.  Studying the antibody repertoire after vaccination: practical applications. , 2014, Trends in immunology.

[45]  Seung Hyun Kang,et al.  Monoclonal antibodies isolated without screening by analyzing the variable-gene repertoire of plasma cells , 2010, Nature Biotechnology.

[46]  W. M. Collinson Practical Applications , 2021, Royal Society of Health journal.

[47]  R. Veitia,et al.  Reverse transcriptase template switching and false alternative transcripts. , 2006, Genomics.

[48]  S. P. Fodor,et al.  Digital Encoding of Cellular mRNAs Enabling Precise and Absolute Gene Expression Measurement by Single-Molecule Counting , 2014, Analytical chemistry.

[49]  S. Reddy,et al.  Deep sequencing in library selection projects: what insight does it bring? , 2015, Current opinion in structural biology.

[50]  K. Kinzler,et al.  Detection and quantification of rare mutations with massively parallel sequencing , 2011, Proceedings of the National Academy of Sciences.

[51]  M. Egholm,et al.  Measurement and Clinical Monitoring of Human Lymphocyte Clonality by Massively Parallel V-D-J Pyrosequencing , 2009, Science Translational Medicine.

[52]  Sean A Beausoleil,et al.  A proteomics approach for the identification and cloning of monoclonal antibodies from serum , 2012, Nature Biotechnology.

[53]  T. Panavas,et al.  IgG variable region and VH CDR3 diversity in unimmunized mice analyzed by massively parallel sequencing. , 2014, Molecular immunology.

[54]  Tongqing Zhou,et al.  De novo identification of VRC01 class HIV-1–neutralizing antibodies by next-generation sequencing of B-cell transcripts , 2013, Proceedings of the National Academy of Sciences.

[55]  D. Price,et al.  Wrestling with the repertoire: The promise and perils of next generation sequencing for antigen receptors , 2012, European journal of immunology.

[56]  R. White,et al.  High-Throughput Sequencing of the Zebrafish Antibody Repertoire , 2009, Science.

[57]  D. Koller,et al.  High-resolution antibody dynamics of vaccine-induced immune responses , 2014, Proceedings of the National Academy of Sciences.