ASAP - A Webserver for Immunoglobulin-Sequencing Analysis Pipeline

Reproducible and robust data on antibody repertoires are invaluable for basic and applied immunology. Next-generation sequencing (NGS) of antibody variable regions has emerged as a powerful tool in systems immunology, providing quantitative molecular information on antibody polyclonal composition. However, major computational challenges exist when analyzing antibody sequences, from error handling to hypermutation profiles and clonal expansion analyses. In this work, we developed the ASAP (A webserver for Immunoglobulin-Seq Analysis Pipeline) webserver (https://asap.tau.ac.il). The input to ASAP is a paired-end sequence dataset from one or more replicates, with or without unique molecular identifiers. These datasets can be derived from NGS of human or murine antibody variable regions. ASAP first filters and annotates the sequence reads using public or user-provided germline sequence information. The ASAP webserver next performs various calculations, including somatic hypermutation level, CDR3 lengths, V(D)J family assignments, and V(D)J combination distribution. These analyses are repeated for each replicate. ASAP provides additional information by analyzing the commonalities and differences between the repeats (“joint” analysis). For example, ASAP examines the shared variable regions and their frequency in each replicate to determine which sequences are less likely to be a result of a sample preparation derived and/or sequencing errors. Moreover, ASAP clusters the data to clones and reports the identity and prevalence of top ranking clones (clonal expansion analysis). ASAP further provides the distribution of synonymous and non-synonymous mutations within the V genes somatic hypermutations. Finally, ASAP provides means to process the data for proteomic analysis of serum/secreted antibodies by generating a variable region database for liquid chromatography high resolution tandem mass spectrometry (LC-MS/MS) interpretation. ASAP is user-friendly, free, and open to all users, with no login requirement. ASAP is applicable for researchers interested in basic questions related to B cell development and differentiation, as well as applied researchers who are interested in vaccine development and monoclonal antibody engineering. By virtue of its user-friendliness, ASAP opens the antibody analysis field to non-expert users who seek to boost their research with immune repertoire analysis.

[1]  Raghu Machiraju,et al.  annoPeak: a web application to annotate and visualize peaks from ChIP‐seq/ChIP‐exo‐seq , 2017, Bioinform..

[2]  Johannes Trück,et al.  Analysis of B Cell Repertoire Dynamics Following Hepatitis B Vaccination in Humans, and Enrichment of Vaccine-specific Antibody Sequences , 2015, EBioMedicine.

[3]  David Kipling,et al.  High-throughput immunoglobulin repertoire analysis distinguishes between human IgM memory and switched memory B-cell populations. , 2010, Blood.

[4]  Victor Greiff,et al.  Quantitative assessment of the robustness of next-generation sequencing of antibody variable gene repertoires from immunized mice , 2014, BMC Immunology.

[5]  Syed Ahmad Chan Bukhari,et al.  Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data , 2017, Front. Immunol..

[6]  Cédric R. Weber,et al.  Systems Analysis Reveals High Genetic and Antigen-Driven Predetermination of Antibody Repertoires throughout B Cell Development. , 2017, Cell reports.

[7]  Sean A Beausoleil,et al.  A proteomics approach for the identification and cloning of monoclonal antibodies from serum , 2012, Nature Biotechnology.

[8]  R. White,et al.  High-Throughput Sequencing of the Zebrafish Antibody Repertoire , 2009, Science.

[9]  Y. Wine,et al.  Molecular Landscape of Anti-Drug Antibodies Reveals the Mechanism of the Immune Response Following Treatment With TNFα Antagonists , 2019, bioRxiv.

[10]  J. Hakulinen,et al.  Affinity maturation of a novel antagonistic human monoclonal antibody with a long VH CDR3 targeting the Class A GPCR formyl-peptide receptor 1 , 2014, mAbs.

[11]  David Kipling,et al.  The Relationship between CD27 Negative and Positive B Cell Populations in Human Peripheral Blood , 2011, Front. Immun..

[12]  Stephen R. Quake,et al.  Genetic measurement of memory B-cell recall using antibody repertoire sequencing , 2013, Proceedings of the National Academy of Sciences.

[13]  S. Tonegawa,et al.  Somatic generation of antibody diversity. , 1976, Nature.

[14]  Franca Fraternali,et al.  BRepertoire: a user-friendly web server for analysing antibody repertoire data , 2018, Nucleic Acids Res..

[15]  Gabriel C. Wu,et al.  Temporal stability and molecular persistence of the bone marrow plasma cell antibody repertoire , 2016, Nature Communications.

[16]  A. Heger,et al.  UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy , 2016, bioRxiv.

[17]  Ludvig M. Sollid,et al.  Strong Clonal Relatedness between Serum and Gut IgA despite Different Plasma Cell Origins , 2017, Cell reports.

[18]  Andrew P Stubbs,et al.  Antigen Receptor Galaxy: A User-Friendly, Web-Based Tool for Analysis and Visualization of T and B Cell Receptor Repertoire Data , 2017, The Journal of Immunology.

[19]  O. Britanova,et al.  VDJviz: a versatile browser for immunogenomics data , 2016, BMC Genomics.

[20]  G. Yaari,et al.  Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles , 2015, Proceedings of the National Academy of Sciences.

[21]  George Georgiou,et al.  In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire , 2014, Nature Medicine.

[22]  Sai T Reddy,et al.  Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting , 2016, Science Advances.

[23]  Seung Hyun Kang,et al.  Monoclonal antibodies isolated without screening by analyzing the variable-gene repertoire of plasma cells , 2010, Nature Biotechnology.

[24]  Andrew D. Ellington,et al.  Molecular deconvolution of the monoclonal antibodies that comprise the polyclonal serum response , 2013, Proceedings of the National Academy of Sciences.

[25]  Alan Medlar,et al.  Wasabi: An Integrated Platform for Evolutionary Sequence Analysis and Data Visualization. , 2016, Molecular biology and evolution.

[26]  Fortunato Ferrara,et al.  Many Routes to an Antibody Heavy-Chain CDR3: Necessary, Yet Insufficient, for Specific Binding , 2018, Front. Immunol..

[27]  Mathieu Giraud,et al.  Correction: Vidjil: A Web Platform for Analysis of High-Throughput Repertoire Sequencing , 2017, PloS one.

[28]  Mikaël Salson,et al.  Vidjil: A Web Platform for Analysis of High-Throughput Repertoire Sequencing , 2016, PloS one.

[29]  Andrew D. Ellington,et al.  Identification and characterization of the constituent human serum antibodies elicited by vaccination , 2014, Proceedings of the National Academy of Sciences.

[30]  Hui Yang,et al.  Immune Repertoire Sequencing Using Molecular Identifiers Enables Accurate Clonality Discovery and Clone Size Quantification , 2018, Front. Immunol..

[31]  A. Velázquez‐Campoy,et al.  A Functional Analysis on the Interspecies Interaction between Mouse LFA-1 and Human Intercellular Adhesion Molecule-1 at the Cell Level , 2017, Front. Immunol..

[32]  Cédric R. Weber,et al.  Computational Strategies for Dissecting the High-Dimensional Complexity of Adaptive Immune Repertoires , 2017, Front. Immunol..

[33]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[34]  G. Yaari,et al.  Practical guidelines for B-cell receptor repertoire sequencing analysis , 2015, Genome Medicine.

[35]  Enkelejda Miho,et al.  Bioinformatic and Statistical Analysis of Adaptive Immune Repertoires. , 2015, Trends in immunology.

[36]  Mikhail Shugay,et al.  MiXCR: software for comprehensive adaptive immunity profiling , 2015, Nature Methods.

[37]  F. Fraternali,et al.  Transitional B Cells in Early Human B Cell Development – Time to Revisit the Paradigm? , 2016, Front. Immunol..

[38]  George Georgiou,et al.  High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire , 2013, Nature Biotechnology.

[39]  Edward M. Marcotte,et al.  Proteomic Identification of Monoclonal Antibodies from Serum , 2014, Analytical chemistry.

[40]  Gary E. Swan,et al.  B-cell repertoire responses to varicella-zoster vaccination in human identical twins , 2014, Proceedings of the National Academy of Sciences.

[41]  Theo M. Luider,et al.  Immune Repertoire after Immunization As Seen by Next-Generation Sequencing and Proteomics , 2017, Front. Immunol..

[42]  Marie-Paule Lefranc,et al.  IMGT, the international ImMunoGeneTics database. , 1997, Nucleic acids research.

[43]  David A. Hafler,et al.  pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires , 2014, Bioinform..

[44]  John M. Fonner,et al.  VDJServer: A Cloud-Based Analysis Portal and Data Commons for Immune Repertoire Sequences and Rearrangements , 2018, Front. Immunol..

[45]  G. B. Karlsson Hedestam,et al.  Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity , 2016, Nature Communications.

[46]  Jana Langenhan,et al.  Proteomic Analysis of Pemphigus Autoantibodies Indicates a Larger, More Diverse, and More Dynamic Repertoire than Determined by B Cell Genetics. , 2017, Cell reports.

[47]  Syed Ahmad Chan Bukhari,et al.  Adaptive Immune Receptor Repertoire Community recommendations for sharing immune-repertoire sequencing data , 2017, Nature Immunology.

[48]  Baoshan Zhang,et al.  Molecular-level analysis of the serum antibody repertoire in young adults before and after seasonal influenza vaccination , 2016, Nature Medicine.