Structural diversity of B-cell receptor repertoires along the B-cell differentiation axis in humans and mice

Most current analysis tools for antibody next-generation sequencing data work with primary sequence descriptors, leaving accompanying structural information unharnessed. We have used novel rapid methods to structurally characterize the paratopes of more than 180 million human and mouse B-cell receptor (BCR) repertoire sequences. These structurally annotated paratopes provide unprecedented insights into both the structural predetermination and dynamics of the adaptive immune response. We show that B-cell types can be distinguished based solely on these structural properties. Antigen-unexperienced BCR repertoires use the highest number and diversity of paratope structures and these patterns of naïve repertoire paratope usage are highly conserved across subjects. In contrast, more differentiated B-cells are more personalized in terms of paratope structure usage. Our results establish the paratope structure differences in BCR repertoires and have applications for many fields including immunodiagnostics, phage display library generation, and “humanness” assessment of BCR repertoires from transgenic animals.

[1]  Jeffrey J. Gray,et al.  Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires , 2016, Proceedings of the National Academy of Sciences.

[2]  V. Greiff,et al.  A bioinformatic framework for immune repertoire diversity profiling enables detection of immunological status , 2015, Genome Medicine.

[3]  S. Quake,et al.  The promise and challenge of high-throughput sequencing of the antibody repertoire , 2014, Nature Biotechnology.

[4]  S. Tonegawa,et al.  Somatic generation of antibody diversity. , 1976, Nature.

[5]  J. Engler,et al.  Expressed murine and human CDR-H3 intervals of equal length exhibit distinct repertoires that differ in their amino acid composition and predicted range of structures. , 2003, Journal of molecular biology.

[6]  Yoonjoo Choi,et al.  FREAD revisited: Accurate loop structure prediction using a database search algorithm , 2010, Proteins.

[7]  Sebastian Kelm,et al.  SCALOP: sequence-based antibody canonical loop structure annotation , 2018, Bioinform..

[8]  C. Deane,et al.  Observed Antibody Space: A Resource for Data Mining Next-Generation Sequencing of Antibody Repertoires , 2018, The Journal of Immunology.

[9]  Roland L. Dunbrack,et al.  A new clustering of antibody CDR loop conformations. , 2011, Journal of molecular biology.

[10]  Paul G. Thomas,et al.  Defining antigen-specific plasmablast and memory B cell subsets in human blood after viral infection or vaccination , 2022 .

[11]  Anneliese O. Speak,et al.  Complete humanization of the mouse immunoglobulin loci enables efficient therapeutic antibody discovery , 2014, Nature Biotechnology.

[12]  D. Burton,et al.  Commonality despite exceptional diversity in the baseline human antibody repertoire , 2018, Nature.

[13]  Jiye Shi,et al.  The H3 loop of antibodies shows unique structural characteristics , 2017, Proteins.

[14]  Cédric R. Weber,et al.  Learning the High-Dimensional Immunogenomic Features That Predict Public and Private Antibody Repertoires , 2017, The Journal of Immunology.

[15]  Cédric R. Weber,et al.  Computational Strategies for Dissecting the High-Dimensional Complexity of Adaptive Immune Repertoires , 2017, Front. Immunol..

[16]  Johannes Trück,et al.  BCR repertoire sequencing: different patterns of B cell activation after two Meningococcal vaccines , 2015, Immunology and cell biology.

[17]  Brian D. Weitzner,et al.  The origin of CDR H3 structural diversity. , 2015, Structure.

[18]  C. E. Schrader,et al.  Mechanism and regulation of class switch recombination. , 2008, Annual review of immunology.

[19]  V. Giudicelli,et al.  IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. , 2003, Developmental and comparative immunology.

[20]  Thomas B Kepler,et al.  B-cell–lineage immunogen design in vaccine development with HIV-1 as a case study , 2012, Nature Biotechnology.

[21]  Steven H. Kleinstein,et al.  Models of Somatic Hypermutation Targeting and Substitution Based on Synonymous Mutations from High-Throughput Immunoglobulin Sequencing Data , 2013, Front. Immunol..

[22]  C. Deane,et al.  Length-independent structural similarities enrich the antibody CDR canonical class model , 2016, mAbs.

[23]  C. Deane,et al.  CODA: A combined algorithm for predicting the structurally variable regions of protein models , 2001, Protein science : a publication of the Protein Society.

[24]  A. Lesk,et al.  Standard conformations for the canonical structures of immunoglobulins. , 1997, Journal of molecular biology.

[25]  Haruki Nakamura,et al.  The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data , 2006, Nucleic Acids Res..

[26]  Peter D. Kwong,et al.  Gene-Specific Substitution Profiles Describe the Types and Frequencies of Amino Acid Changes during Antibody Somatic Hypermutation , 2017, Front. Immunol..

[27]  Juan C Almagro,et al.  Second antibody modeling assessment (AMA‐II) , 2014, Proteins.

[28]  Brian D. Weitzner,et al.  The Origin of CDR H 3 Structural Diversity , 2022 .

[29]  C. Deane,et al.  How B-Cell Receptor Repertoire Sequencing Can Be Enriched with Structural Antibody Data , 2017, Front. Immunol..

[30]  Cédric R. Weber,et al.  Systems Analysis Reveals High Genetic and Antigen-Driven Predetermination of Antibody Repertoires throughout B Cell Development. , 2017, Cell reports.

[31]  C. Deane,et al.  Antibody–antigen complex modelling in the era of immunoglobulin repertoire sequencing , 2019, Molecular Systems Design & Engineering.

[32]  C. Deane,et al.  Antibody H3 Structure Prediction , 2017, Computational and structural biotechnology journal.

[33]  Johannes Trück,et al.  B-cell repertoire dynamics after sequential hepatitis B vaccination and evidence for cross-reactive B-cell activation , 2016, Genome Medicine.

[34]  K. Roskin,et al.  Defining antigen-specific plasmablast and memory B cell subsets in blood following viral infection and vaccination of humans , 2016, Nature Immunology.

[35]  Sebastian Kelm,et al.  Environment specific substitution tables improve membrane protein alignment , 2011, Bioinform..

[36]  J. Sidney,et al.  Longitudinal Analysis of the Human B Cell Response to Ebola Virus Infection , 2019, Cell.

[37]  X. He,et al.  Comparative analysis of human and mouse immunoglobulin variable heavy regions from IMGT/LIGM-DB with IMGT/HighV-QUEST , 2014, Theoretical Biology and Medical Modelling.

[38]  Paolo Marcatili,et al.  Prediction of site-specific interactions in antibody-antigen complexes: the proABC method and server , 2013, Bioinform..

[39]  Jiye Shi,et al.  Five computational developability guidelines for therapeutic antibody profiling , 2019, Proceedings of the National Academy of Sciences.

[40]  George Georgiou,et al.  In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire , 2014, Nature Medicine.

[41]  C. Deane,et al.  Structurally Mapping Antibody Repertoires , 2018, Front. Immunol..

[42]  Jiye Shi,et al.  SAbDab: the structural antibody database , 2013, Nucleic Acids Res..

[43]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[44]  Mathieu Rouard,et al.  IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains. , 2005, Developmental and comparative immunology.