Scalable Models of Antibody Evolution and Benchmarking of Clonal Tree Reconstruction Methods

Affinity maturation (AM) of antibodies through somatic hypermutations (SHMs) enables the immune system to evolve to recognize diverse pathogens. The accumulation of SHMs leads to the formation of clonal trees of antibodies produced by B cells that have evolved from a common naive B cell. Recent advances in high-throughput sequencing have enabled deep scans of antibody repertoires, paving the way for reconstructing clonal trees. However, it is not clear whether clonal trees, which capture micro-evolutionary time scales, can be reconstructed with adequate accuracy using traditional phylogenetic reconstruction methods. In fact, several clonal tree reconstruction methods have been developed to address presumed shortcomings of phylogenetic methods. Nevertheless, no consensus has been reached regarding the relative accuracy of these methods, partly because evaluation is challenging. Both benchmarking existing methods and developing better ones would benefit from realistic models of clonal tree evolution designed specifically to emulate B cell evolution. In this paper, we propose a model of B cell clonal tree evolution and use it to benchmark several existing clonal tree reconstruction methods. Our model, designed to be extensible, has several features: by evolving the clonal tree and sequences simultaneously, it allows modeling selective pressure due to changes in binding affinity; it enables scalable simulations of millions of cells; it enables several rounds of infection by an evolving pathogen; and it models the buildup of memory. In addition, we suggest a set of metrics for comparing clonal trees and for measuring their properties.
Our benchmarking results show that while maximum likelihood phylogenetic reconstruction methods can fail to capture key features of clonal tree expansion if applied naively, a very simple postprocessing step that contracts very short branches leads to inferences that are better than those of alternative methods.
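
The branch-contraction postprocessing mentioned above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Node` class, the `contract_short_branches` function, and the threshold value are hypothetical names and choices made for this example. The sketch also assumes that only internal branches are contracted and that the length of a removed branch is added to the branches below it so that root-to-leaf path lengths are preserved.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical tree node; `length` is the branch length to the parent."""
    name: str
    length: float = 0.0
    children: List["Node"] = field(default_factory=list)


def contract_short_branches(node: Node, threshold: float) -> Node:
    """Contract every internal branch shorter than `threshold`, in place.

    A contracted node is removed and its children are attached to its
    parent, which can turn a binary tree into one with multifurcations.
    """
    new_children: List[Node] = []
    for child in node.children:
        # Post-order: simplify the subtree before examining its own branch.
        contract_short_branches(child, threshold)
        if child.children and child.length < threshold:
            # Splice the grandchildren into `node`, keeping path lengths.
            for grandchild in child.children:
                grandchild.length += child.length
                new_children.append(grandchild)
        else:
            new_children.append(child)
    node.children = new_children
    return node


if __name__ == "__main__":
    # Toy tree: the internal node X sits on a near-zero branch.
    tree = Node("root", children=[
        Node("A", 0.1),
        Node("X", 0.0001, [Node("B", 0.2), Node("C", 0.3)]),
    ])
    contract_short_branches(tree, threshold=0.001)  # threshold is an assumption
    print([child.name for child in tree.children])
```

In this toy example, the node X is removed and the root becomes a multifurcation over A, B, and C. The appropriate threshold depends on sequence length and the substitution model, so the value used here is only a placeholder.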
