Predicting the spectrum of TCR repertoire sharing with a data-driven model of recombination

Despite the extreme diversity of T cell repertoires, many identical T cell receptor (TCR) sequences are found in a large number of individual mice and humans. These widely-shared sequences, often referred to as ‘public‘, have been suggested to be over-represented due to their potential immune functionality or their ease of generation by V(D)J recombination. Here we show that even for large cohorts the observed degree of sharing of TCR sequences between individuals is well predicted by a model accounting for the known quantitative statistical biases in the generation process, together with a simple model of thymic selection. Whether a sequence is shared by many individuals is predicted to depend on the number of queried individuals and the sampling depth, as well as on the sequence itself, in agreement with the data. We introduce the degree of publicness conditional on the queried cohort size and the size of the sampled repertoires. Based on these observations we propose a public/private sequence classifier, ‘PUBLIC’ (Public Universal Binary Likelihood Inference Classifier), based on the generation probability, which performs very well even for small cohort sizes.

[1]  George M. Church,et al.  Single-cell sequencing reveals αβ chain pairing shapes the T cell repertoire , 2017, bioRxiv.

[2]  G. Yaari,et al.  Practical guidelines for B-cell receptor repertoire sequencing analysis , 2015, Genome Medicine.

[3]  M. Lieber,et al.  Mechanistic constraints on diversity in human V(D)J recombination , 1996, Molecular and cellular biology.

[4]  Philippe Kourilsky,et al.  Vα and Vβ Public Repertoires Are Highly Conserved in Terminal Deoxynucleotidyl Transferase-Deficient Mice1 , 2005, The Journal of Immunology.

[5]  Abigail Wacher,et al.  Comprehensive assessment of T-cell receptor beta-chain diversity in alphabeta T cells. , 2009, Blood.

[6]  Yuval Elhanati,et al.  Insights into immune system development and function from mouse T-cell repertoires , 2017, Proceedings of the National Academy of Sciences.

[7]  S. Quake,et al.  The promise and challenge of high-throughput sequencing of the antibody repertoire , 2014, Nature Biotechnology.

[8]  R. Holt,et al.  Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing. , 2009, Genome research.

[9]  Wilfred Ndifon,et al.  Chromatin conformation governs T-cell receptor Jβ gene segment usage , 2012, Proceedings of the National Academy of Sciences.

[10]  Trevor Bedford,et al.  Quantifying evolutionary constraints on B-cell affinity maturation , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[11]  Thierry Mora,et al.  Method for identification of condition-associated public antigen receptor sequences , 2017, bioRxiv.

[12]  R. Emerson,et al.  High-throughput pairing of T cell receptor α and β sequences , 2015, Science Translational Medicine.

[13]  J. Cabaniols,et al.  Public and private V beta T cell receptor repertoires against hen egg white lysozyme (HEL) in nontransgenic versus HEL transgenic mice , 1994, The Journal of experimental medicine.

[14]  Elizabeth Simpson,et al.  Public T Cell Receptor β-Chains Are Not Advantaged during Positive Selection1 , 2008, The Journal of Immunology.

[15]  N. Friedman,et al.  T-cell receptor repertoires share a restricted set of public and abundant CDR3 sequences that are associated with self-related immunity , 2014, Genome research.

[16]  Jing Ma,et al.  Preferential Use of Public TCR during Autoimmune Encephalomyelitis , 2016, The Journal of Immunology.

[17]  C. Carlson,et al.  Overlap and Effective Size of the Human CD8+ T Cell Receptor Repertoire , 2010, Science Translational Medicine.

[18]  Sonia M Leach,et al.  Identification of shared TCR sequences from T cells in human breast cancer using emulsion RT-PCR , 2016, Proceedings of the National Academy of Sciences.

[19]  M. Davenport,et al.  Specificity, promiscuity, and precursor frequency in immunoreceptors. , 2013, Current opinion in immunology.

[20]  D. Montefiori,et al.  On the composition of the preimmune repertoire of T cells specific for Peptide-major histocompatibility complex ligands. , 2010, Annual review of immunology.

[21]  Yuval Elhanati,et al.  Quantifying selection in immune receptor repertoires , 2014, Proceedings of the National Academy of Sciences.

[22]  Aaron R. Dinner,et al.  A Model for TCR Gene Segment Use1 , 2006, The Journal of Immunology.

[23]  Daniel C. Douek,et al.  Convergent recombination shapes the clonotypic landscape of the naïve T-cell repertoire , 2010, Proceedings of the National Academy of Sciences.

[24]  A. Pollard,et al.  Limb proportions show developmental plasticity in response to embryo movement , 2017, Scientific Reports.

[25]  J. Bell,et al.  Extensive conservation of alpha and beta chains of the human T-cell antigen receptor recognizing HLA-A2 and influenza A matrix peptide. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Alessandro Sette,et al.  Identifying specificity groups in the T cell receptor repertoire , 2017, Nature.

[27]  Ryan Emerson,et al.  TCR Sequencing Can Identify and Track Glioma-Infiltrating T Cells after DC Vaccination , 2016, Cancer Immunology Research.

[28]  M. Vignali,et al.  Contribution of systemic and somatic factors to clinical response and resistance to PD-L1 blockade in urothelial cancer: An exploratory multi-omic analysis , 2017, PLoS medicine.

[29]  Richard A. Olshen,et al.  Diversity and clonal selection in the human T-cell repertoire , 2014, Proceedings of the National Academy of Sciences.

[30]  M. Egerton,et al.  The generation and fate of thymocytes. , 1990, Seminars in immunology.

[31]  Thierry Mora,et al.  Quantifying lymphocyte receptor diversity , 2016, bioRxiv.

[32]  J. Altman,et al.  Individual variations in the murine T cell response to a specific peptide reflect variability in naive repertoires. , 1998, Immunity.

[33]  Thierry Mora,et al.  The Past, Present, and Future of Immune Repertoire Biology – The Rise of Next-Generation Repertoire Analysis , 2013, Front. Immunol..

[34]  D J Moss,et al.  Dominant selection of an invariant T cell antigen receptor in response to persistent infection by Epstein-Barr virus , 1994, The Journal of experimental medicine.

[35]  S. Tonegawa,et al.  Evidence for somatic rearrangement of immunoglobulin genes coding for variable and constant regions. , 1976, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Jérôme Lane,et al.  IMGT®, the international ImMunoGeneTics information system® , 2004, Nucleic Acids Res..

[37]  M.V. Pogorelyy,et al.  Method for identification of condition-associated public antigen receptor sequences , 2017 .

[38]  Yun S. Song,et al.  Estimating Copy Number and Allelic Variation at the Immunoglobulin Heavy Chain Locus Using Short Reads , 2016, PLoS Comput. Biol..

[39]  Jared Gartner,et al.  Tumor- and Neoantigen-Reactive T-cell Receptors Can Be Identified Based on Their Frequency in Fresh Tumor , 2016, Cancer Immunology Research.

[40]  Jiahuai Han,et al.  Determinants of public T cell responses , 2012, Cell Research.

[41]  Daniel C. Douek,et al.  The Role of Production Frequency in the Sharing of Simian Immunodeficiency Virus-Specific CD8+ TCRs between Macaques1 , 2008, The Journal of Immunology.

[42]  Daniel C. Douek,et al.  A Mechanism for TCR Sharing between T Cell Subsets and Individuals Revealed by Pyrosequencing , 2011, The Journal of Immunology.

[43]  Michel Sadelain,et al.  Therapeutic T cell engineering , 2017, Nature.

[44]  Mikhail Shugay,et al.  Distinctive properties of identical twins' TCR repertoires revealed by high-throughput sequencing , 2014, Proceedings of the National Academy of Sciences.

[45]  Andreas Dahl,et al.  CD8+ T cells specific for the islet autoantigen IGRP are restricted in their T cell receptor chain usage , 2017, Scientific Reports.

[46]  T. Mora,et al.  Inferring processes underlying B-cell repertoire diversity , 2015, bioRxiv.

[47]  Dave Ko,et al.  Tissue distribution and clonal diversity of the T and B cell repertoire in type 1 diabetes. , 2016, JCI insight.

[48]  Roland R. Regoes,et al.  Investigating the Consequences of Interference between Multiple CD8+ T Cell Escape Mutations in Early HIV Infection , 2016, PLoS Comput. Biol..

[49]  Alberto Albertini,et al.  Different TCRBV genes generate biased patterns of V-D-J diversity in human T cells , 2004, Immunogenetics.

[50]  Antonio Lanzavecchia,et al.  Broadly neutralizing antiviral antibodies. , 2013, Annual review of immunology.

[51]  Grant Lythe,et al.  How many TCR clonotypes does a body maintain? , 2016, Journal of theoretical biology.

[52]  Jun S. Liu,et al.  Landscape of tumor-infiltrating T cell repertoire of human cancers , 2016, Nature Genetics.

[53]  William S. DeWitt,et al.  Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire , 2017, Nature Genetics.

[54]  D. Price,et al.  The molecular basis for public T-cell responses? , 2008, Nature Reviews Immunology.

[55]  J. Casanova,et al.  H-2-restricted cytolytic T lymphocytes specific for HLA display T cell receptors of limited diversity , 1992, The Journal of experimental medicine.

[56]  P. Bradley,et al.  Quantifiable predictive features define epitope-specific T cell receptor repertoires , 2017, Nature.

[57]  Yuval Elhanati,et al.  Persisting fetal clonotypes influence the structure and overlap of adult human T cell receptor repertoires , 2016, bioRxiv.

[58]  IV FrederickA.Matsen,et al.  Consistency of VDJ Rearrangement and Substitution Parameters Enables Accurate B Cell Receptor Sequence Annotation , 2015, PLoS Comput. Biol..

[59]  James Ireland,et al.  Discovery of T Cell Receptor β Motifs Specific to HLA–B27–Positive Ankylosing Spondylitis by Deep Repertoire Sequence Analysis , 2017, Arthritis & rheumatology.

[60]  Y. Louzoun,et al.  Rep‐Seq: uncovering the immunological repertoire through next‐generation sequencing , 2012, Immunology.

[61]  N. Friedman,et al.  T cell receptor repertoires of mice and humans are clustered in similarity networks around conserved public CDR3 sequences , 2017, eLife.

[62]  Cornelia L Dekker,et al.  Individual heritable differences result in unique cell lymphocyte receptor repertoires of naïve and antigen-experienced cells , 2016, Nature Communications.

[63]  Thierry Mora,et al.  Statistical inference of the generation probability of T-cell receptors from sequence repertoires , 2012, Proceedings of the National Academy of Sciences.

[64]  Daniel C. Douek,et al.  Sharing of T cell receptors in antigen-specific responses is driven by convergent recombination , 2006, Proceedings of the National Academy of Sciences.

[65]  Benedict Ng,et al.  NKT and MAIT invariant TCRα sequences can be produced efficiently by VJ gene recombination. , 2013, Immunobiology.

[66]  Cédric R. Weber,et al.  Learning the High-Dimensional Immunogenomic Features That Predict Public and Private Antibody Repertoires , 2017, The Journal of Immunology.

[67]  John Shawe-Taylor,et al.  Computational analysis of stochastic heterogeneity in PCR amplification efficiency revealed by single molecule barcoding , 2015, Scientific Reports.

[68]  Quentin Marcou,et al.  High-throughput immune repertoire analysis with IGoR , 2017, Nature Communications.

[69]  Yuval Elhanati,et al.  repgenHMM: a dynamic programming tool to infer the rules of immune receptor generation from sequence data , 2015, bioRxiv.