The Repertoire Dissimilarity Index as a method to compare lymphocyte receptor repertoires

BackgroundThe B and T cells of the human adaptive immune system leverage a highly diverse repertoire of antigen-specific receptors to protect the human body from pathogens. The sequencing and analysis of immune repertoires is emerging as an important tool to understand immune responses, whether beneficial or harmful (in the case of autoimmunity). However, methods for studying these repertoires, and for directly comparing different immune repertoires, are lacking.ResultsIn this paper, we present a non-parametric method for directly comparing sequencing repertoires, with the goal of rigorously quantifying differences in V, D, and J gene segment utilization. This method, referred to as the Repertoire Dissimilarity Index (RDI), uses a bootstrapped subsampling approach to account for variance in sequencing depth, and, coupled with a data simulation approach, allows for direct quantification of the average variation between repertoires. We use the RDI method to recapitulate known differences in the formation of the CD4+ and CD8+ T cell repertoires, and further show that antigen-driven activation of naïve CD8+ T cells is more selective than in the CD4+ repertoire, resulting in a more specialized CD8+ memory repertoire.ConclusionsWe prove that the RDI method is an accurate and versatile method for comparisons of immune repertoires. The RDI method has been implemented as an R package, and is available for download through Bitbucket.

[1]  Steven H. Kleinstein,et al.  B cells populating the multiple sclerosis brain mature in the draining cervical lymph nodes , 2014, Science Translational Medicine.

[2]  P. Moss,et al.  Herpesvirus-Specific CD8 T Cell Immunity in Old Age: Cytomegalovirus Impairs the Response to a Coresident EBV Infection1 , 2004, The Journal of Immunology.

[3]  D. Schatz,et al.  Recombination centres and the orchestration of V(D)J recombination , 2011, Nature Reviews Immunology.

[4]  Susan P. Holmes,et al.  Waste Not , Want Not : Why Rarefying Microbiome Data is Inadmissible . October 1 , 2013 , 2013 .

[5]  P. Moss,et al.  Cytomegalovirus Infection Leads to Development of High Frequencies of Cytotoxic Virus-Specific CD4+ T Cells Targeted to Vascular Endothelium , 2016, PLoS pathogens.

[6]  S. Durham,et al.  Rhinitis, sinusitis, and upper airway disease Influence of seasonal exposure to grass pollen on local and peripheral blood IgE repertoires in patients with allergic rhinitis , 2022 .

[7]  Cornelia L Dekker,et al.  Individual heritable differences result in unique cell lymphocyte receptor repertoires of naïve and antigen-experienced cells , 2016, Nature Communications.

[8]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[9]  V. Giudicelli,et al.  IMGT(®) tools for the nucleotide analysis of immunoglobulin (IG) and T cell receptor (TR) V-(D)-J repertoires, polymorphisms, and IG mutations: IMGT/V-QUEST and IMGT/HighV-QUEST for NGS. , 2012, Methods in molecular biology.

[10]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[11]  J. Bell,et al.  Functional Heterogeneity and High Frequencies of Cytomegalovirus-Specific CD8+ T Lymphocytes in Healthy Seropositive Donors , 2000, Journal of Virology.

[12]  Susan Holmes,et al.  phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data , 2013, PloS one.

[13]  Jeffrey A. Wiser,et al.  ImmPort: disseminating data to the public for the future of immunology , 2014, Immunologic Research.

[14]  C. Quince,et al.  Dirichlet Multinomial Mixtures: Generative Models for Microbial Metagenomics , 2012, PloS one.

[15]  S. Tonegawa,et al.  Somatic generation of antibody diversity. , 1976, Nature.

[16]  P. Klenerman,et al.  Regulation of the Class II MHC Pathway in Primary Human Monocytes by Granulocyte-Macrophage Colony-Stimulating Factor 1 , 2003, The Journal of Immunology.

[17]  G. Yaari,et al.  Practical guidelines for B-cell receptor repertoire sequencing analysis , 2015, Genome Medicine.

[18]  Steven H. Kleinstein,et al.  Change-O: a toolkit for analyzing large-scale B cell immunoglobulin repertoire sequencing data , 2015, Bioinform..

[19]  Mark M. Davis,et al.  T-cell antigen receptor genes and T-cell recognition , 1988, Nature.

[20]  John M. Fonner,et al.  VDJML: a file format with tools for capturing the results of inferring immune receptor rearrangements , 2016, BMC Bioinformatics.

[21]  Hao Shen,et al.  Cutting Edge: CD4 and CD8 T Cells Are Intrinsically Different in Their Proliferative Responses1 , 2002, The Journal of Immunology.

[22]  Patrice Duroux,et al.  IMGT/HIGHV-QUEST: THE IMGT® WEB PORTAL FOR IMMUNOGLOBULIN (IG) OR ANTIBODY AND T CELL RECEPTOR (TR) ANALYSIS FROM NGS HIGH THROUGHPUT AND DEEP SEQUENCING , 2012 .

[23]  David A. Hafler,et al.  pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires , 2014, Bioinform..

[24]  M. Rep,et al.  Phenotypic and Functional Separation of Memory and Effector Human CD8+ T Cells , 1997, The Journal of experimental medicine.