Immune Repertoire Sequencing Using Molecular Identifiers Enables Accurate Clonality Discovery and Clone Size Quantification

Unique molecular identifiers (MIDs) have been demonstrated to effectively improve immune repertoire sequencing (IR-seq) accuracy, especially to identify somatic hypermutations in antibody repertoire sequencing. However, evaluating the sensitivity to detect rare T cells and the degree of clonal expansion in IR-seq has been difficult due to the lack of knowledge of T cell receptor (TCR) RNA molecule copy number and a generalized approach to estimate T cell clone size from TCR RNA molecule quantification. This limited the application of TCR repertoire sequencing (TCR-seq) in clinical settings, such as detecting minimal residual disease in lymphoid malignancies after treatment, evaluating effectiveness of vaccination and assessing degree of infection. Here, we describe using an MID Clustering-based IR-Seq (MIDCIRS) method to quantitatively study TCR RNA molecule copy number and clonality in T cells. First, we demonstrated the necessity of performing MID sub-clustering to eliminate erroneous sequences. Further, we showed that MIDCIRS enables a sensitive detection of a single cell in as many as one million naïve T cells and an accurate estimation of the degree of T cell clonal expression. The demonstrated accuracy, sensitivity, and wide dynamic range of MIDCIRS TCR-seq provide foundations for future applications in both basic research and clinical settings.

[1]  Christof von Kalle,et al.  High-resolution analysis of the human T-cell receptor repertoire , 2015, Nature Communications.

[2]  L. Notarangelo,et al.  Timely and spatially regulated maturation of B and T cell repertoire during human fetal development , 2015, Science Translational Medicine.

[3]  L. Penland,et al.  Determinism and stochasticity during maturation of the zebrafish antibody repertoire , 2011, Proceedings of the National Academy of Sciences.

[4]  Mikhail Pogorelyy,et al.  VDJtools: Unifying Post-analysis of T Cell Receptor Repertoires , 2015, PLoS Comput. Biol..

[5]  James L. Zehnder,et al.  High-throughput VDJ sequencing for quantification of minimal residual disease in chronic lymphocytic leukemia and immune reconstitution assessment , 2011, Proceedings of the National Academy of Sciences.

[6]  Richard A. Moore,et al.  Exhaustive T-cell repertoire sequencing of human peripheral blood samples reveals signatures of antigen selection and a directly measured repertoire size of at least 1 million clonotypes. , 2011, Genome research.

[7]  S. P. Fodor,et al.  Combinatorial labeling of single cells for gene expression cytometry , 2015, Science.

[8]  Mark M. Davis,et al.  Clonal Deletion Prunes but Does Not Eliminate Self-Specific αβ CD8(+) T Lymphocytes. , 2015, Immunity.

[9]  Andrew P. Stubbs,et al.  Evaluation of the Antigen-Experienced B-Cell Receptor Repertoire in Healthy Children and Adults , 2016, Front. Immunol..

[10]  David A. Hafler,et al.  pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires , 2014, Bioinform..

[11]  S. P. Fodor,et al.  Counting individual DNA molecules by the stochastic attachment of diverse labels , 2011, Proceedings of the National Academy of Sciences.

[12]  Sai T Reddy,et al.  Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting , 2016, Science Advances.

[13]  K. Kinzler,et al.  Detection and quantification of rare mutations with massively parallel sequencing , 2011, Proceedings of the National Academy of Sciences.

[14]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[15]  Mikhail Shugay,et al.  Towards error-free profiling of immune repertoires , 2014, Nature Methods.

[16]  Paul G. Thomas,et al.  Defining antigen-specific plasmablast and memory B cell subsets in human blood after viral infection or vaccination , 2022 .

[17]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[18]  Åsa K. Björklund,et al.  Smart-seq2 for sensitive full-length transcriptome profiling in single cells , 2013, Nature Methods.

[19]  D. Dimitrov,et al.  Expressed antibody repertoires in human cord blood cells: 454 sequencing and IMGT/HighV-QUEST analysis of germline gene usage, junctional diversity, and somatic mutations , 2011, Immunogenetics.

[20]  Haili Yu,et al.  Diversity index of mucosal resident T lymphocyte repertoire predicts clinical prognosis in gastric cancer , 2015, Oncoimmunology.

[21]  R. White,et al.  High-Throughput Sequencing of the Zebrafish Antibody Repertoire , 2009, Science.

[22]  Daniel C. Douek,et al.  Convergent recombination shapes the clonotypic landscape of the naïve T-cell repertoire , 2010, Proceedings of the National Academy of Sciences.

[23]  Nolan G. Ericson,et al.  Digital Genomic Quantification of Tumor-Infiltrating Lymphocytes , 2013, Science Translational Medicine.

[24]  Jedd D. Wolchok,et al.  T-cell invigoration to tumour burden ratio associated with anti-PD-1 response , 2017, Nature.

[25]  Mikhail Shugay,et al.  Quantitative Profiling of Immune Repertoires for Minor Lymphocyte Counts Using Unique Molecular Identifiers , 2015, The Journal of Immunology.

[26]  Patricia J. Parker,et al.  Direct measurement of T cell receptor affinity and sequence from naïve antiviral T cells , 2016, Science Translational Medicine.

[27]  Peter D. Crompton,et al.  Accurate Immune Repertoire Sequencing Reveals Malaria Infection Driven Antibody Lineage Diversification in Young Children , 2017 .

[28]  Mark M. Davis,et al.  Lineage Structure of the Human Antibody Repertoire in Response to Influenza Vaccination , 2013, Science Translational Medicine.

[29]  D. Campana,et al.  Deep-sequencing approach for minimal residual disease detection in acute lymphoblastic leukemia. , 2012, Blood.

[30]  Nir Friedman,et al.  Dynamic Perturbations of the T-Cell Receptor Repertoire in Chronic HIV Infection and following Antiretroviral Therapy , 2015, Front. Immunol..

[31]  Jun S. Liu,et al.  Landscape of tumor-infiltrating T cell repertoire of human cancers , 2016, Nature Genetics.

[32]  David Bryder,et al.  Transcription factor profiling in individual hematopoietic progenitors by digital RT-PCR , 2006, Proceedings of the National Academy of Sciences.

[33]  Tony Z. Jia,et al.  Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes , 2012, Proceedings of the National Academy of Sciences.

[34]  Jeffrey A Jones,et al.  Immunoglobulin transcript sequence and somatic hypermutation computation from unselected RNA-seq reads in chronic lymphocytic leukemia , 2015, Proceedings of the National Academy of Sciences.

[35]  Baoshan Zhang,et al.  Molecular-level analysis of the serum antibody repertoire in young adults before and after seasonal influenza vaccination , 2016, Nature Medicine.

[36]  Scott D. Brown,et al.  Profiling tissue-resident T cell repertoires by RNA sequencing , 2015, Genome Medicine.

[37]  Stephen R. Quake,et al.  Genetic measurement of memory B-cell recall using antibody repertoire sequencing , 2013, Proceedings of the National Academy of Sciences.

[38]  Thierry Mora,et al.  Quantifying lymphocyte receptor diversity , 2016, bioRxiv.

[39]  Gioele La Manno,et al.  Quantitative single-cell RNA-seq with unique molecular identifiers , 2013, Nature Methods.

[40]  Richard A. Olshen,et al.  Diversity and clonal selection in the human T-cell repertoire , 2014, Proceedings of the National Academy of Sciences.

[41]  D. Price,et al.  The molecular basis for public T-cell responses? , 2008, Nature Reviews Immunology.

[42]  Pawel Zajac,et al.  Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases , 2013, PloS one.

[43]  S. P. Fodor,et al.  Digital Encoding of Cellular mRNAs Enabling Precise and Absolute Gene Expression Measurement by Single-Molecule Counting , 2014, Analytical chemistry.

[44]  Y. Shiao A new reverse transcription-polymerase chain reaction method for accurate quantification , 2003, BMC biotechnology.

[45]  Dennis R. Burton,et al.  Clonify: unseeded antibody lineage assignment from next-generation sequencing data , 2016, Scientific Reports.

[46]  Wei Shi,et al.  Transcriptional profiling of mouse B cell terminal differentiation defines a signature for antibody-secreting plasma cells , 2015, Nature Immunology.

[47]  X. Jin,et al.  The human cytotoxic T-lymphocyte (CTL) response to cytomegalovirus is dominated by structural protein pp65: frequency, specificity, and T-cell receptor usage of pp65-specific CTL , 1996, Journal of virology.