论文信息 - Rapid T-cell receptor interaction grouping with ting

Rapid T-cell receptor interaction grouping with ting

MOTIVATION Clustering T cell receptor repertoire (TCRR) sequences according to antigen specificity is challenging. The previously published tool GLIPH needs several days to weeks for clustering large repertoires, making its use impractical in larger studies. In addition, the methodology used in GLIPH suffers from shortcomings, including non-determinism, potential loss of significant antigen-specific sequences or inclusion of too many unspecific sequences. RESULTS We present an algorithm for clustering TCRR sequences that scales efficiently to large repertoires. We clustered 36 real datasets with up to 62 000 unique CDR3β sequences using both an implementation of our method called ting, GLIPH and its successsor GLIPH2. While GLIPH required multiple weeks, ting only needed about one minute for the same task. GLIPH2 is comparably fast, but uses a different grouping paradigm. In addition, we found that in naïve repertoires, where no or very few antigen-specific CDR3 sequences or clusters should exist, our method indeed selects much fewer motifs and produces smaller clusters. AVAILABILITY Our method has been implemented in Python as a tool called ting. It is available from GitHub (https://github.com/FelixMoelder/ting) or PyPI under the MIT license.

[1] K. Jarrod Millman,et al. Array programming with NumPy , 2020, Nat..

[2] Michael J. Fischer,et al. An improved equivalence algorithm , 1964, CACM.

[3] A. Thiel,et al. SLAMF7 and IL-6R define distinct cytotoxic versus helper memory CD8+ T cells , 2020, Nature Communications.

[4] Peter N. Robinson,et al. IMSEQ - a fast and error aware approach to immunogenetic sequence analysis , 2015, Bioinform..

[5] Mark M. Davis,et al. Analyzing the M. tuberculosis immune response by T cell receptor clustering with GLIPH2 and genome-wide antigen screening , 2020, Nature Biotechnology.

[6] A. Scheffold,et al. Human Anti-fungal Th17 Immunity and Pathology Rely on Cross-Reactivity against Candida albicans , 2019, Cell.

[7] Alessandro Sette,et al. Identifying specificity groups in the T cell receptor repertoire , 2017, Nature.

[8] V. Giudicelli,et al. IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. , 2003, Developmental and comparative immunology.

[9] Andrew K. Sewell,et al. Why must T cells be cross-reactive? , 2012, Nature Reviews Immunology.