Scirpy: a Scanpy extension for analyzing single-cell T-cell receptor-sequencing data

Abstract

Summary: Advances in single-cell technologies have enabled the investigation of T-cell phenotypes and repertoires at unprecedented resolution and scale. Bioinformatic methods for the efficient analysis of these large-scale datasets are instrumental for advancing our understanding of adaptive immune responses. However, while well-established solutions are accessible for the processing of single-cell transcriptomes, no streamlined pipelines are available for the comprehensive characterization of T-cell receptors. Here, we propose single-cell immune repertoires in Python (Scirpy), a scalable Python toolkit that provides simplified access to the analysis and visualization of immune repertoires from single cells and seamless integration with transcriptomic data.

Availability and implementation: Scirpy source code and documentation are available at https://github.com/icbi-lab/scirpy.

Supplementary information: Supplementary data are available at Bioinformatics online.
