Highly Multiplexed Single-Cell Full-Length cDNA Sequencing of human immune cells with 10X Genomics and R2C2

Single cell transcriptome analysis elucidates facets of cell biology that have been previously out of reach. However, the high-throughput analysis of thousands of single cell transcriptomes has been limited by sample preparation and sequencing technology. High-throughput single cell analysis today is facilitated by protocols like the 10X Genomics platform or Drop-Seq which generate cDNA pools in which the origin of a transcript is encoded at its 5’ or 3’ end. These cDNA pools are most often analyzed by short read Illumina sequencing which can identify the cellular origin of a transcript and what gene it was transcribed from. However, these methods fail to retrieve isoform information. In principle, cDNA pools prepared using these approaches can be analyzed with Pacific Biosciences and Oxford Nanopore long-read sequencers to retrieve isoform information but current implementations rely heavily on Illumina short-reads for analysis in addition to long reads. Here, we used R2C2 to sequence and demultiplex 12 million full-length cDNA molecules generated by the 10X Chromium platform from ∼3000 peripheral blood mononuclear cells (PBMCs). We used these reads to – independent from Illumina data – cluster cells into B cells, T cells, and Monocytes and generate isoform-level transcriptomes for these cell types. We also generated isoform-level transcriptomes for all single cells and used this information to identify a wide range of isoform diversity between genes. Finally, we also designed a computational workflow to extract paired adaptive immune receptors – T cell receptor and B cell receptor (TCR and BCR) – sequences unique to each T and B cell. This work represents a new, simple, and powerful approach that – using a single sequencing method – can extract an unprecedented amount of information from thousands of single cells.

[1]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[2]  A. Mortazavi,et al.  Mapping and modeling the genomic basis of differential RNA isoform expression at single-cell resolution with LR-Split-seq , 2021, Genome Biology.

[3]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[4]  J. Hadfield,et al.  RNA sequencing: the teenage years , 2019, Nature Reviews Genetics.

[5]  Pascal Barbry,et al.  High throughput error corrected Nanopore single cell transcriptome sequencing , 2019, Nature Communications.

[6]  E. Worrell The teenage years , 2018 .

[7]  James T. Webber,et al.  Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris , 2018, Nature.

[8]  N. Neff,et al.  Reconstructing lineage hierarchies of the distal lung epithelium using single cell RNA-seq , 2014, Nature.

[9]  Mark Gerstein,et al.  Accurate Identification and Analysis of Human mRNA Isoforms Using Deep Long Read Sequencing , 2013, G3: Genes, Genomes, Genetics.

[10]  Charles Cole,et al.  Complete characterization of the human immune cell transcriptome using accurate full-length cDNA sequencing. , 2020, Genome research.

[11]  Lennart Martens,et al.  1 SQANTI : extensive characterization of long read transcript sequences for quality control in 1 full-length transcriptome identification and quantification 2 3 , 2017 .

[12]  A. C. Webb,et al.  Nucleotide sequence of human monocyte interleukin 1 precursor cDNA. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[13]  F. Luciani,et al.  High-throughput targeted long-read single cell sequencing reveals the clonal and transcriptional landscape of lymphocytes , 2018, Nature Communications.

[14]  Susan B. Carpenter,et al.  Generation of an isoform-level transcriptome atlas of macrophage activation , 2021, The Journal of biological chemistry.

[15]  Paul Hoffman,et al.  Integrating single-cell transcriptomic data across different conditions, technologies, and species , 2018, Nature Biotechnology.

[16]  Hagen U. Tilgner,et al.  Single-cell isoform RNA sequencing (ScISOr-Seq) across thousands of cells reveals isoforms of cerebellar cell types , 2018, bioRxiv.

[17]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[18]  Hugh E. Olsen,et al.  Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells , 2017, Nature Communications.

[19]  Yvan Saeys,et al.  A single-cell atlas of mouse brain macrophages reveals unique transcriptional identities shaped by ontogeny and tissue environment , 2019, Nature Neuroscience.

[20]  Barbara J. Wold,et al.  A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification , 2019, bioRxiv.

[21]  Aviv Regev,et al.  A revised airway epithelial hierarchy includes CFTR-expressing ionocytes , 2018, Nature.

[22]  Principal Investigators,et al.  Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris , 2018 .

[23]  Grace X. Y. Zheng,et al.  Massively parallel digital transcriptional profiling of single cells , 2016, Nature Communications.

[24]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[25]  C. Vollmers,et al.  Complete characterization of the human immune cell transcriptome using accurate full-length cDNA sequencing , 2019, bioRxiv.

[26]  M. Cogné,et al.  Structure and expression of the mb‐1 transcript in human lymphoid cells , 1992, Clinical and experimental immunology.

[27]  Andrew C. Adey,et al.  Single-Cell Transcriptional Profiling of a Multicellular Organism , 2017 .

[28]  Anneliese O. Speak,et al.  T cell fate and clonality inference from single cell transcriptomes , 2016, Nature Methods.

[29]  M. Tress,et al.  Corrigendum: SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification. , 2018, Genome research.

[30]  Angela N. Brooks,et al.  Full-length transcript characterization of SF3B1 mutation in chronic lymphocytic leukemia reveals downregulation of retained introns , 2018, Nature Communications.

[31]  Ricardo J. Miragaia,et al.  Single-Cell Transcriptomics of Regulatory T Cells Reveals Trajectories of Tissue Adaptation , 2019, Immunity.

[32]  K. Laidre,et al.  Depletion of Hemoglobin Transcripts and Long-Read Sequencing Improves the Transcriptome Annotation of the Polar Bear (Ursus maritimus) , 2019, bioRxiv.

[33]  P. Reddien,et al.  Fundamentals of planarian regeneration. , 2004, Annual review of cell and developmental biology.

[34]  Richard E. Green,et al.  Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA , 2018, Proceedings of the National Academy of Sciences.

[35]  S. Orkin,et al.  Mapping the Mouse Cell Atlas by Microwell-Seq , 2018, Cell.

[36]  Ning Ma,et al.  IgBLAST: an immunoglobulin variable domain sequence analysis tool , 2013, Nucleic Acids Res..

[37]  Rona S. Gertner,et al.  Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells , 2013, Nature.

[38]  Marie-Paule Lefranc,et al.  IMGT-ONTOLOGY for immunogenetics and immunoinformatics , 2003, Silico Biol..

[39]  J. C. Love,et al.  Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput , 2017, Nature Methods.

[40]  A. Cribbs,et al.  Highly accurate barcode and UMI error correction using dual nucleotide dimer blocks allows direct single-cell nanopore transcriptome sequencing , 2021, bioRxiv.

[41]  Steven L Salzberg,et al.  Transcriptome assembly from long-read RNA-seq alignments with StringTie2 , 2019, Genome biology.

[42]  Sarah A Teichmann,et al.  BraCeR: B-cell-receptor reconstruction and clonality inference from single-cell RNA-seq , 2018, Nature Methods.

[43]  Robert J. Schmitz,et al.  R2C2: Improving nanopore read accuracy enables the sequencing of highly-multiplexed full-length single-cell cDNA , 2018, bioRxiv.

[44]  Angela N. Brooks,et al.  Nanopore native RNA sequencing of a human poly(A) transcriptome , 2018, bioRxiv.

[45]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[46]  B. Haynes,et al.  Isolation and characterization of the genomic human CD7 gene: structural similarity with the murine Thy-1 gene. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Ahmed Mahfouz,et al.  Single-cell isoform RNA sequencing characterizes isoforms in thousands of cerebellar cells , 2018, Nature Biotechnology.

[48]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[49]  Donald Sharon,et al.  Defining a personal, allele-specific, and single-molecule long-read transcriptome , 2014, Proceedings of the National Academy of Sciences.