Estimating the Allele-Specific Expression of SNVs From 10× Genomics Single-Cell RNA-Sequencing Data

With the recent advances in single-cell RNA-sequencing (scRNA-seq) technologies, estimation of allele expression from single cells is becoming increasingly reliable. Allele expression is both quantitative and dynamic and is an essential component of the genomic interactome. Here, we systematically estimate allele expression from heterozygous single nucleotide variant (SNV) loci using scRNA-seq data generated on the 10x Genomics platform. We include in the analysis 26,640 human adipose-derived mesenchymal stem cells (from three healthy donors), with an average sequencing reads over 120K/cell (more than 4 billion scRNA-seq reads total). High quality SNV calls assessed in our study contained approximately 15% exonic and >50% intronic loci. To analyze the allele expression, we estimate the expressed Variant Allele Fraction (VAFRNA) from SNV-aware alignments and analyze its variance and distribution (mono- and bi-allelic) at different cutoffs for required minimal number of sequencing reads. Our analysis shows that when assessing SNV loci covered by a minimum of 3 unique sequencing reads, over 50% of the heterozygous SNVs show bi-allelic expression, while at minimum of 10 reads, nearly 90% of the SNVs are bi-allelic. Consistent with single cell studies on RNA velocity and models of transcriptional burst kinetics, we observe a substantially higher rate of monoallelic expression among intronic SNVs, signifying the usefulness of scVAFRNA to assess dynamic cellular processes. Our analysis demonstrates the feasibility of scVAFRNA estimation from current scRNA-seq datasets and shows that the 3’-based library generation protocol of 10x Genomics scRNA-seq data can be highly informative in SNV-based analyses.

[1]  C. Gregg,et al.  New subtypes of allele-specific epigenetic effects: implications for brain development, function and disease , 2019, Current Opinion in Neurobiology.

[2]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[3]  Fen Xu,et al.  Single-cell RNA-seq of cultured human adipose-derived mesenchymal stem cells , 2019, Scientific Data.

[4]  Åsa K. Björklund,et al.  Smart-seq2 for sensitive full-length transcriptome profiling in single cells , 2013, Nature Methods.

[5]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[6]  R. Sandberg,et al.  Genomic encoding of transcriptional burst kinetics , 2019, Nature.

[7]  Paul Hoffman,et al.  Integrating single-cell transcriptomic data across different conditions, technologies, and species , 2018, Nature Biotechnology.

[8]  A. Heger,et al.  UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy , 2016, bioRxiv.

[9]  K. Tsaneva-Atanasova,et al.  RNA2DNAlign: nucleotide resolution allele asymmetries through quantitative assessment of RNA and DNA paired sequencing data , 2016, Nucleic acids research.

[10]  R. Satija,et al.  Integrative single-cell analysis , 2019, Nature Reviews Genetics.

[11]  Mauricio O. Carneiro,et al.  From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline , 2013, Current protocols in bioinformatics.

[12]  A. Wakamatsu,et al.  Genome-wide determination of RNA stability reveals hundreds of short-lived noncoding transcripts in mammals , 2012, Genome research.

[13]  Jeff E. Mold,et al.  Analysis of allelic expression patterns in clonal somatic cells by single-cell RNA-seq , 2016, Nature Genetics.

[14]  Ashwinikumar Kulkarni,et al.  Beyond bulk: a review of single cell transcriptomics methodologies and applications. , 2019, Current opinion in biotechnology.

[15]  M. Vibranovski,et al.  Early X chromosome inactivation during human preimplantation development revealed by single-cell RNA-sequencing , 2017, Scientific Reports.

[16]  Mihaela Zavolan,et al.  Deciphering the role of RNA-binding proteins in the post-transcriptional control of gene expression. , 2010, Briefings in functional genomics.

[17]  J. Marioni,et al.  Mosaic autosomal aneuploidies are detectable from single-cell RNAseq data , 2017, BMC Genomics.

[18]  Erik Sundström,et al.  RNA velocity of single cells , 2018, Nature.

[19]  Jingyuan Fu,et al.  Calling genotypes from public RNA-sequencing data enables identification of genetic variants that affect gene-expression levels , 2014, Genome Medicine.

[20]  Xun Zhu,et al.  Using single nucleotide variations in single-cell RNA-seq to identify subpopulations and genotype-phenotype linkage , 2016, Nature Communications.

[21]  J. Pal,et al.  Role of 5′‐ and 3′‐untranslated regions of mRNAs in human diseases , 2009, Biology of the cell.

[22]  Itay Tirosh,et al.  Single-Cell RNA Sequencing in Cancer: Lessons Learned and Emerging Challenges. , 2019, Molecular cell.

[23]  Piero Carninci,et al.  Biased allelic expression in human primary fibroblast single cells. , 2015, American journal of human genetics.

[24]  Jonathan K. Pritchard,et al.  WASP: allele-specific software for robust molecular quantitative trait locus discovery , 2015, Nature Methods.

[25]  M. Zavolan,et al.  Terminal exon characterization with TECtool reveals an abundance of cell-specific isoforms , 2018, Nature Methods.

[26]  Yuchao Jiang,et al.  Single-Cell Allele-Specific Gene Expression Analysis. , 2019, Methods in molecular biology.

[27]  S. Fuqua,et al.  Novel Insights into Breast Cancer Genetic Variance through RNA Sequencing , 2013, Scientific Reports.

[28]  Alice Giustacchini,et al.  Unravelling Intratumoral Heterogeneity through High-Sensitivity Single-Cell Mutational Analysis and Parallel RNA Sequencing , 2019, Molecular cell.

[29]  K. Crandall,et al.  Overexpressed somatic alleles are enriched in functional elements in Breast Cancer , 2017, Scientific Reports.

[30]  J. Marioni,et al.  Inferring the kinetics of stochastic gene expression from single-cell RNA-sequencing data , 2013, Genome Biology.

[31]  M. Lee Understanding Cancer Through the Lens of Epigenetic Inheritance, Allele-Specific Gene Expression, and High-Throughput Technology , 2019, Front. Oncol..

[32]  Ken Chen,et al.  Monovar: single nucleotide variant detection in single cells , 2016, Nature Methods.

[33]  A. Horvath,et al.  RsQTL: correlation of expressed SNVs with splicing using RNA-sequencing data , 2019, bioRxiv.

[34]  Mengjie Chen,et al.  SNV identification from single-cell RNA sequencing data. , 2019, Human molecular genetics.

[35]  M. Zavolan,et al.  Identification and consequences of miRNA–target interactions — beyond repression of gene expression , 2014, Nature Reviews Genetics.

[36]  K. Crandall,et al.  Systematic pan-cancer analysis of somatic allele frequency , 2018, Scientific reports.

[37]  Aleksandra A. Kolodziejczyk,et al.  Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression , 2015, Nature Communications.

[38]  Michael B. Stadler,et al.  Analysis of intronic and exonic reads in RNA-seq data characterizes transcriptional and post-transcriptional regulation , 2015, Nature Biotechnology.

[39]  Piotr Słowiński,et al.  ReQTL: identifying correlations between expressed SNVs and gene expression using RNA-sequencing data , 2019, Bioinform..

[40]  Jong Kyoung Kim,et al.  Corrigendum: Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression , 2015, Nature Communications.

[41]  Mercedeh Movassagh,et al.  SNPlice: variants that modulate Intron retention from RNA-sequencing data , 2015, Bioinform..

[42]  Rickard Sandberg,et al.  scphaser: haplotype inference using single-cell RNA-seq data , 2016, Bioinform..

[43]  Krishna R. Kalari,et al.  Cell-level somatic mutation detection from single-cell RNA sequencing , 2019, Bioinform..

[44]  R. Sandberg,et al.  Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells , 2014, Science.

[45]  P. Mondal,et al.  SNP rs1049430 in the 3'-UTR of SH3GL2 regulates its expression: Clinical and prognostic implications in head and neck squamous cell carcinoma. , 2015, Biochimica et biophysica acta.