Evaluation of TagSeq, a reliable low‐cost alternative for RNAseq

RNAseq is a relatively new tool for ecological genetics that offers researchers insight into changes in gene expression in response to a myriad of natural or experimental conditions. However, standard RNAseq methods (e.g., Illumina TruSeq® or NEBNext®) can be cost prohibitive, especially when study designs require large sample sizes. Consequently, RNAseq is often underused as a method, or is applied to small sample sizes that confer poor statistical power. Low cost RNAseq methods could therefore enable far greater and more powerful applications of transcriptomics in ecological genetics and beyond. Standard mRNAseq is costly partly because one sequences portions of the full length of all transcripts. Such whole‐mRNA data are redundant for estimates of relative gene expression. TagSeq is an alternative method that focuses sequencing effort on mRNAs’ 3’ end, reducing the necessary sequencing depth per sample, and thus cost. We present a revised TagSeq library construction procedure, and compare its performance against NEBNext®, the ‘gold‐standard’ whole mRNAseq method. We built both TagSeq and NEBNext® libraries from the same biological samples, each spiked with control RNAs. We found that TagSeq measured the control RNA distribution more accurately than NEBNext®, for a fraction of the cost per sample (~10%). The higher accuracy of TagSeq was particularly apparent for transcripts of moderate to low abundance. Technical replicates of TagSeq libraries are highly correlated, and were correlated with NEBNext® results. Overall, we show that our modified TagSeq protocol is an efficient alternative to traditional whole mRNAseq, offering researchers comparable data at greatly reduced cost.

[1]  M. Matz,et al.  Genomic determinants of coral heat tolerance across latitudes , 2015, Science.

[2]  T. Fennell,et al.  Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries , 2011, Genome Biology.

[3]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[4]  Michael A Black,et al.  The power and promise of RNA‐seq in ecology and evolution , 2016, Molecular ecology.

[5]  Seth M. Barribeau,et al.  Gene expression differences underlying genotype-by-genotype specificity in a host–parasite system , 2014, Proceedings of the National Academy of Sciences.

[6]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[7]  Cole Trapnell,et al.  Role of Rodent Secondary Motor Cortex in Value-based Action Selection Nih Public Access Author Manuscript , 2006 .

[8]  T. Juenger,et al.  Deeply diverged alleles in the Arabidopsis AREB1 transcription factor drive genome-wide differences in transcriptional response to the environment. , 2015, Molecular biology and evolution.

[9]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[10]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[11]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[12]  Cole Trapnell,et al.  Improving RNA-Seq expression estimates by correcting for fragment bias , 2011, Genome Biology.

[13]  C. Eizaguirre,et al.  Exploring local immunological adaptation of two stickleback ecotypes by experimental infection and transcriptome-wide digital gene expression analysis , 2012, Molecular ecology.

[14]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[15]  Florian Leese,et al.  Detection and Removal of PCR Duplicates in Population Genomic ddRAD Studies by Addition of a Degenerate Base Region (DBR) in Sequencing Adapters , 2014, The Biological Bulletin.

[16]  M. Matz,et al.  Profiling gene expression responses of coral larvae (Acropora millepora) to elevated temperature and settlement inducers using a novel RNA‐Seq procedure , 2011, Molecular ecology.

[17]  Magdalena Zarowiecki,et al.  Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction , 2014, Nature Genetics.

[18]  Elin Videvall,et al.  The Avian Transcriptome Response to Malaria Infection , 2015, Molecular biology and evolution.

[19]  Standards, Guidelines and Best Practices for Rna-seq V1.0 (june 2011) the Encode Consortium , 2022 .

[20]  D. Reich,et al.  Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture , 2012, Genome research.

[21]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[22]  John T. Lovell,et al.  Exploiting Differential Gene Expression and Epistasis to Discover Candidate Genes for Drought-Associated QTLs in Arabidopsis thaliana , 2015, Plant Cell.