Measuring differential gene expression by short read sequencing: quantitative comparison to 2-channel gene expression microarrays

BackgroundHigh-throughput cDNA synthesis and sequencing of poly(A)-enriched RNA is rapidly emerging as a technology competing to replace microarrays as a quantitative platform for measuring gene expression.ResultsConsequently, we compared full length cDNA sequencing to 2-channel gene expression microarrays in the context of measuring differential gene expression. Because of its comparable cost to a gene expression microarray, our study focused on the data obtainable from a single lane of an Illumina 1 G sequencer. We compared sequencing data to a highly replicated microarray experiment profiling two divergent strains of S. cerevisiae.ConclusionUsing a large number of quantitative PCR (qPCR) assays, more than previous studies, we found that neither technology is decisively better at measuring differential gene expression. Further, we report sequencing results from a diploid hybrid of two strains of S. cerevisiae that indicate full length cDNA sequencing can discover heterozygosity and measure quantitative allele-specific expression simultaneously.

[1]  Wei Zhou,et al.  Characterization of the Yeast Transcriptome , 1997, Cell.

[2]  Rithy K. Roth,et al.  Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays , 2000, Nature Biotechnology.

[3]  M. Rattray,et al.  A comparison of microarray and MPSS technology platforms for expression analysis of Arabidopsis , 2007, BMC Genomics.

[4]  Mark Stitt,et al.  Real-time RT-PCR profiling of over 1400 Arabidopsis transcription factors: unprecedented sensitivity reveals novel root- and shoot-specific genes. , 2004, The Plant journal : for cell and molecular biology.

[5]  R. Britten,et al.  Organization, Transcription, and Regulation in the Animal Genome , 1973, The Quarterly Review of Biology.

[6]  Christian Schlötterer,et al.  Gene expression profiling by massively parallel sequencing. , 2007, Genome research.

[7]  Knut Reinert,et al.  SeqAn An efficient, generic C++ library for sequence analysis , 2008, BMC Bioinformatics.

[8]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[9]  Eugene W. Myers,et al.  A fast bit-vector algorithm for approximate string matching based on dynamic programming , 1998, JACM.

[10]  R. Lister,et al.  Highly Integrated Single-Base Resolution Maps of the Epigenome in Arabidopsis , 2008, Cell.

[11]  S. Ranade,et al.  Stem cell transcriptome profiling via massive-scale mRNA sequencing , 2008, Nature Methods.

[12]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[13]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[14]  L. Kruglyak,et al.  Genetic Dissection of Transcriptional Regulation in Budding Yeast , 2002, Science.

[15]  L. Kruglyak,et al.  Gene–Environment Interaction in Yeast Gene Expression , 2008, PLoS biology.

[16]  A. Paterson Leafing through the genomes of our major crop plants: strategies for capturing unique information , 2006, Nature Reviews Genetics.

[17]  Edward M. McCreight,et al.  Priority Search Trees , 1985, SIAM J. Comput..

[18]  M. Rosbash,et al.  Number and distribution of polyadenylated RNA sequences in yeast , 1977, Cell.

[19]  Ricardo A. Baeza-Yates,et al.  Fast and Practical Approximate String Matching , 1996, Inf. Process. Lett..

[20]  Simon J Davis,et al.  Deep analysis of cellular transcriptomes – LongSAGE versus classic MPSS , 2007, BMC Genomics.

[21]  I. Goodhead,et al.  Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution , 2008, Nature.

[22]  Leonid Kruglyak,et al.  Local Regulatory Variation in Saccharomyces cerevisiae , 2005, PLoS genetics.

[23]  L. Kruglyak,et al.  Genetics of global gene expression , 2006, Nature Reviews Genetics.

[24]  Piero Carninci,et al.  Development of a DNA barcode tagging method for monitoring dynamic changes in gene expression by using an ultra high-throughput sequencer. , 2008, BioTechniques.

[25]  A. Frigessi,et al.  Validation of oligoarrays for quantitative exploration of the transcriptome , 2008, BMC Genomics.

[26]  Ricardo A. Baeza-Yates,et al.  Fast and Practical Approximate String Matching , 1992, Inf. Process. Lett..

[27]  Yaqin Ma,et al.  BatchPrimer3: A high throughput web application for PCR and sequencing primer design , 2008, BMC Bioinformatics.

[28]  Ryan D. Morin,et al.  Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. , 2008, BioTechniques.

[29]  Jared C Roach,et al.  Application of affymetrix array and massively parallel signature sequencing for identification of genes involved in prostate cancer progression , 2005, BMC Cancer.

[30]  J. Roach,et al.  Statistical analysis of MPSS measurements: application to the study of LPS-activated macrophage gene expression. , 2005, Proceedings of the National Academy of Sciences of the United States of America.