Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation

Background: Identifying associations between genotypes and gene expression levels using microarrays has enabled systematic interrogation of the regulatory variation underlying complex phenotypes. This approach has vast potential for functional characterization of disease states, but its prohibitive cost has limited its widespread application, because hundreds to thousands of individual samples from a population have to be genotyped and expression profiled.

Results: Here we demonstrate that genomic regions with allele-specific expression (ASE) detected by sequencing cDNA are highly enriched for cis-acting expression quantitative trait loci (cis-eQTL) identified by profiling 500 animals in parallel, with up to 90% agreement on the allele that is preferentially expressed. We also observed widespread noncoding and antisense ASE and identified several allele-specific alternative splicing variants.

Conclusion: Monitoring ASE by sequencing cDNA from as few as one sample is a practical alternative to expression genetics for mapping cis-acting variation that regulates RNA transcription and processing.
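As an illustration of how ASE might be called from cDNA read counts at a heterozygous site, the sketch below tests whether reads carrying the two alleles depart from the 50:50 ratio expected without allelic imbalance. The abstract does not state which statistical test was used; the exact binomial test, the function names, and the example counts are all assumptions made for illustration only.

```python
from math import comb


def binomial_two_sided_p(k: int, n: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value.

    Sums the probability of every outcome that is no more likely than the
    observed count k. Intended for modest read depths (n up to a few
    hundred); very large n would overflow the float conversion.
    """
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = pmf[k]
    # Small tolerance so outcomes tied with the observed one are included.
    total = sum(q for q in pmf if q <= observed * (1 + 1e-9))
    return min(1.0, total)


def preferred_allele(ref_reads: int, alt_reads: int) -> str:
    """Return which allele has more reads ('ref', 'alt', or 'tie')."""
    if ref_reads > alt_reads:
        return "ref"
    if alt_reads > ref_reads:
        return "alt"
    return "tie"


def ase_call(ref_reads: int, alt_reads: int, alpha: float = 0.05):
    """Hypothetical ASE call for one heterozygous site.

    alpha is an illustrative threshold, not a value from the paper, and no
    multiple-testing correction is applied here.
    """
    n = ref_reads + alt_reads
    p_value = binomial_two_sided_p(ref_reads, n)
    return {
        "preferred": preferred_allele(ref_reads, alt_reads),
        "ref_fraction": ref_reads / n,
        "p_value": p_value,
        "ase": p_value < alpha,
    }


if __name__ == "__main__":
    # Made-up counts: 30 reads from one allele, 10 from the other.
    print(ase_call(30, 10))
```

Comparing the `preferred` field of such calls with the direction of a cis-eQTL effect at the same locus is one way the allele-level agreement described in the Results could be tallied. A real analysis would also need to correct for multiple testing and for read-mapping bias toward the reference allele.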
