Critical Evaluation of Imprinted Gene Expression by RNA–Seq: A New Perspective

In contrast to existing estimates of approximately 200 murine imprinted genes, recent work based on transcriptome sequencing uncovered parent-of-origin allelic effects at more than 1,300 loci in the developing brain and two adult brain regions, including hundreds present in only males or females. Our independent replication of the embryonic brain stage, where the majority of novel imprinted genes were discovered and the majority of previously known imprinted genes confirmed, resulted in only 12.9% concordance among the novel imprinted loci. Further analysis and pyrosequencing-based validation revealed that the vast majority of the novel reported imprinted loci are false-positives explained by technical and biological variation of the experimental approach. We show that allele-specific expression (ASE) measured with RNA–Seq is not accurately modeled with statistical methods that assume random independent sampling and that systematic error must be accounted for to enable accurate identification of imprinted expression. Application of a robust approach that accounts for these effects revealed 50 candidate genes where allelic bias was predicted to be parent-of-origin–dependent. However, 11 independent validation attempts through a range of allelic expression biases confirmed only 6 of these novel cases. The results emphasize the importance of independent validation and suggest that the number of imprinted genes is much closer to the initial estimates.

[1]  D. Barlow,et al.  The mouse insulin-like growth factor type-2 receptor is imprinted and closely linked to the Tme locus , 1991, Nature.

[2]  Emily H Turner,et al.  Targeted Capture and Massively Parallel Sequencing of Twelve Human Exomes , 2009, Nature.

[3]  David Haig,et al.  Sex-Specific Parent-of-Origin Allelic Expression in the Mouse Brain , 2010, Science.

[4]  Bradley J. Main,et al.  BMC Genomics BioMed Central Methodology article Allele-specific expression assays using Solexa , 2009 .

[5]  A. Efstratiadis,et al.  Parental imprinting of the mouse insulin-like growth factor II gene , 1991, Cell.

[6]  S. Elbein,et al.  Detection of allelic imbalance in gene expression using pyrosequencing. , 2007, Methods in molecular biology.

[7]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[8]  I. Goodhead,et al.  Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution , 2008, Nature.

[9]  T. Bestor,et al.  WAMIDEX: A web atlas of murine genomic imprinting and differential expression , 2008, Epigenetics.

[10]  M. Surani,et al.  Development of reconstituted mouse eggs suggests imprinting of the genome during gametogenesis , 1984, Nature.

[11]  Thomas M. Keane,et al.  Mouse genomic variation and its effect on phenotypes and gene regulation , 2011, Nature.

[12]  David Haussler,et al.  The UCSC genome browser database: update 2007 , 2006, Nucleic Acids Res..

[13]  Kenta Nakai,et al.  Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis. , 2011, Genome research.

[14]  J. Cavaille,et al.  Non‐coding RNAs in imprinted gene clusters , 2008, Biology of the cell.

[15]  L. Maquat,et al.  Failsafe nonsense-mediated mRNA decay does not detectably target eIF4E-bound mRNA , 2007, Nature Structural &Molecular Biology.

[16]  S. Luo,et al.  High-Resolution Analysis of Parent-of-Origin Allelic Expression in the Mouse Brain , 2010, Science.

[17]  M. Surani,et al.  Embryological and molecular investigations of parental imprinting on mouse chromosome 7 , 1991, Nature.

[18]  T. Babak,et al.  Global Survey of Genomic Imprinting by Transcriptome Sequencing , 2008, Current Biology.

[19]  Lira Mamanova,et al.  FRT-seq: Amplification-free, strand-specific, transcriptome sequencing , 2010, Nature Methods.

[20]  K. Hahm,et al.  Cloning of novel trinucleotide-repeat (CAG) containing genes in mouse brain. , 1997, Biochemical and biophysical research communications.

[21]  M. Hidalgo,et al.  Pyrosequencing protocol using a universal biotinylated primer for mutation detection and SNP genotyping , 2007, Nature Protocols.

[22]  Eric E Schadt,et al.  Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation , 2010, BMC Genomics.

[23]  Paul Scherz,et al.  Functional analysis of secreted and transmembrane proteins critical to mouse development , 2001, Nature Genetics.

[24]  Kiyoshi Asai,et al.  The Functional RNA Database 3.0: databases to support mining and annotation of functional RNAs , 2008, Nucleic Acids Res..

[25]  Y. Ihara,et al.  One of the antigenic determinants of paired helical filaments is related to tau protein. , 1986, Journal of biochemistry.

[26]  C. Beechey,et al.  Complementation studies with mouse translocations. , 1978, Cytogenetics and cell genetics.

[27]  John C. Marioni,et al.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data , 2009, Bioinform..

[28]  Aya Kojima,et al.  fRNAdb: a platform for mining/annotating functional RNA candidates from non-coding RNA sequences , 2006, Nucleic Acids Res..

[29]  U. Francke,et al.  Lack of Pwcr1/MBII-85 snoRNA is critical for neonatal lethality in Prader–Willi syndrome mouse models , 2005, Mammalian Genome.

[30]  E. Mardis,et al.  Transcriptome-Wide Identification of Novel Imprinted Genes in Neonatal Mouse Brain , 2008, PloS one.

[31]  D. Barlow,et al.  Gametic Imprinting in Mammals , 1995, Science.

[32]  al. et,et al.  Massive cell death of immature hematopoietic cells and neurons in Bcl-x-deficient mice , 1995, Science.

[33]  D. Clayton,et al.  Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing , 2009, Human molecular genetics.

[34]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[35]  Charles Lee,et al.  Identification of the Imprinted KLF14 Transcription Factor Undergoing Human-Specific Accelerated Evolution , 2007, PLoS genetics.

[36]  A. Wood,et al.  Genomic Imprinting of Dopa decarboxylase in Heart and Reciprocal Allelic Expression with Neighboring Grb10 , 2007, Molecular and Cellular Biology.

[37]  M. Bartolomei,et al.  Parental imprinting of the mouse H19 gene , 1991, Nature.

[38]  C. Nusbaum,et al.  Key considerations for measuring allelic expression on a genomic scale using high‐throughput sequencing , 2010, Molecular ecology.

[39]  Michael Krawczak,et al.  Statistical inference of allelic imbalance from transcriptome data , 2011, Human mutation.

[40]  Eleazar Eskin,et al.  A sequence-based variation map of 8.27 million SNPs in inbred mouse strains , 2007, Nature.

[41]  R. Oakey,et al.  Retrotransposition and genomic imprinting. , 2010, Briefings in functional genomics.

[42]  A. Wood,et al.  Allele-specific demethylation at an imprinted mammalian promoter , 2007, Nucleic acids research.

[43]  J. Thierry-Mieg,et al.  AceView: a comprehensive cDNA-supported gene and transcripts annotation , 2006, Genome Biology.

[44]  Z. Ning,et al.  Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of GC-biased genomes , 2009, Nature Methods.

[45]  K. Anderson,et al.  The coiled-coil domain containing protein CCDC40 is essential for motile cilia function and left-right axis formation , 2011, Nature Genetics.

[46]  J. Graves,et al.  Evolution of genomic imprinting: insights from marsupials and monotremes. , 2009, Annual review of genomics and human genetics.

[47]  H. Spencer,et al.  A census of mammalian imprinting. , 2005, Trends in genetics : TIG.

[48]  DP Barlow Methylation and imprinting: from host defense to gene regulation? , 1993, Science.

[49]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[50]  S. Scherer,et al.  Comparative analysis of human chromosome 7q21 and mouse proximal chromosome 6 reveals a placental-specific imprinted gene, TFPI2/Tfpi2, which requires EHMT2 and EED for allelic-silencing. , 2008, Genome research.

[51]  T. Borodina,et al.  Transcriptome analysis by strand-specific sequencing of complementary DNA , 2009, Nucleic acids research.

[52]  A. Wood,et al.  Chromosome-wide identification of novel imprinted genes using microarrays and uniparental disomies , 2006, Nucleic acids research.

[53]  L. Coin,et al.  Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads , 2011, Genome Biology.