Allelic imbalance metre (Allim), a new tool for measuring allele-specific gene expression with RNA-seq data

Estimating differences in gene expression among alleles is of high interest for many areas in biology and medicine. Here, we present a user‐friendly software tool, Allim, to estimate allele‐specific gene expression. Because mapping bias is a major problem for reliable estimates of allele‐specific gene expression using RNA‐seq, Allim combines two different strategies to account for the mapping biases. In order to reduce the mapping bias, Allim first generates a polymorphism‐aware reference genome that accounts for the sequence variation between the alleles. Then, a sequence‐specific simulation tool estimates the residual mapping bias. Statistical tests for allelic imbalance are provided that can be used with the bias corrected RNA‐seq data.

[1]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[2]  Christian Schlötterer,et al.  Evaluation of Different Reference Based Annotation Strategies Using RNA-Seq – A Case Study in Drososphila pseudoobscura , 2012, PloS one.

[3]  Lachlan James M. Coin,et al.  Inferring combined CNV/SNP haplotypes from genotype data , 2010, Bioinform..

[4]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[5]  Serban Nacu,et al.  Fast and SNP-tolerant detection of complex variants and splicing in short reads , 2010, Bioinform..

[6]  M. Gerstein,et al.  AlleleSeq: analysis of allele-specific expression and binding in a network framework , 2011, Molecular systems biology.

[7]  John C. Marioni,et al.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data , 2009, Bioinform..

[8]  Fatih Ozsolak,et al.  RNA sequencing: advances, challenges and opportunities , 2011, Nature Reviews Genetics.

[9]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[10]  J. Reifman,et al.  A new strategy to reduce allelic bias in RNA-Seq readmapping , 2012, Nucleic acids research.

[11]  Brian P. Brunk,et al.  Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM) , 2011, Bioinform..

[12]  L M McIntyre,et al.  Allelic imbalance in Drosophila hybrid heads: exons, isoforms, and evolution. , 2012, Molecular biology and evolution.

[13]  A. Amores,et al.  Identification of transcriptome SNPs between Xiphophorus lines and species for assessing allele specific gene expression within F₁ interspecies hybrids. , 2012, Comparative biochemistry and physiology. Toxicology & pharmacology : CBP.

[14]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[15]  Daniel A. Skelly,et al.  A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. , 2011, Genome research.

[16]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[17]  L. Coin,et al.  Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads , 2011, Genome Biology.

[18]  Cole Trapnell,et al.  Computational methods for transcriptome annotation and quantification using RNA-seq , 2011, Nature Methods.

[19]  A. Futschik,et al.  PoPoolation: A Toolbox for Population Genetic Analysis of Next Generation Sequencing Data from Pooled Individuals , 2011, PloS one.