MetaDiff: differential isoform expression analysis using random-effects meta-regression

BackgroundRNA sequencing (RNA-Seq) allows an unbiased survey of the entire transcriptome in a high-throughput manner. A major application of RNA-Seq is to detect differential isoform expression across experimental conditions, which is of great biological interest due to its direct relevance to protein function and disease pathogenesis. Detection of differential isoform expression is challenging because of uncertainty in isoform expression estimation owing to ambiguous reads and variability in precision of the estimates across samples. It is desirable to have a method that can account for these issues and is flexible enough to allow adjustment for covariates.ResultsIn this paper, we present MetaDiff, a random-effects meta-regression model that naturally fits for the above purposes. Through extensive simulations and analysis of an RNA-Seq dataset on human heart failure, we show that the random-effects meta-regression approach is computationally fast, reliable, and can improve the power of differential expression analysis while controlling for false positives due to the effect of covariates or confounding variables. In contrast, several existing methods either fail to control false discovery rate or have reduced power in the presence of covariates or confounding variables. The source code, compiled JAR package and documentation of MetaDiff are freely available at https://github.com/jiach/MetaDiff.ConclusionOur results indicate that random-effects meta-regression offers a flexible framework for differential expression analysis of isoforms, particularly when gene expression is influenced by other variables.

[1]  Burns C Blaxall,et al.  Phospholipase C epsilon modulates beta-adrenergic receptor-dependent cardiac contraction and inhibits cardiac hypertrophy. , 2005, Circulation research.

[2]  Mingyao Li,et al.  PennSeq: accurate isoform-specific gene expression quantification in RNA-Seq by modeling non-uniform read distribution , 2013, Nucleic acids research.

[3]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[4]  J. Bähler,et al.  Tuning gene expression to changing environments: from rapid responses to evolutionary adaptation , 2008, Nature Reviews Genetics.

[5]  David G Hendrickson,et al.  Differential analysis of gene regulation at transcript resolution with RNA-seq , 2012, Nature Biotechnology.

[6]  Antti Honkela,et al.  Identifying differentially expressed transcripts from RNA-seq data with biological variation , 2011, Bioinform..

[7]  Jürg Bähler,et al.  Tuning gene expression to changing environments: from rapid responses to evolutionary adaptation , 2009, Nature Reviews Genetics.

[8]  Ning Leng,et al.  EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments , 2013, Bioinform..

[9]  Anna Strömberg,et al.  Gender Differences in Patients with Heart Failure , 2003, European journal of cardiovascular nursing : journal of the Working Group on Cardiovascular Nursing of the European Society of Cardiology.

[10]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[11]  Rona S. Gertner,et al.  Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells , 2013, Nature.

[12]  L. Coin,et al.  Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads , 2011, Genome Biology.

[13]  A. Logan,et al.  Statistical Issues in a Metaregression Analysis of Randomized Trials: Impact on the Dietary Sodium Intake and Blood Pressure Relationship , 1999, Biometrics.

[14]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[15]  Leopold Parts,et al.  Gene expression changes with age in skin, adipose tissue, blood and brain , 2013, Genome Biology.

[16]  Alan V. Smrcka,et al.  Phospholipase C (cid:1) Modulates (cid:2) -Adrenergic Receptor Dependent Cardiac Contraction and Inhibits Cardiac Hypertrophy , 2005 .

[17]  Ingmar Visser,et al.  Testing overall and moderator effects in random effects meta-regression. , 2011, The British journal of mathematical and statistical psychology.

[18]  C S Berkey,et al.  A random-effects regression model for meta-analysis. , 1995, Statistics in medicine.

[19]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[20]  Julian P T Higgins,et al.  Controlling the risk of spurious findings from meta‐regression , 2004, Statistics in medicine.

[21]  R. Guigó,et al.  Modelling and simulating generic RNA-Seq experiments with the flux simulator , 2012, Nucleic acids research.

[22]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[23]  C. Hart,et al.  PDGF-A is required for normal murine cardiovascular development. , 1996, Developmental biology.

[24]  S. Du,et al.  SMYD Proteins: Key Regulators in Skeletal and Cardiac Muscle Development and Function , 2014, Anatomical record.

[25]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[26]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[27]  Ernest Turro,et al.  Flexible analysis of RNA-seq data using mixed effects models , 2014, Bioinform..

[28]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[29]  R. Sandberg,et al.  Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells , 2014, Science.