rSeqDiff: Detecting Differential Isoform Expression from RNA-Seq Data Using Hierarchical Likelihood Ratio Test

High-throughput sequencing of transcriptomes (RNA-Seq) has recently become a powerful tool for the study of gene expression. We present rSeqDiff, an efficient algorithm for the detection of differential expression and differential splicing of genes from RNA-Seq experiments across multiple conditions. Unlike existing approaches, which detect differential expression of transcripts, our approach considers three cases for each gene: 1) no differential expression, 2) differential expression without differential splicing, and 3) differential splicing. We specify statistical models characterizing each of these three cases and use a hierarchical likelihood ratio test for model selection. Simulation studies show that our approach achieves good power for detecting differentially expressed or differentially spliced genes. Comparisons with competing methods on two real RNA-Seq datasets demonstrate that our approach provides accurate estimates of isoform abundances and biologically meaningful rankings of differentially spliced genes. The proposed approach is implemented as an R package named rSeqDiff.
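
To make the three-case model selection concrete, here is a minimal Python sketch of a hierarchical likelihood ratio test for a single gene. It is an illustration under simplifying assumptions, not the rSeqDiff implementation: every read is assumed to be assigned to exactly one isoform, counts are treated as independent Poisson variables, isoform lengths are ignored, and the two-stage ordering (model 0 against model 2, then model 1 against model 2) is one plausible choice that may differ from the paper's. The function names `chi2_sf`, `poisson_loglik`, and `select_model` are hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df >= 1."""
    if x <= 0:
        return 1.0
    h = x / 2.0
    # Closed-form base cases: df = 1 (a = 1/2) and df = 2 (a = 1).
    if df % 2 == 1:
        q, a = math.erfc(math.sqrt(h)), 0.5
    else:
        q, a = math.exp(-h), 1.0
    # Recurrence Q(a + 1, h) = Q(a, h) + h^a * exp(-h) / Gamma(a + 1).
    while a < df / 2.0 - 1e-9:
        q += math.exp(a * math.log(h) - h - math.lgamma(a + 1.0))
        a += 1.0
    return min(q, 1.0)


def poisson_loglik(counts, means):
    """Poisson log-likelihood of a K x I count table given fitted means."""
    ll = 0.0
    for row_n, row_mu in zip(counts, means):
        for n, mu in zip(row_n, row_mu):
            if mu > 0:
                ll += n * math.log(mu) - mu - math.lgamma(n + 1)
            elif n > 0:
                return float("-inf")
    return ll


def select_model(counts, depths, alpha=0.05):
    """Return 0, 1, or 2 for one gene (assumes K >= 2 conditions, I >= 2 isoforms).

    counts[k][i]: reads assigned to isoform i in condition k (assumed unique).
    depths[k]: relative sequencing depth of condition k (assumed positive).
    """
    K, I = len(counts), len(counts[0])
    row = [sum(r) for r in counts]
    col = [sum(counts[k][i] for k in range(K)) for i in range(I)]
    total = sum(row)
    if total == 0:
        return 0  # no reads, so no evidence against model 0
    W = sum(depths)
    # Model 0: one abundance per isoform, shared by all conditions.
    mu0 = [[depths[k] * col[i] / W for i in range(I)] for k in range(K)]
    # Model 1: shared isoform proportions, condition-specific gene level.
    mu1 = [[row[k] * col[i] / total for i in range(I)] for k in range(K)]
    # Model 2: every isoform abundance free in every condition.
    mu2 = [[float(n) for n in r] for r in counts]
    ll0, ll1, ll2 = (poisson_loglik(counts, m) for m in (mu0, mu1, mu2))
    # Stage 1 (assumed ordering): any change at all? Model 0 vs model 2.
    if chi2_sf(2.0 * (ll2 - ll0), (K - 1) * I) >= alpha:
        return 0
    # Stage 2: do isoform proportions change? Model 1 vs model 2.
    if chi2_sf(2.0 * (ll2 - ll1), (K - 1) * (I - 1)) >= alpha:
        return 1
    return 2
```

Under these assumptions, a gene with counts `[[100, 100], [200, 200]]` at equal depths is assigned to case 2 (the function returns 1, since total expression doubles while isoform proportions stay fixed), whereas `[[150, 50], [50, 150]]` is assigned to case 3 (the function returns 2, since the proportions switch). A genome-wide analysis would also need a multiple-testing correction, which this sketch omits.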
