Integrative Analysis of Histone ChIP‐seq and RNA‐seq Data

The R package epigenomix has been designed to detect differentially transcribed gene isoforms that, in addition, exhibit altered histone modifications at their respective genomic loci. The package provides methods to map histone ChIP‐seq profiles to isoforms and estimate their transcript abundances from RNA‐seq data. Based on the differences observed between case and control samples in the RNA‐seq and ChIP‐seq data, a correlation measure is calculated for each isoform. The distribution of this correlation measure is further investigated by a Bayesian mixture model to (i) reveal the relationship between the studied histone modification and transcriptional activity, and (ii) detect specific isoforms with differences in both transcription values and histone modifications. The method is designed for experiments with a few or no replicates, and is superior to separate analyses of both data types in that setting. This unit illustrates the integrative analysis of ChIP‐seq and RNA‐seq data with epigenomix. © 2016 by John Wiley & Sons, Inc.

[1]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[2]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[3]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[4]  Daniel R. Zerbino,et al.  Ensembl 2016 , 2015, Nucleic Acids Res..

[5]  Martin Dugas,et al.  Integrative Analyses for Omics Data: A Bayesian Mixture Model to Assess the Concordance of ChIP-chip and ChIP-seq Measurements , 2012, Journal of toxicology and environmental health. Part A.

[6]  Pearlly Yan,et al.  Comparative study on ChIP-seq data: normalization and binding pattern characterization , 2009, Bioinform..

[7]  Lior Pachter,et al.  Near-optimal RNA-Seq quantification , 2015, ArXiv.

[8]  Martin Schäfer,et al.  Integrative analysis of histone ChIP-seq and transcription data using Bayesian mixture models , 2014, Bioinform..

[9]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[10]  P. Park,et al.  Design and analysis of ChIP-seq experiments for DNA-binding proteins , 2008, Nature Biotechnology.

[11]  Aaron T. L. Lun,et al.  De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly , 2014, Nucleic acids research.

[12]  Dario Strbenac,et al.  Regional activation of the cancer genome by long-range epigenetic remodeling. , 2013, Cancer cell.

[13]  Claudia Angelini,et al.  Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems , 2014, Front. Cell Dev. Biol..

[14]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[15]  Günter P. Wagner,et al.  Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples , 2012, Theory in Biosciences.