Motivation: Analysis of next‐generation sequencing data often results in a list of genomic regions. These may include differentially methylated CpGs/regions, transcription factor binding sites, interacting chromatin regions, or GWAS‐associated SNPs, among others. A common analysis step is to annotate such genomic regions to genomic annotations (promoters, exons, enhancers, etc.). Existing tools are limited by a lack of annotation sources and flexible options, the time it takes to annotate regions, an artificial one‐to‐one region‐to‐annotation mapping, a lack of visualization options to easily summarize data, or some combination thereof. Results: We developed the annotatr Bioconductor package to flexibly and quickly summarize and plot annotations of genomic regions. The annotatr package reports all intersections of regions and annotations, giving a better understanding of the genomic context of the regions. A variety of graphics functions are implemented to easily plot numerical or categorical data associated with the regions across the annotations, and across annotation intersections, providing insight into how characteristics of the regions differ across the annotations. We demonstrate that annotatr is up to 27× faster than comparable R packages. Overall, annotatr enables a richer biological interpretation of experiments. Availability and Implementation: http://bioconductor.org/packages/annotatr/ and https://github.com/rcavalcante/annotatr Contact: rcavalca@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online.
[1]
Robert Gentleman,et al.
Software for Computing and Annotating Genomic Ranges
,
2013,
PLoS Comput. Biol..
[2]
Yongseok Park,et al.
MethylSig: a whole genome DNA methylation analysis pipeline
,
2014,
Bioinform..
[3]
ENCODEConsortium,et al.
An Integrated Encyclopedia of DNA Elements in the Human Genome
,
2012,
Nature.
[4]
Felix Krueger,et al.
Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications
,
2011,
Bioinform..
[5]
Hadley Wickham,et al.
ggplot2 - Elegant Graphics for Data Analysis (2nd Edition)
,
2017
.
[6]
T. Meehan,et al.
An atlas of active enhancers across human cell types and tissues
,
2014,
Nature.
[7]
J. Licht,et al.
Leukemic IDH1 and IDH2 mutations result in a hypermethylation phenotype, disrupt TET2 function, and impair hematopoietic differentiation.
,
2010,
Cancer cell.
[8]
Clifford A. Meyer,et al.
Model-based Analysis of ChIP-Seq (MACS)
,
2008,
Genome Biology.
[9]
David S. Lapointe,et al.
ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data
,
2010,
BMC Bioinformatics.
[10]
Aaron R. Quinlan,et al.
BIOINFORMATICS APPLICATIONS NOTE
,
2022
.
[11]
Jeffrey M. Bhasin,et al.
Goldmine integrates information placing genomic ranges into meaningful biological contexts
,
2016,
Nucleic acids research.
[12]
J. Harrow,et al.
GENCODE: producing a reference annotation for ENCODE
,
2006,
Genome Biology.