Summary Sequence logos have become a crucial visualization method for studying underlying sequence patterns in the genome. Despite this, there remains a scarcity of software packages that provide the versatility often required for such visualizations. ggseqlogo is an R package built on the ggplot2 package that aims to address this issue. ggseqlogo offers native illustration of publication‐ready DNA, RNA and protein sequence logos in a highly customizable fashion with features including multi‐logo plots, qualitative and quantitative colour schemes, annotation of logos and integration with other plots. The package is intuitive to use and seamlessly integrates into R analysis pipelines. Availability and implementation ggseqlogo is released under the GNU licence and is freely available via CRAN‐The Comprehensive R Archive Network https://cran.r‐project.org/web/packages/ggseqlogo. A detailed tutorial can be found at https://omarwagih.github.io/ggseqlogo. Contact wagih@ebi.ac.uk
[1]
Hadley Wickham,et al.
ggplot2 - Elegant Graphics for Data Analysis (2nd Edition)
,
2017
.
[2]
G. Crooks,et al.
WebLogo: a sequence logo generator.
,
2004,
Genome research.
[3]
B. Deplancke,et al.
The Genetics of Transcription Factor DNA Binding Variation
,
2016,
Cell.
[4]
Morten Nielsen,et al.
SigniSite: Identification of residue-level genotype-phenotype correlations in protein multiple sequence alignments
,
2013,
Nucleic Acids Res..
[5]
Gary D Bader,et al.
MIMP: predicting the impact of mutations on kinase-substrate phosphorylation
,
2015,
Nature Methods.
[6]
T. D. Schneider,et al.
Sequence logos: a new way to display consensus sequences.
,
1990,
Nucleic acids research.