CpGmotifs: a tool to discover DNA motifs associated to CpG methylation events

Background The investigation of molecular alterations associated with the conservation and variation of DNA methylation in eukaryotes is gaining interest in the biomedical research community. Among the different determinants of methylation stability, the DNA composition of the CpG surrounding regions has been shown to have a crucial role in the maintenance and establishment of methylation statuses. This aspect has been previously characterized in a quantitative manner by inspecting the nucleotidic composition in the region. Research in this field still lacks a qualitative perspective, linked to the identification of certain sequences (or DNA motifs) related to particular DNA methylation phenomena. Results Here we present a novel computational strategy based on short DNA motif discovery in order to characterize sequence patterns related to aberrant CpG methylation events. We provide our framework as a user-friendly, shiny-based application, CpGmotifs, to easily retrieve and characterize DNA patterns related to CpG methylation in the human genome. Our tool supports the functional interpretation of deregulated methylation events by predicting transcription factors binding sites (TFBS) encompassing the identified motifs. Conclusions CpGmotifs is an open source software. Its source code is available on GitHub https://github.com/Greco-Lab/CpGmotifs and a ready-to-use docker image is provided on DockerHub at https://hub.docker.com/r/grecolab/cpgmotifs .

[1]  Tal Galili,et al.  dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering , 2015, Bioinform..

[2]  Jun Wang,et al.  Identification of DNA motifs that regulate DNA methylation , 2019, bioRxiv.

[3]  J. Kere,et al.  Differential DNA Methylation in Purified Human Blood Cells: Implications for Cell Lineage and Studies on Disease Susceptibility , 2012, PloS one.

[4]  D. Eizirik,et al.  Interferon regulatory factor-1 is a key transcription factor in murine beta cells under immune attack , 2009, Diabetologia.

[5]  Eva K. Lee,et al.  Predicting aberrant CpG island methylation , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  K. Ikeo,et al.  DNA Methylome Analysis Identifies Transcription Factor-Based Epigenomic Signatures of Multilineage Competence in Neural Stem/Progenitor Cells. , 2017, Cell reports.

[7]  A. Feinberg,et al.  Increased methylation variation in epigenetic domains across cancer types , 2011, Nature Genetics.

[8]  A. Bird,et al.  Repression of genes by DNA methylation depends on CpG density and promoter strength: evidence for involvement of a methyl‐CpG binding protein. , 1992, The EMBO journal.

[9]  D. Greco,et al.  DNA sequence context as a marker of CpG methylation instability in normal and cancer tissues , 2020, Scientific Reports.

[10]  David J. Arenillas,et al.  JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework , 2017, Nucleic acids research.

[11]  Thomas E. Bartlett,et al.  Corruption of the Intra-Gene DNA Methylation Architecture Is a Hallmark of Cancer , 2013, PloS one.

[12]  W. Neuhofer Role of NFAT5 in Inflammatory Disorders Associated with Osmotic Stress , 2010, Current genomics.

[13]  Martin Vingron,et al.  A trans-acting locus regulates an anti-viral expression network and type 1 diabetes risk , 2010, Nature.

[14]  L. Glimcher,et al.  Protective role of nuclear factor of activated T cells 2 in CD8+ long-lived memory T cells in an allergy model. , 2008, The Journal of allergy and clinical immunology.

[15]  M. Betts,et al.  Characterization of T-Bet and Eomes in Peripheral Human Immune Cells , 2014, Front. Immunol..

[16]  William Stafford Noble,et al.  Quantifying similarity between motifs , 2007, Genome Biology.

[17]  D. Schübeler,et al.  Genomic patterns and context specific interpretation of DNA methylation. , 2014, Current opinion in genetics & development.

[18]  SEMplMe: A tool for integrating DNA methylation effects in transcription factor binding affinity predictions , 2020 .

[19]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[20]  Timothy L. Bailey,et al.  Gene expression Advance Access publication May 4, 2011 DREME: motif discovery in transcription factor ChIP-seq data , 2011 .

[21]  Y. Ben-David,et al.  The ets transcription factor Fli-1 in development, cancer and disease , 2014, Oncogene.

[22]  Feng Luo,et al.  MultiMotifMaker: A Multi-Thread Tool for Identifying DNA Methylation Motifs from Pacbio Reads , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.