W-ChIPMotifs: a web application tool for de novo motif discovery from ChIP-based high-throughput data

Summary: W-ChIPMotifs is a web application tool that provides a user friendly interface for de novo motif discovery. The web tool is based on our previous ChIPMotifs program which is a de novo motif finding tool developed for ChIP-based high-throughput data and incorporated various ab initio motif discovery tools such as MEME, MaMF, Weeder and optimized the significance of the detected motifs by using a bootstrap resampling statistic method and a Fisher test. Use of a randomized statistical model like bootstrap resampling can significantly increase the accuracy of the detected motifs. In our web tool, we have modified the program in two aspects: (i) we have refined the P-value with a Bonferroni correction; (ii) we have incorporated the STAMP tool to infer phylogenetic information and to determine the detected motifs if they are novel and known using the TRANSFAC and JASPAR databases. A comprehensive result file is mailed to users. Availability: http://motif.bmi.ohio-state.edu/ChIPMotifs. Data used in the article may be downloaded from http://motif.bmi.ohio-state.edu/ChIPMotifs/examples.shtml. Contact: victor.jin@osumc.edu

[1]  X. Chen,et al.  The Oct4 and Nanog transcription network regulates pluripotency in mouse embryonic stem cells , 2006, Nature Genetics.

[2]  Xin Chen,et al.  TRANSFAC: an integrated system for gene expression regulation , 2000, Nucleic Acids Res..

[3]  Graziano Pesole,et al.  Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes , 2004, Nucleic Acids Res..

[4]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[5]  Allen D. Delaney,et al.  Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing , 2007, Nature Methods.

[6]  Qing Zhou,et al.  A boosting approach for motif modeling using ChIP-chip data , 2005, Bioinform..

[7]  Nicolas Mermod,et al.  Evaluation of computer tools for the prediction of transcription factor binding sites on genomic DNA , 1998, Silico Biol..

[8]  William Stafford Noble,et al.  Assessing computational tools for the discovery of transcription factor binding sites , 2005, Nature Biotechnology.

[9]  Ernest Fraenkel,et al.  TAMO: a flexible, object-oriented framework for analyzing transcriptional regulation using DNA-sequence motifs , 2005 .

[10]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[11]  Wyeth W. Wasserman,et al.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles , 2004, Nucleic Acids Res..

[12]  Ernest Fraenkel,et al.  TAMO: a flexible, object-oriented framework for analyzing transcriptional regulation using DNA-sequence motifs , 2005, Bioinform..

[13]  T. D. Schneider,et al.  Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E. coli. , 1982, Nucleic acids research.

[14]  J. Lieb,et al.  Progress and challenges in profiling the dynamics of chromatin and transcription factor binding with DNA microarrays. , 2004, Current opinion in genetics & development.

[15]  Henriette O'Geen,et al.  Identification of an OCT4 and SRY regulatory module using integrated computational and experimental genomics approaches. , 2007, Genome research.

[16]  T. Hubbard,et al.  NestedMICA: sensitive inference of over-represented motifs in nucleic acid sequence , 2005, Nucleic acids research.

[17]  Panayiotis V. Benos,et al.  STAMP: a web tool for exploring DNA-binding motif similarities , 2007, Nucleic Acids Res..

[18]  Tim Hui-Ming Huang,et al.  Isolating human transcription factor targets by coupling chromatin immunoprecipitation and CpG island microarray analysis. , 2002, Genes & development.

[19]  Jun S. Liu,et al.  Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. , 1993, Science.

[20]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[21]  Gary D. Stormo,et al.  Identifying DNA and protein patterns with statistically significant alignments of multiple sequences , 1999, Bioinform..

[22]  E. Birney,et al.  Trawler: de novo regulatory motif discovery pipeline for chromatin immunoprecipitation , 2007, Nature Methods.

[23]  Charles Elkan,et al.  The Value of Prior Knowledge in Discovering Motifs with MEME , 1995, ISMB.

[24]  J Moult,et al.  Genetic algorithms for protein structure prediction. , 1996, Current opinion in structural biology.

[25]  Ajay N. Jain,et al.  A deterministic motif finding algorithm with application to the human genome , 2006, Bioinform..