The discovery, positioning and verification of a set of transcription-associated motifs in vertebrates

We have developed several new methods for investigating transcriptional motifs in vertebrates. These include an alignment tool designed specifically for regions involved in transcriptional control, and an exhaustive enumeration of all possible 12-mers, each assessed for involvement in transcription on the basis of its conservation across mammals. We then used deeper comparative analysis across vertebrates to identify the active instances of these motifs. Experiments in medaka fish show that a subset of these predictions is involved in transcription.
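The idea of scoring every 12-mer by its conservation can be illustrated with a small sketch. This is not the alignment tool or scoring scheme used in this work, which the abstract does not specify. It assumes a simple pairwise alignment given as two equal-length rows with `-` for gaps, and it treats a window as conserved only when both rows are identical and gap-free. The function names `conserved_kmer_counts` and `conservation_rates` are hypothetical.

```python
from collections import Counter


def conserved_kmer_counts(ref_aln, other_aln, k=12):
    """Count every k-mer in the reference row and how often it is conserved.

    Assumption: ref_aln and other_aln are two rows of one pairwise alignment,
    with '-' marking gaps. A window counts as conserved only if the second row
    is identical to the reference over all k columns.
    """
    if len(ref_aln) != len(other_aln):
        raise ValueError("aligned rows must have equal length")
    ref_aln = ref_aln.upper()
    other_aln = other_aln.upper()
    total = Counter()
    conserved = Counter()
    for start in range(len(ref_aln) - k + 1):
        window = ref_aln[start:start + k]
        # Skip windows that span a gap or an ambiguous base in the reference.
        if "-" in window or "N" in window:
            continue
        total[window] += 1
        if other_aln[start:start + k] == window:
            conserved[window] += 1
    return total, conserved


def conservation_rates(total, conserved, min_count=1):
    """Return the fraction of conserved occurrences for each k-mer seen often enough."""
    return {
        kmer: conserved[kmer] / n
        for kmer, n in total.items()
        if n >= min_count
    }


if __name__ == "__main__":
    # Toy rows with k=4 to keep the example readable; the text uses k=12.
    ref = "ACGTACGTTT"
    other = "ACGTACCTTT"
    total, conserved = conserved_kmer_counts(ref, other, k=4)
    for kmer, rate in sorted(conservation_rates(total, conserved).items()):
        print(kmer, total[kmer], rate)
```

In practice a motif's conservation rate would be compared against a background expectation (for example, the rate for all k-mers of similar composition) before calling it a candidate; that step is omitted here.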
