CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis

Cap-analysis gene expression (CAGE) Basic and Analysis Databases store an original resource produced by CAGE, which measures expression levels of transcription starting sites by sequencing large amounts of transcript 5′ ends, termed CAGE tags. Millions of human and mouse high-quality CAGE tags derived from different conditions in >20 tissues consisting of >250 RNA samples are essential for identification of novel promoters and promoter characterization in the aspect of expression profile. CAGE Basic Database is a primary database of the CAGE resource, RNA samples, CAGE libraries, CAGE clone and tag sequences and so on. CAGE Analysis Database stores promoter related information, such as counts of related transcripts, CpG islands and conserved genome region. It also provides expression profiles at base pair and promoter levels. Both databases are based on the same framework, CAGE tag starting sites, tag clusters for defining promoters and transcriptional units (TUs). Their associations and TU attributes are available to find promoters of interest. These databases were provided for Functional Annotation Of Mouse 3 (FANTOM3), an international collaboration research project focusing on expanding the transcriptome and subsequent analyses. Now access is free for all users through the World Wide Web at .

[1]  Yoshihide Hayashizaki,et al.  EICO (Expression-based Imprint Candidate Organizer): finding disease-related imprinted genes , 2004, Nucleic Acids Res..

[2]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[3]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[4]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[5]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[6]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[7]  Sumio Sugano,et al.  5′-end SAGE for the analysis of transcriptional start sites , 2004, Nature Biotechnology.

[8]  E. Liu,et al.  Gene identification signature (GIS) analysis for transcriptome characterization and genome annotation , 2005, Nature Methods.

[9]  P. Bucher Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. , 1990, Journal of molecular biology.

[10]  Piero Carninci,et al.  Tag-based approaches for transcriptome research and genome annotation , 2005, Nature Methods.

[11]  Yoshihide Hayashizaki,et al.  Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis. , 2004, Genomics.

[12]  J. Kawai,et al.  A genome-wide and nonredundant mouse transcription factor database. , 2004, Biochemical and biophysical research communications.

[13]  M. Frommer,et al.  CpG islands in vertebrate genomes. , 1987, Journal of molecular biology.

[14]  J. T. Kadonaga,et al.  The RNA polymerase II core promoter. , 2003, Annual review of biochemistry.

[15]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[16]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Yoshihide Hayashizaki,et al.  Discovery of imprinted transcripts in the mouse transcriptome using large-scale expression profiling. , 2003, Genome research.

[18]  S. Batalov,et al.  Antisense Transcription in the Mammalian Transcriptome , 2005, Science.

[19]  Philipp Bucher,et al.  The Eukaryotic Promoter Database EPD: the impact of in silico primer extension , 2004, Nucleic Acids Res..

[20]  J. Kawai,et al.  Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[21]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[22]  Kenta Nakai,et al.  BTSS, DataBase of Transcriptional Start Sites: progress report 2004 , 2004, Nucleic Acids Res..