Experimental analysis of the annotation of promoters in the public database.

The ability to identify and examine promoter elements is important to researchers who wish to understand how gene expression is regulated in normal and pathological states. Unfortunately, few human promoters have been defined directly by experiment. To determine whether promoter sequences can be identified simply by aligning mRNA and genomic sequences, we used a reporter gene assay to assess the promoter activity of the region immediately 5' of 38 mRNAs mapping to chromosome 21. For comparison, we measured the activities of 19 sequences not thought to be promoters and 39 sequences taken from the Eukaryotic Promoter Database. Our results suggest that aligning reference mRNAs to genomic sequence allows promoters to be identified for at least 75% of genes. These data provide the first empirical evidence that the current annotation of the genome is sufficient to allow molecular geneticists to correctly identify promoter sequences for most genes for which reference mRNA and genomic sequences are available.
