The Arabidopsis Information Resource (TAIR): gene structure and function annotation

The Arabidopsis Information Resource (TAIR, http://arabidopsis.org) is the model organism database for the fully sequenced and intensively studied model plant Arabidopsis thaliana. Data in TAIR is derived in large part from manual curation of the Arabidopsis research literature and direct submissions from the research community. New developments at TAIR include the addition of the GBrowse genome viewer to the TAIR site, a redesigned home page, navigation structure and portal pages to make the site more intuitive and easier to use, the launch of several TAIR web services and a new genome annotation release (TAIR7) in April 2007. A combination of manual and computational methods were used to generate this release, which contains 27 029 protein-coding genes, 3889 pseudogenes or transposable elements and 1123 ncRNAs (32 041 genes in all, 37 019 gene models). A total of 681 new genes and 1002 new splice variants were added. Overall, 10 098 loci (one-third of all loci from the previous TAIR6 release) were updated for the TAIR7 release.

[1]  J. Ohlrogge,et al.  Sampling the Arabidopsis Transcriptome with Massively Parallel Pyrosequencing1[W][OA] , 2007, Plant Physiology.

[2]  Jean-Michel Claverie,et al.  FusionDB: a database for in-depth analysis of prokaryotic gene fusion events , 2004, Nucleic Acids Res..

[3]  B. Haas,et al.  Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release , 2005, BMC Biology.

[4]  K. Silverstein,et al.  Genome Organization of More Than 300 Defensin-Like Genes in Arabidopsis1[w] , 2005, Plant Physiology.

[5]  Mark D. Wilkinson,et al.  BioMOBY: An Open Source Biological Web Services Proposal , 2002, Briefings Bioinform..

[6]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[7]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[8]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[9]  Jia Ye,et al.  Vertebrate gene predictions and the problem of large genes , 2003, Nature Reviews Genetics.

[10]  Joseph M. Dale,et al.  Empirical Analysis of Transcriptional Activity in the Arabidopsis Genome , 2003, Science.

[11]  Stephen M. Mount,et al.  Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. , 2003, Nucleic acids research.

[12]  Wen-Hsiung Li,et al.  A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection. , 2007, Genome research.

[13]  G. Phillips,et al.  Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tiling arrays. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[14]  加島 倫 Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants. , 2007, The Plant journal : for cell and molecular biology.

[15]  S. Brenner,et al.  Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Lior Pachter,et al.  VISTA: computational tools for comparative genomics , 2004, Nucleic Acids Res..