UniProtKB/Swiss-Prot.

The Swiss Institute of Bioinformatics (SIB), the European Bioinformatics Institute (EBI), and the Protein Information Resource (PIR) form the Universal Protein Resource (UniProt) consortium. Its main goal is to provide the scientific community with a central resource for protein sequences and functional information. The UniProt consortium maintains the UniProt KnowledgeBase (UniProtKB) and several supplementary databases including the UniProt Reference Clusters (UniRef) and the UniProt Archive (UniParc). (1) UniProtKB is a comprehensive protein sequence knowledgebase that consists of two sections: UniProtKB/Swiss-Prot, which contains manually annotated entries, and UniProtKB/TrEMBL, which contains computer-annotated entries. UniProtKB/Swiss-Prot entries contain information curated by biologists and provide users with cross-links to about 100 external databases and with access to additional information or tools. (2) The UniRef databases (UniRef100, UniRef90, and UniRef50) define clusters of protein sequences that share 100, 90, or 50% identity. (3) The UniParc database stores and maps all publicly available protein sequence data, including obsolete data excluded from UniProtKB. The UniProt databases can be accessed online (http://www.uniprot.org/) or downloaded in several formats (ftp://ftp.uniprot.org/pub). New releases are published every 2 weeks. The purpose of this chapter is to present a guided tour of a UniProtKB/Swiss-Prot entry, paying particular attention to the specificities of plant protein annotation. We will also present some of the tools and databases that are linked to each entry.

[1]  Ron D. Appel,et al.  ExPASy: the proteomics server for in-depth protein knowledge and analysis , 2003, Nucleic Acids Res..

[2]  L. Stein,et al.  Gramene, a Tool for Grass Genomics , 2002, Plant Physiology.

[3]  Qunfeng Dong,et al.  MaizeGDB, the community database for maize genetics and genomics , 2004, Nucleic Acids Res..

[4]  A. Bairoch,et al.  The Swiss-Prot protein knowledgebase and ExPASy: providing the plant community with high quality proteomic data and tools. , 2004, Plant physiology and biochemistry : PPB.

[5]  Cathy H. Wu,et al.  Plant Protein Annotation in the UniProt Knowledgebase1 , 2005, Plant Physiology.

[6]  Arnaud Couloux,et al.  GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts , 2004, Nucleic Acids Res..

[7]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[8]  Rodrigo Lopez,et al.  Public web-based services from the European Bioinformatics Institute , 2004, Nucleic Acids Res..

[9]  Thure Etzold,et al.  SRS - an indexing and retrieval tool for flat file data libraries , 1993, Comput. Appl. Biosci..

[10]  Amos Bairoch,et al.  ScanProsite: a reference implementation of a PROSITE scanning tool. , 2002, Applied bioinformatics.

[11]  Amos Bairoch,et al.  Swiss-Prot: Juggling between evolution and stability , 2004, Briefings Bioinform..

[12]  William H. Press,et al.  Numerical Recipes in C, 2nd Edition , 1992 .

[13]  Jungwon Yoon,et al.  The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community , 2003, Nucleic Acids Res..

[14]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[15]  R D Appel,et al.  Protein identification and analysis tools in the ExPASy server. , 1999, Methods in molecular biology.

[16]  Amos Bairoch,et al.  The ENZYME database in 2000 , 2000, Nucleic Acids Res..