Protein Information Resource: a community resource for expert annotation of protein data

The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200,000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-Inter-national databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP.

[1]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Amos Bairoch,et al.  The PROSITE database, its status in 1999 , 1999, Nucleic Acids Res..

[3]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[4]  Winona C. Barker,et al.  PIR-ALN: a database of protein sequence alignments , 1999, Bioinform..

[5]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[6]  Sean R. Eddy,et al.  Maximum Discrimination Hidden Markov Models of Sequence Consensus , 1995, J. Comput. Biol..

[7]  Friedhelm Pfeiffer,et al.  Database of protein sequence alignments: PIR-ALN , 1999, Nucleic Acids Res..

[8]  Cathy H. Wu,et al.  ProClass Protein Family Database , 1999, Nucleic Acids Res..

[9]  Cathy H. Wu,et al.  iProClass: an integrated, comprehensive and annotated protein classification database , 2001, Nucleic Acids Res..

[10]  Amos Bairoch,et al.  The PROSITE database, its status in 1997 , 1997, Nucleic Acids Res..

[11]  Michael Y. Galperin,et al.  The COG database: a tool for genome-scale analysis of protein functions and evolution , 2000, Nucleic Acids Res..

[12]  W C Barker,et al.  Superfamily classification in PIR-International Protein Sequence Database. , 1996, Methods in enzymology.

[13]  Robert M. Stephens,et al.  The RESID Database of protein structure modifications and the NRL-3D Sequence-Structure Database , 2001, Nucleic Acids Res..

[14]  Peter B. McGarvey,et al.  PIR: a new resource for bioinformatics , 2000, Bioinform..

[15]  Cathy H. Wu,et al.  Gene Family Identification Network Design for Protein Sequence Analysis , 1999, Int. J. Artif. Intell. Tools.

[16]  Amos Bairoch,et al.  The PROSITE database, its status in 2002 , 2002, Nucleic Acids Res..

[17]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.