MIPS: a database for genomes and protein sequences

The Munich Information Center for Protein Sequences (MIPS-GSF), Martinsried near Munich, Germany, develops and maintains genome oriented databases. It is commonplace that the amount of sequence data available increases rapidly, but not the capacity of qualified manual annotation at the sequence databases. Therefore, our strategy aims to cope with the data stream by the comprehensive application of analysis tools to sequences of complete genomes, the systematic classification of protein sequences and the active support of sequence analysis and functional genomics projects. This report describes the systematic and up-to-date analysis of genomes (PEDANT), a comprehensive database of the yeast genome (MYGD), a database reflecting the progress in sequencing the Arabidopsis thaliana genome (MATD), the database of assembled, annotated human EST clusters (MEST), and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). MIPS provides access through its WWW server (http://www.mips.biochem.mpg.de) to a spectrum of generic databases, including the above mentioned as well as a database of protein families (PROTFAM), the MITOP database, and the all-against-all FASTA database.

[1]  P. Piffanelli,et al.  Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana , 1998, Nature.

[2]  Amos Bairoch,et al.  The PROSITE database, its status in 1997 , 1997, Nucleic Acids Res..

[3]  Dmitrij Frishman,et al.  PEDANTic genome analysis , 1997 .

[4]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[5]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[6]  Jérôme Gracy,et al.  Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment , 1998, Bioinform..

[7]  Sean R. Eddy,et al.  Pfam: multiple sequence alignments and HMM-profiles of protein domains , 1998, Nucleic Acids Res..

[8]  G. Schuler Pieces of the puzzle: expressed sequence tags and the catalog of human genes , 1997, Journal of Molecular Medicine.

[9]  W C Barker,et al.  Superfamily classification in PIR-International Protein Sequence Database. , 1996, Methods in enzymology.

[10]  B. Dujon,et al.  Genomic Exploration of the Hemiascomycetous Yeasts: 1. A set of yeast species for molecular evolution studies 1 , 2000, FEBS letters.

[11]  X. Huang,et al.  An improved sequence assembly program. , 1996, Genomics.

[12]  Mikhail S. Gelfand,et al.  Combining diverse evidence for gene recognition in completely sequenced bacterial genomes , 1998, German Conference on Bioinformatics.

[13]  Thomas Meitinger,et al.  MITOP, the mitochondrial proteome database: 2000 update , 2000, Nucleic Acids Res..

[14]  P. Deloukas,et al.  A Gene Map of the Human Genome , 1996, Science.

[15]  Thomas Meitinger,et al.  MITOP, THE MITOCHONDRIAL PROTEOME DATABASE , 2000 .

[16]  Peter Sommerlad,et al.  Pattern-Oriented Software Architecture , 1996 .

[17]  James I. Garrels,et al.  The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae , 1998, Nucleic Acids Res..

[18]  S. Brunak,et al.  SHORT COMMUNICATION Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites , 1997 .

[19]  Peter B. McGarvey,et al.  Protein Information Resource: a community resource for expert annotation of protein data , 2001, Nucleic Acids Res..

[20]  Steven Henikoff,et al.  PATMAT: a searching and extraction program for sequence, pattern and block queries and databases , 1992, Comput. Appl. Biosci..