The PRINTS Database: A Resource for Identification of Protein Families

The PRINTS database houses a collection of protein fingerprints, which may be used to assign family and functional attributes to uncharacterised sequences, such as those currently emanating from the various genome-sequencing projects. The April 2002 release includes 1,700 family fingerprints, encoding approximately 10,500 motifs, covering a range of globular and membrane proteins, modular polypeptides and so on. Fingerprints are groups of conserved motifs that, taken together, provide diagnostic protein family signatures. They derive much of their potency from the biological context afforded by matching motif neighbours; this makes them at once more flexible and powerful than single-motif approaches. The technique further departs from other pattern-matching methods by readily allowing the creation of fingerprints at superfamily-, family- and subfamily-specific levels, thereby allowing more fine-grained diagnoses. Here, we provide an overview of the method of protein fingerprinting and how the results of fingerprint analyses are used to build PRINTS and its relational cousin, PRINTS-S.

[1]  Terri K. Attwood,et al.  PRECIS: Protein reports engineered from concise information in SWISS-PROT , 2003, Bioinform..

[2]  Kay Hofmann,et al.  Protein classification and functional assignment , 1998 .

[3]  Shmuel Pietrokovski,et al.  Increased coverage of protein families with the Blocks Database servers , 2000, Nucleic Acids Res..

[4]  Alex Bateman,et al.  The InterPro database, an integrated documentation resource for protein families, domains and functional sites , 2001, Nucleic Acids Res..

[5]  Amos Bairoch,et al.  The PROSITE database, its status in 2002 , 2002, Nucleic Acids Res..

[6]  T K Attwood,et al.  A compendium of specific motifs for diagnosing GPCR subtypes. , 2001, Trends in pharmacological sciences.

[7]  Terri K. Attwood,et al.  The Role of Pattern Databases in Sequence Analysis , 2000, Briefings Bioinform..

[8]  Terri K. Attwood,et al.  FingerPRINTScan: intelligent searching of the PRINTS motif database , 1999, Bioinform..

[9]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[10]  Douglas L. Brutlag,et al.  The EMOTIF database , 2001, Nucleic Acids Res..

[11]  Terri K. Attwood,et al.  PRINTS-S: the database formerly known as PRINTS , 2000, Nucleic Acids Res..

[12]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[13]  David Scott,et al.  The PRINTS Database of Protein Fingerprints: A Novel Information Resource for Computational Molecular Biology , 1997, J. Chem. Inf. Comput. Sci..

[14]  T. Attwood,et al.  PRINTS--a protein motif fingerprint database. , 1994, Protein engineering.

[15]  T. K. Attwood,et al.  ADSP - a new package for computational sequence analysis , 1992, Comput. Appl. Biosci..

[16]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Jérôme Gouzy,et al.  Recent improvements of the ProDom database of protein domain families , 1999, Nucleic Acids Res..

[18]  T K Attwood,et al.  Fingerprinting G-protein-coupled receptors. , 1994, Protein engineering.

[19]  Robert D. Finn,et al.  Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins , 1999, Nucleic Acids Res..

[20]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[21]  B. Rost,et al.  Marrying structure and genomics. , 1998, Structure.

[22]  T K Attwood,et al.  Deriving structural and functional insights from a ligand-based hierarchical classification of G protein-coupled receptors. , 2002, Protein engineering.

[23]  Terri K. Attwood,et al.  PRINTS and PRINTS-S shed light on protein ancestry , 2002, Nucleic Acids Res..

[24]  Terri K. Attwood,et al.  BLAST PRINTS - alternative perspectives on sequence similarity , 1999, Bioinform..

[25]  W. Pearson Empirical statistical estimates for sequence similarity searches. , 1998, Journal of molecular biology.