The Impact of Structural Genomics on the Protein Data Bank

The advent of structural genomics presents new challenges to the archive of biomacromolecular structures — the Protein Data Bank (PDB). As technologies involved in structure determination have advanced, both the number and size of structures available in the PDB have increased rapidly. The structural genomics initiatives are creating a large amount of data that needs to be tracked, archived, and made easily available. The PDB has developed tools to facilitate the rapid deposition of data produced by the structural genomics initiatives and has created databases to track the progress of the work.

[1]  Philip E. Bourne,et al.  [30] Macromolecular crystallographic information file , 1997 .

[2]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[3]  Tim J. P. Hubbard,et al.  SCOP: a Structural Classification of Proteins database , 2000, Nucleic Acids Res..

[4]  M. Perutz,et al.  Three dimensional fourier synthesis of horse deoxyhaemoglobin at 2.8 Angstrom units resolution. , 1970, Nature.

[5]  H. Watson,et al.  The Stereochemistry of the Protein Myoglobin , 1976 .

[6]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[7]  Philip E. Bourne,et al.  The Protein Data Bank: A Case Study in Management of Community Data , 2004 .

[8]  Frances M. G. Pearl,et al.  The CATH extended protein‐family database: Providing structural annotations for genome sequences , 2002, Protein science : a publication of the Protein Society.

[9]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Philip E. Bourne,et al.  The Macromolecular Crystallographic Information File (mmCIF) , 2001 .

[11]  M. Perutz,et al.  Structure of Hæmoglobin: A Three-Dimensional Fourier Synthesis at 5.5-Å. Resolution, Obtained by X-Ray Analysis , 1960, Nature.

[12]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[13]  Tim J. P. Hubbard,et al.  SCOP database in 2002: refinements accommodate structural genomics , 2002, Nucleic Acids Res..

[14]  Zukang Feng,et al.  Validation of protein structures for protein data bank. , 2003, Methods in enzymology.

[15]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[16]  W. Hendrickson Determination of macromolecular structures from anomalous diffraction of synchrotron radiation. , 1991, Science.