Protein Data Bank (PDB): The Single Global Macromolecular Structure Archive.

The Protein Data Bank (PDB)--the single global repository of experimentally determined 3D structures of biological macromolecules and their complexes--was established in 1971, becoming the first open-access digital resource in the biological sciences. The PDB archive currently houses ~130,000 entries (May 2017). It is managed by the Worldwide Protein Data Bank organization (wwPDB; wwpdb.org), which includes the RCSB Protein Data Bank (RCSB PDB; rcsb.org), the Protein Data Bank Japan (PDBj; pdbj.org), the Protein Data Bank in Europe (PDBe; pdbe.org), and BioMagResBank (BMRB; www.bmrb.wisc.edu). The four wwPDB partners operate a unified global software system that enforces community-agreed data standards and supports data Deposition, Biocuration, and Validation of ~11,000 new PDB entries annually (deposit.wwpdb.org). The RCSB PDB currently acts as the archive keeper, ensuring disaster recovery of PDB data and coordinating weekly updates. wwPDB partners disseminate the same archival data from multiple FTP sites, while operating complementary websites that provide their own views of PDB data with selected value-added information and links to related data resources. At present, the PDB archives experimental data, associated metadata, and 3D-atomic level structural models derived from three well-established methods: crystallography, nuclear magnetic resonance spectroscopy (NMR), and electron microscopy (3DEM). wwPDB partners are working closely with experts in related experimental areas (small-angle scattering, chemical cross-linking/mass spectrometry, Forster energy resonance transfer or FRET, etc.) to establish a federation of data resources that will support sustainable archiving and validation of 3D structural models and experimental data derived from integrative or hybrid methods.

[1]  M. Baker,et al.  Outcome of the First Electron Microscopy Validation Task Force Meeting , 2012, Structure.

[2]  Zukang Feng,et al.  The chemical component dictionary: complete descriptions of constituent molecules in experimentally determined 3D macromolecules in the Protein Data Bank , 2015, Bioinform..

[3]  M. Perutz,et al.  Structure of Hæmoglobin: A Three-Dimensional Fourier Synthesis at 5.5-Å. Resolution, Obtained by X-Ray Analysis , 1960, Nature.

[4]  Haruki Nakamura,et al.  Outcome of the First wwPDB/CCDC/D3R Ligand Validation Workshop. , 2016, Structure.

[5]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[6]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[7]  M. F. PERUTZ,et al.  Three Dimensional Fourier Synthesis of Horse Deoxyhaemoglobin at 2.8 Å Resolution , 1970, Nature.

[8]  Ardan Patwardhan,et al.  EMPIAR: a public archive for raw electron microscopy image data , 2016, Nature Methods.

[9]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[10]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[11]  G. Marius Clore,et al.  Hybrid Approaches to Structural Characterization of Conformational Ensembles of Complex Macromolecular Systems Combining NMR Residual Dipolar Couplings and Solution X-ray Scattering. , 2016, Chemical reviews.

[12]  I. Bruno,et al.  Cambridge Structural Database , 2002 .

[13]  Sameer Velankar,et al.  PDBe: Protein Data Bank in Europe , 2011, Nucleic Acids Res..

[14]  Aaron A Hoskins,et al.  Structural Analysis of Multi-Helical RNAs by NMR-SAXS/WAXS: Application to the U4/U6 di-snRNA. , 2016, Journal of molecular biology.

[15]  G. Montelione,et al.  Recommendations of the wwPDB NMR Validation Task Force. , 2013, Structure.

[16]  N. O. Manning,et al.  The protein data bank , 1999, Genetica.

[17]  Ruedi Aebersold,et al.  Molecular Architecture of the 40SeIF1eIF3 Translation Initiation Complex. , 2014 .

[18]  J L Sussman,et al.  AutoDep: a web-based system for deposition and validation of macromolecular structural -information. , 2000, Acta crystallographica. Section D, Biological crystallography.

[19]  G J Barton,et al.  Deposition of macromolecular structures. , 1998, Acta crystallographica. Section D, Biological crystallography.

[20]  Akira R. Kinjo,et al.  Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format , 2011, Nucleic Acids Res..

[21]  Annalisa Pastore,et al.  Application of Nuclear Magnetic Resonance and Hybrid Methods to Structure Determination of Complex Systems. , 2016, Advances in experimental medicine and biology.

[22]  J L Sussman,et al.  Protein Data Bank (PDB): database of three-dimensional structural information of biological macromolecules. , 1998, Acta crystallographica. Section D, Biological crystallography.

[23]  Gregory Kucherov,et al.  NORINE: a database of nonribosomal peptides , 2007, Nucleic Acids Res..

[24]  Roland L Dunbrack,et al.  Outcome of a workshop on archiving structural models of biological macromolecules. , 2006, Structure.

[25]  Akira R. Kinjo,et al.  Protein structure databases with new web services for structural biology and biomedical research , 2008, Briefings Bioinform..

[26]  E F Meyer,et al.  The first years of the Protein Data Bank , 1997, Protein science : a publication of the Protein Society.

[27]  John L. Markley,et al.  NRG-CING: integrated validation reports of remediated experimental biomolecular NMR data and coordinates in wwPDB , 2011, Nucleic Acids Res..

[28]  E. Ulrich,et al.  Creation of a nuclear magnetic resonance data repository and literature database. , 1989, Protein sequences & data analysis.

[29]  Michael Nilges,et al.  NMR Exchange Format: a unified and open standard for representation of NMR restraint data , 2015, Nature Structural &Molecular Biology.

[30]  Haruki Nakamura,et al.  PDBML: the representation of archival macromolecular structure data in XML , 2005, Bioinform..

[31]  Randy J. Read,et al.  A New Generation of Crystallographic Validation Tools for the Protein Data Bank , 2011, Structure.

[32]  R. Aebersold,et al.  Molecular Architecture of the 40S⋅eIF1⋅eIF3 Translation Initiation Complex , 2014, Cell.

[33]  Zukang Feng,et al.  Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank , 2014, Biopolymers.

[34]  John L. Markley,et al.  The NMR restraints grid at BMRB for 5,266 protein and nucleic acid PDB entries , 2009, Journal of biomolecular NMR.

[35]  Abhik Mukhopadhyay,et al.  PDBe: improved accessibility of macromolecular structure data from PDB and EMDB , 2015, Nucleic Acids Res..

[36]  John L. Markley,et al.  STAR/CIF macromolecular NMR data dictionaries and data file formats , 1996 .

[37]  Haruki Nakamura,et al.  Outcome of the First wwPDB Hybrid/Integrative Methods Task Force Workshop. , 2015, Structure.

[38]  Akira R. Kinjo,et al.  Publication of nuclear magnetic resonance experimental data with semantic web technology and the application thereof to biomedical research of proteins , 2016, Journal of Biomedical Semantics.

[39]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[40]  John D. Westbrook,et al.  The Protein Model Portal , 2008, Journal of Structural and Functional Genomics.

[41]  Dmitri I. Svergun,et al.  SASBDB, a repository for biological small-angle scattering data , 2014, Nucleic Acids Res..

[42]  Oleg V. Tsodikov,et al.  Data publication with the structural biology data grid supports live analysis , 2016, Nature Communications.

[43]  Juergen Haas,et al.  The Protein Model Portal—a comprehensive resource for protein structure and model information , 2013, Database J. Biol. Databases Curation.

[44]  Sameer Velankar,et al.  E-MSD: improving data deposition and structure quality , 2005, Nucleic Acids Res..

[45]  R. G. Hart,et al.  Structure of Myoglobin: A Three-Dimensional Fourier Synthesis at 2 Å. Resolution , 1960, Nature.

[46]  H. Berman The Protein Data Bank: a historical perspective. , 2008, Acta crystallographica. Section A, Foundations of crystallography.

[47]  John D. Westbrook,et al.  EMDataBank unified data resource for 3DEM , 2013, Nucleic Acids Res..

[48]  D. I. Svergun,et al.  sasCIF: an extension of core Crystallographic Information File for SAS , 2000 .

[49]  Haruki Nakamura,et al.  BioMagResBank (BMRB) as a partner in the Worldwide Protein Data Bank (wwPDB): new policies affecting biomolecular NMR depositions , 2008, Journal of biomolecular NMR.

[50]  Jill Trewhella,et al.  Report of the wwPDB Small-Angle Scattering Task Force: data requirements for biomolecular modeling and the PDB. , 2013, Structure.