Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration

Using an open-access distribution model, the Crystallography Open Database (COD, http://www.crystallography.net) collects all known ‘small molecule / small to medium sized unit cell’ crystal structures and makes them freely available on the Internet. At the time of writing, the COD has aggregated ∼150 000 structures, offering basic search capabilities and the possibility of downloading the whole database, or parts thereof, using a variety of standard open communication protocols. A newly developed website allows all registered users to deposit published structures and, as personal communications or pre-publication depositions, structures that are so far unpublished. Such a setup enables many users to extend the COD simultaneously. This increases the potential for growth of the COD and is the first step towards establishing a worldwide Internet-based collaborative platform dedicated to the collection and curation of structural knowledge.
