Design and use of a software framework to obtain information derived from macromolecular structure data

The broad goal of this work has been to design, implement, and begin to use an extensible object-oriented framework that represents biological macromolecules and can be queried to better understand the biological significance of these complex molecules. Methods can be added to the framework (by us and by others), with functionality ranging from simple verification of a single 3D coordinate set derived from X-ray crystallography or NMR studies to detailed interrogation of features common to a number of structures represented by one or more disparate sets of experimental data. Features of the current framework, which have been described in detail elsewhere, are summarized to define the context. Two examples of using the framework follow, along with a discussion of how it is being expanded.
