Combining Computation with Database Access in Biomolecular Computing

Protein structure analysis is a very important application for database technology, with industrial spinoffs. It is also very demanding because sequence search is so very different from 3-D structure, search. We have developed a common data model for integrating sequence and structure data, and for relating different sequence numbering schemes to 3-D structures. We Lave been able to use our high-level functional language Daplex to express queries of both kinds, but using alternative storage schemas to get good performance in a way that is transparent to the user. Daplex functions can be stored in the database and associated with sub-types in an object-oriented fashion. This architecture allows practising scientists to combine complex geometric calculations with data access, which they need in order to search for complex relationships in the data which may validate or modify their hypotheses.

[1]  Norman W. Paton,et al.  Object-oriented databases - a semantic data model approach , 1992, Prentice Hall International Series in Computer Science.

[2]  Peter M. D. Gray,et al.  Optimization of Methods in a Navigational Query Language , 1991, DOOD.

[3]  G Vriend,et al.  Parameter relation rows: a query system for protein structure function relationships. , 1990, Protein engineering.

[4]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[5]  A. Lesk,et al.  Canonical structures for the hypervariable regions of immunoglobulins. , 1987, Journal of molecular biology.

[6]  Peter M. D. Gray,et al.  Design of a 3D User Interface to a Database , 1993, Workshop on Database Issues for Data Visualization.

[7]  M J Sternberg,et al.  A relational database of protein structures designed for flexible enquiries about conformation. , 1989, Protein engineering.

[8]  A. Lesk,et al.  Conformations of immunoglobulin hypervariable regions , 1989, Nature.

[9]  G J Kemp,et al.  An object-oriented database for protein structure analysis. , 1990, Protein engineering.

[10]  Peter M. D. Gray,et al.  Efficient Access to FDM Objects Stored in a Relational Database , 1994, BNCOD.

[11]  S J Wodak,et al.  Sesam: A relational database for structure and sequence of macromolecules , 1991, Proteins.

[12]  Graham J. L. Kemp,et al.  Protein modelling: a design application of an object-oriented database , 1991 .

[13]  A. Mclachlan Gene duplications in the structural evolution of chymotrypsin. , 1979, Journal of molecular biology.

[14]  B. Liu,et al.  [Effect of BN52021 on platelet activating factor induced aggregation of psoriatic polymorphonuclear neutrophils]. , 1994, Zhonghua yi xue za zhi.

[15]  David W. Shipman,et al.  The functional data model and the data languages DAPLEX , 1981, TODS.

[16]  David W. Shipman The functional data model and the data language DAPLEX , 1979, SIGMOD '79.