A methodology for building a data warehouse in a scientific environment

Rational drug design is an example where integrated access to heterogeneous scientific data is urgently needed, as it becomes rapidly available due to new experimental and computational techniques. This is currently problematic as data is scattered over heterogeneous, mostly file-based legacy databases, the data is erroneous, incomplete, and inconsistent in representation and content. The authors have developed a methodology, including metadata specification data transformation and software architecture, that supports data analysis and preparation for building a data warehouse using object-oriented database technology. With this methodology the system ReLiBase has been realized, that allows querying and visualization of the drug-design related data from heterogeneous resources.

[1]  François Bancilhon Object databases , 1996, CSUR.

[2]  Jean Thierry-Mieg,et al.  The ACEDB genome database , 1994 .

[3]  A. Bairoch,et al.  The SWISS-PROT protein sequence data bank. , 1991, Nucleic acids research.

[4]  Limsoon Wong,et al.  A Data Transformation System for Biological Data Sources , 1995, VLDB.

[5]  Béatrice Finance,et al.  IRO-DB: a distributed system federating object and relational databases , 1995 .

[6]  Erich J. Neuhold,et al.  Database integration using the open object-oriented database system VODAK , 1995 .

[7]  Karl Aberer,et al.  The Use of Object-Oriented Data Models for Biomolecular Databases , 1995 .

[8]  Karl Aberer,et al.  Constituting a Receptor-Ligand Database from Quality-Enriched Data , 1995, ISMB 1995.

[9]  Serge Abiteboul,et al.  Querying and Updating the File , 1993, VLDB.

[10]  Karl Aberer,et al.  Flexible Design and Efficient Implementation of a Hypermedia Document Database System by Tailoring Semantic Relationships , 1995, DS-6.

[11]  Luca Cabibbo On the Power of Stratified Logic Programs with Value Invention for Expressing Database Transformations , 1995, ICDT.

[12]  Thure Etzold,et al.  SRS - an indexing and retrieval tool for flat file data libraries , 1993, Comput. Appl. Biosci..

[13]  Peter M. D. Gray,et al.  Combining Computation with Database Access in Biomolecular Computing , 1994, ADB.

[14]  K. Nishikawa,et al.  Constructing a protein mutant database. , 1994, Protein engineering.

[15]  Qing Li,et al.  A Framework for Object Migration in Object-Oriented Databases , 1994, Data Knowl. Eng..

[16]  Nathan Goodman An Object-Oriented DBMS War Story: Developing a Genome Mapping Database in C++ , 1995, Modern Database Systems.

[17]  O Ritter,et al.  Prototype implementation of the integrated genomic database. , 1994, Computers and biomedical research, an international journal.

[18]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[19]  W. H. Inmon,et al.  Rdb/VMS: Developing the Data Warehouse , 1993 .

[20]  Karl Aberer,et al.  Semantic query optimization for methods in object-oriented database systems , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[21]  Karl Aberer,et al.  Semantic Optimization of Biomolecular Queries in Object-oriented Database Systems , 1995 .

[22]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[23]  Marc H. Scholl,et al.  Federation and Stepwise Reduction of Database Systems , 1994, ADB.

[24]  Anthony Kosky,et al.  Semantics of Database Transformations , 1995, Semantics in Databases.

[25]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.