Open data management solutions for problem solving environments: application of distributed authoring and versioning to the Extensible Computational Chemistry Environment

Next-generation problem solving environments (PSEs) promise significant advances over those now available. They will span scientific disciplines and incorporate collaboration capabilities. They will host feature-detection and other agents, allow data mining and pedigree tracking, and provide access from a wide range of devices. Fundamental changes in PSE architecture are required to realize these and other PSE goals. This paper focuses on data management and recommends an approach based on open, metadata-driven repositories with loosely defined, dynamic schemas. The benefits of this approach are discussed, and the redesign of the data storage architecture of the Extensible Computational Chemistry Environment (Ecce) to use such a repository, built on the distributed authoring and versioning (DAV) standard, is described. The suitability of DAV for scientific data, the mapping of the Ecce schema to DAV, and promising initial results are presented.
