The DASCOSA-DB Grid Database System

Computational science applications performing distributed computations using grid networks are now emerging. These applications have new and demanding requirements for efficient query processing. To meet these requirements, we have developed the DASCOSA-DB distributed database system. In this chapter, a detailed overview of the architecture and implementation of DASCOSA-DB is given, as well as a description of novel features developed to better support typical data-intensive applications running on a grid system: fault-tolerant query processing, dynamic refragmentation, allocation and replication of data fragments, and distributed semantic caching.

[1]  Peter A. Boncz,et al.  AmbientDB: Relational Query Processing in a P2P Network , 2003, DBISP2P.

[2]  Kjetil Nørvåg,et al.  DYFRAM: dynamic fragmentation and replica management in distributed database systems , 2010, Distributed and Parallel Databases.

[3]  Beng Chin Ooi,et al.  PeerDB: a P2P-based system for distributed data sharing , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[4]  Jim Smith,et al.  Fault-tolerance in distributed query processing , 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).

[5]  Zachary G. Ives,et al.  Reliable storage and querying for collaborative data sharing systems , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[6]  John Mylopoulos,et al.  Data Sharing in the Hyperion Peer Database System , 2005, VLDB.

[7]  Scott Shenker,et al.  Querying the Internet with PIER , 2003, VLDB.

[8]  Kjetil Nørvåg,et al.  Site-autonomous distributed semantic caching , 2011, SAC.

[9]  A. N. Wilschut,et al.  Dataflow query execution in a parallel main-memory environment , 1991, Distributed and Parallel Databases.

[10]  Kjetil Nørvåg,et al.  PROQID: partial restarts of queries in distributed databases , 2008, CIKM '08.

[11]  Patrick Valduriez,et al.  Design and Implementation of Atlas P2P Architecture , 2006 .

[12]  Kjetil Nørvåg,et al.  Efficient and Robust Database Support for Data-Intensive Applications in Dynamic Environments , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[13]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[14]  Alfons Kemper,et al.  ObjectGlobe: Ubiquitous query processing on the Internet , 2001, The VLDB Journal.

[15]  Scott Shenker,et al.  Complex Queries in Dht-based Peer-to-peer Networks , 2002 .

[16]  B Praveen Kumar,et al.  Mariposa a Wide-Area Distributed Database System , 2010, ICCA 2010.

[17]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[18]  Donald Kossmann,et al.  The state of the art in distributed query processing , 2000, CSUR.

[19]  Marcel Waldvogel,et al.  Bringing efficient advanced queries to distributed hash tables , 2004, 29th Annual IEEE International Conference on Local Computer Networks.

[20]  Robbert van Renesse,et al.  Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining , 2003, TOCS.

[21]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[22]  Dan Suciu,et al.  The Piazza peer data management system , 2004, IEEE Transactions on Knowledge and Data Engineering.

[23]  Christos Doulkeridis,et al.  Efficient Distributed Top-k Query Processing with Caching , 2011, DASFAA.