ASM-based design of data warehouses and on-line analytical processing systems

On-line analytical processing (OLAP) systems support analytical tasks in businesses. As these tasks do not depend on the latest updates by transactions, it is assumed that the data used in OLAP systems are kept in a data warehouse, which separates the input from operational databases from the output to dialogue interfaces for OLAP. Data warehouses and OLAP systems are a promising area for the application of Abstract State Machines (ASMs). This article presents a layered ground model specification for data warehouses and OLAP systems that is based explicitly on the fundamental idea of separating input from operational databases from output to OLAP systems. On this basis we start defining formal refinement rules for such systems. As these refinement rules are formally correct, they enable a formal method for the high-quality design of data warehouses and OLAP systems that can be applied without knowing the mathematical details of the ASM formalism. Furthermore, we discuss pragmatic guidelines for the application of such rules.
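The separation of input from output described above can be illustrated with a minimal Python sketch. It is not the ground model from the article: the state layout, the rule names (`refresh_rule`, `olap_view_rule`), the `step` function and the sample rows are all assumptions made for illustration. The sketch borrows only the ASM idea that a rule reads the current state and yields a set of updates, which are then applied together.

```python
from collections import defaultdict

# Hypothetical three-layer state: operational rows, warehouse rows, OLAP view.
# Rows are (id, region, amount) tuples; this layout is an assumption.

def refresh_rule(state):
    """Input side: propose updates copying unseen operational rows into the warehouse."""
    known = set(state["warehouse"])
    new_rows = [row for row in state["operational"] if row not in known]
    return {"warehouse": state["warehouse"] + new_rows}

def olap_view_rule(state):
    """Output side: propose an aggregated view computed from the warehouse only."""
    totals = defaultdict(int)
    for _row_id, region, amount in state["warehouse"]:
        totals[region] += amount
    return {"olap_view": dict(totals)}

def step(state, rule):
    """Evaluate a rule on the current state and apply its updates to a copy."""
    updates = rule(state)
    new_state = dict(state)
    new_state.update(updates)
    return new_state

if __name__ == "__main__":
    state = {
        "operational": [(1, "north", 10), (2, "south", 5), (3, "north", 7)],
        "warehouse": [],
        "olap_view": {},
    }
    state = step(state, refresh_rule)    # operational databases -> warehouse
    state = step(state, olap_view_rule)  # warehouse -> OLAP view
    print(state["olap_view"])
```

Because `olap_view_rule` never reads the operational layer, a new operational row becomes visible to analysis only after the next refresh step, which mirrors the assumption that OLAP tasks do not depend on the latest transactional updates.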
