Query-based data warehousing tool

Data warehousing is an essential element of decision support. It aims at enabling the knowledge user to make better and faster daily business decisions. In order to supply a decisional database, meta-data is needed to enable the communication between various function areas of the warehouse and an ETL tool (Extraction, Transformation, and Load) is needed to define the warehousing process. The developers use a mapping guideline to specify the ETL tool with the mapping expression of each attribute. In this paper, we will define a model covering different types of mapping expressions. We will use this model to create an active ETL tool. In our approach, we use queries to achieve the warehousing process. SQL queries will be used to represent the mapping between the source and the target data. Thus, we allow DBMS to play an expanded role as a data transformation engine as well as a data store. This approach enables a complete interaction between mapping meta-data and the warehousing tool. In addition, this paper investigates the efficiency for a Query-based data warehousing tool. It describes a query generator for reusable and more efficient data warehouse (DW) processing. Besides exposing the advantages of this approach, this paper shows a case study based on real scale commercial data to verify our tool features.

[1]  Christoph Quix,et al.  Repository Support for Data Warehouse Evolution , 1999, DMDW.

[2]  Erhard Rahm,et al.  An Integrative and Uniform Model for Metadata Management in Data Warehousing Environments , 1999, DMDW.

[3]  Aïcha-Nabila Benharkat,et al.  A Translation Procedure to Clarify the Relationship Between Ontology and XML Schema , 2001, International Conference on Internet Computing.

[4]  John C. Grundy,et al.  Generating EDI message translations from visual specifications , 2001, Proceedings 16th Annual International Conference on Automated Software Engineering (ASE 2001).

[5]  Erhard Rahm,et al.  Generic Schema Matching with Cupid , 2001, VLDB.

[6]  Laura M. Haas,et al.  The Clio project: managing heterogeneity , 2001, SGMD.

[7]  Dan Corbett,et al.  An Ontology of Metadata for a Data Warehouse Represented in Description Logics , 1999, CODAS.

[8]  E. Alsene The computer integration of the enterprise , 1999 .

[9]  Michael Stonebraker,et al.  Content integration for e-business , 2001, SIGMOD '01.

[10]  Erhard Rahm,et al.  On Metadata Interoperability in Data Warehouses , 2000 .

[11]  Amihai Motro,et al.  Database Schema Matching Using Machine Learning with Feature Selection , 2002, CAiSE.

[12]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[13]  Manfred A. Jeusfeld,et al.  Proceedings of the Intl. Workshop on Design and Management of Data Warehouses, DMDW'99, Heidelberg, Germany, June 14-15, 1999 , 1999, Design and Management of Data Warehouses.

[14]  Laura M. Haas,et al.  Schema Mapping as Query Discovery , 2000, VLDB.

[15]  Cláudio de Souza Baptista,et al.  Metadata for an Extensible Data Warehouse Server , 2001, Workshop on Information Integration on the Web.

[16]  Martin Staudt,et al.  Metadata Management and Data Warehousing , 1999 .