Ontology based ETL process for creation of ontological data warehouse

Extraction, Transformation and Loading (ETL) is a key process of data warehouse building. It integrates data sources with diverse features and structures. Numerous approaches and implementations of ETL have been introduced. However, they still have the following disadvantages: human-dependence, information integration only in syntactic levels, incomplete the homogeneity solution, difficulty to install and configure, etc. In this paper, we propose an alternative approach to the ETL process by attacking the homogeneity in data sources with an ontology-based methodology. Our approach can overcome the drawbacks of most existing approaches; as it automates the key activities of the process, such as: extraction of metainformation, generation of logical and physical data models, and transformation of information.

[1]  Daniel Pol,et al.  Principles for an ETL Benchmark , 2009, TPCTC.

[2]  Mária Bieliková,et al.  An approach to object-ontology mapping , 2007 .

[3]  Ralph Kimball,et al.  The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data , 2004 .

[4]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[5]  Chengqi Zhang,et al.  Ontology-based integration of business intelligence , 2006, Web Intell. Agent Syst..

[6]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[7]  Kalina Bontcheva,et al.  Ontology-Based Information Extraction for Business Intelligence , 2007, ISWC/ASWC.

[8]  Steffen Staab,et al.  OIL: The Ontology Inference Layer , 2000 .

[9]  W. H. Inmon,et al.  Building the Data Warehouse,3rd Edition , 2002 .

[10]  Panos Vassiliadis,et al.  A Framework for the Design of ETL Scenarios , 2003, CAiSE.

[11]  David A. Ferrucci,et al.  UIMA: an architectural approach to unstructured information processing in the corporate research environment , 2004, Natural Language Engineering.

[12]  Narasimhaiah Gorla,et al.  Features to consider in a data warehousing system , 2003, CACM.

[13]  Huan Liu,et al.  Resource description framework: metadata and its applications , 2001, SKDD.

[14]  Longbing Cao,et al.  Ontological Engineering in Data Warehousing , 2006, APWeb.

[15]  Dan Brickley,et al.  Resource Description Framework (RDF) Model and Syntax Specification , 2002 .

[16]  Steffen Staab,et al.  Knowledge Processes and Ontologies , 2001, IEEE Intell. Syst..

[17]  Hilary Cheng,et al.  An ontology-based business intelligence application in a financial knowledge management system , 2009, Expert Syst. Appl..