An Ontology based Hybrid Approach to Derive Multidimensional Schema for Data warehouse

The diversity of data sources has made data integration a challenging task. Data warehouse systems play a vital role in integrating data to support important business decisions. Data within a data warehouse is organized as a multidimensional schema. Many existing works design the multidimensional schema for a data warehouse from requirements, from data sources, or from both. These are either manual or automated approaches that work only with relational sources. Because today's data warehouse systems must also deal with semi-structured and unstructured sources, the design task becomes much more tedious. Recently, ontologies have proved useful in a range of data integration projects. Using an ontology can help resolve the syntactic and semantic conflicts that arise from heterogeneous sources. It also provides a way to automate the design of the multidimensional schema and to populate the data warehouse in a more meaningful way. This paper proposes an ontology-based framework for the design of multidimensional schemas. Our framework uses a hybrid approach in which requirements and data sources are reconciled at an early stage of design. We adopt ontology reasoning to automatically derive multidimensional elements such as facts and dimensions.
