Sangam: A Framework for Modeling Heterogeneous Database Transformations

A broad spectrum of data is available on-line in distinct heterogeneous sources, and stored under different formats. As the number of systems that utilize the heterogeneous data sources grows, the importance of data translation and conversion mechanisms increases greatly. The goal of our work is a to design a framework that simplifies the task of translation specification and execution. Translation specification between the source and the target schema can be accomplished via (1) the discovery of matches between the source and the target schemata; (2) the application of a pre-defined translation templates; or (3) via manual user specification. In this paper we present a flexible, extensible and re-usable translation modeling framework wherein users can (1) explicitly model the translations between schemas; (2) compose translations from an existing library of modeled translation patterns; (3) choose from a library of translation operators; (4) generate translation models based on a match process; (5) edit such translation models; and (5) for all of these translation models, choose automated execution strategies that transform the source schema and data to the desired target schema and data. In this paper, we present the system architecture for such a translation modeling framework.

[1]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[2]  Elke A. Rundensteiner,et al.  Sangam - a solution to support multiple data models, their mappings and maintenance , 2001, SIGMOD '01.

[3]  Stefan Göbel,et al.  Development of meta databases for geospatial data in the WWW , 1998, GIS '98.

[4]  Anthony Kosky,et al.  WOL: a language for database transformations and constraints , 1997, Proceedings 13th International Conference on Data Engineering.

[5]  Elke A. Rundensteiner,et al.  Clock: synchronizing internal relational storage with external XML documents , 2001, Proceedings Eleventh International Workshop on Research Issues in Data Engineering. Document Management for Data Intensive Business and Scientific Applications. RIDE 2001.

[6]  Erhard Rahm,et al.  Data Warehouse Scenarios for Model Management , 2000, ER.

[7]  Elke A. Rundensteiner,et al.  Gangam: a transformation modeling framework , 2003, Eighth International Conference on Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings..

[8]  Mike P. Papazoglou,et al.  A semantic meta-modelling approach to schema transformation , 1995, CIKM '95.

[9]  Paolo Atzeni,et al.  Management of Multiple Models in an Extensible Database Design Tool , 1996, EDBT.

[10]  James Martin,et al.  Object-oriented analysis and design , 1992 .

[11]  Jean-Marc Nerson,et al.  Object-Oriented Analysis and Design , 1992, TOOLS.

[12]  Vincent Y. Lum,et al.  CONVERT: a high level translation definition language for data conversion , 1975, CACM.

[13]  Nick Roussopoulos,et al.  Integration of Data, Schema and Meta-Schema in the Context of Self-Documenting Data Models , 1983, ER.