Semantic Integration and Querying of Heterogeneous Data Sources Using a Hypergraph Data Model

Information integration in the World Wide Web has evolved to a new framework where the information is represented and manipulated using a wide range of modeling languages. Current approaches to data integration use wrappers to convert the different modeling languages into a common data model. In this work we use a nested hypergraph based data model (called HDM) as a common data model for integrating different structured or semi-structured data. We present a hypergraph query language (HQL) that allows the integration of the wrapped data sources through the creation of views for mediators, and the querying of the wrapped data sources and the mediator views by the end users. We also show that HQL queries (views) can be constructed from other views and/or source schemas using a set of primitive transformations. Our integration architecture is flexible and allows some (or all) of the views in a mediator to be materialized.

[1]  Jennifer Widom,et al.  View maintenance in a warehousing environment , 1995, SIGMOD '95.

[2]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[3]  Alexandra Poulovassilis,et al.  A Semantic Approach to Integrating XML and Structured Data Sources , 2001, CAiSE.

[4]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[5]  Timos K. Sellis,et al.  Data Warehouse Configuration , 1997, VLDB.

[6]  Tiziana Catarci,et al.  Graphical interaction with heterogeneous databases , 1997, The VLDB Journal.

[7]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[8]  Mary Roth,et al.  Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources , 1997, VLDB.

[9]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[10]  Patrick Valduriez,et al.  Scaling Access to Heterogeneous Data Sources with DISCO , 1998, IEEE Trans. Knowl. Data Eng..

[11]  Letizia Tanca,et al.  G-Log: A Declarative Graphical Query Language , 1991, DOOD.

[12]  Gang Zhou,et al.  A framework for supporting data integration using the materialized and virtual approaches , 1996, SIGMOD '96.

[13]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[14]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[15]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[16]  Alberto O. Mendelzon,et al.  Architecture and Applications of the Hy+ Visualization System , 1994, IBM Syst. J..

[17]  Alberto O. Mendelzon,et al.  GraphLog: a visual formalism for real life recursion , 1990, PODS '90.

[18]  Klaus R. Dittrich,et al.  An overview and classification of mediated query systems , 1999, SGMD.

[19]  Hector Garcia-Molina,et al.  Template-based wrappers in the TSIMMIS system , 1997, SIGMOD '97.

[20]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[21]  Marc Gyssens,et al.  A graph-oriented object model for database end-user interfaces , 1990, SIGMOD '90.

[22]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[23]  Peter J. H. King,et al.  Syntax and Semantics of Gql, a graphical query language , 1995, J. Vis. Lang. Comput..

[24]  Alberto O. Mendelzon,et al.  Hy+: a Hygraph-based query and visualization system , 1993, SIGMOD '93.

[25]  Jennifer Widom,et al.  Research problems in data warehousing , 1995, CIKM '95.

[26]  Alexandra Poulovassilis,et al.  A General Formal Framework for Schema Transformation , 1998, Data Knowl. Eng..

[27]  Alexandra Poulovassilis,et al.  A Uniform Approach to Inter-model Transformations , 1999, CAiSE.