Matching of Ontologies with XML Schemas Using a Generic Metamodel

Schema matching is the task of automatically computing correspondences between schema elements. A multitude of schema matching approaches exists for various scenarios using syntactic, semantic, or instance information. The schema matching problem is aggravated by the fact that models to be matched are often represented in different modeling languages, e.g. OWL, XML Schema, or SQL DDL. Consequently, besides being able to match models in the same metamodel, a schema matching tool must be able to compute reasonable results when matching models in heterogeneous modeling languages. Therefore, we developed a matching component as a part of our model management system GeRoMeSuite which is based on our generic metamodel GeRoMe. As GeRoMe provides a unified representation of models, the matcher is able to match models represented in different languages with each other. In this paper, we will show in particular the results for matching XML Schemas with OWL ontologies as it is often required for the semantic annotation of existing XML data sources. GeRoMeSuite allows for flexible configuration of the matching system; various matching algorithms for element and structure level matching are provided and can be combined freely using different ways of aggregation and filtering in order to define new matching strategies. This makes the matcher highly configurable and extensible. We evaluated our system with several pairs of XML Schemas and OWL ontologies and compared the performance with results from other systems. The results are considerably better which shows that a matching system based on a generic metamodel is favorable for heterogeneous matching tasks.

[1]  Silvana Castano,et al.  Global Viewing of Heterogeneous Data Sources , 2001, IEEE Trans. Knowl. Data Eng..

[2]  Philip A. Bernstein,et al.  Industrial-strength schema matching , 2004, SGMD.

[3]  Matthias Jarke,et al.  Generic Schema Mappings , 2007, ER.

[4]  Yannis Kalfoglou,et al.  Ontology mapping: the state of the art , 2003, The Knowledge Engineering Review.

[5]  Jérôme Euzenat,et al.  Similarity-Based Ontology Alignment in OWL-Lite , 2004, ECAI.

[6]  Yuzhong Qu,et al.  The Results of Falcon-AO in the OAEI 2006 Campaign , 2006, Ontology Matching.

[7]  Erhard Rahm,et al.  Generic Schema Matching with Cupid , 2001, VLDB.

[8]  Matthias Jarke,et al.  GeRoMe: A Generic Role Based Metamodel for Model Management , 2005, J. Data Semant..

[9]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.

[10]  Christoph Quix,et al.  Generic Schema Merging , 2007, CAiSE.

[11]  Laura M. Haas,et al.  Clio: a semi-automatic tool for schema mapping , 2001, SIGMOD '01.

[12]  Juan-Zi Li,et al.  Result of Ontology Alignment with RiMOM at OAEI'06 , 2006, Ontology Matching.

[13]  Erhard Rahm,et al.  Similarity flooding: a versatile graph matching algorithm and its application to schema matching , 2002, Proceedings 18th International Conference on Data Engineering.

[14]  Erhard Rahm,et al.  Schema and ontology matching with COMA++ , 2005, SIGMOD '05.

[15]  Stefanos D. Kollias,et al.  A String Metric for Ontology Alignment , 2005, SEMWEB.

[16]  Mary Czerwinski,et al.  Visualization of mappings between schemas , 2005, CHI.

[17]  Erhard Rahm,et al.  COMA++: Results for the Ontology Alignment Contest OAEI 2006 , 2006, Ontology Matching.

[18]  Bernadette M. Byrne,et al.  A longitudinal study of the diffusion of the ISO/IEC information resource dictionary system standard (IRDS.) , 2001 .

[19]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[20]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[21]  Erhard Rahm,et al.  Comparison of Schema Matching Evaluations , 2002, Web, Web-Services, and Database Systems.

[22]  Matthew A. Jaro,et al.  Probabilistic linkage of large public health data files. , 1995, Statistics in medicine.

[23]  Enrico Motta,et al.  The Semantic Web - ISWC 2005, 4th International Semantic Web Conference, ISWC 2005, Galway, Ireland, November 6-10, 2005, Proceedings , 2005, SEMWEB.

[24]  William E. Winkler,et al.  The State of Record Linkage and Current Research Problems , 1999 .

[25]  Rainer Unland,et al.  Objects, Components, Architectures, Services, and Applications for a Networked World , 2003, Lecture Notes in Computer Science.

[26]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[27]  Yong Li,et al.  GeRoMeSuite: A System for Holistic Generic Model Management , 2007, VLDB.

[28]  Erhard Rahm,et al.  COMA - A System for Flexible Combination of Schema Matching Approaches , 2002, VLDB.