Instance-Based OWL Schema Matching

Schema matching is a fundamental issue in many database applications, such as query mediation and data warehousing. It becomes a challenge when different vocabularies are used to refer to the same real-world concepts. In this context, a convenient approach, sometimes called extensional, instance-based or semantic, is to detect how the same real world objects are represented in different databases and to use the information thus obtained to match the schemas. This paper describes an instance-based schema matching technique for an OWL dialect. The technique is based on similarity functions and is backed up by experimental results with real data downloaded from data sources found on the Web.

[1]  Silvana Castano,et al.  Semantic Information Interoperability in Open Networked Systems , 2004, ICSNW.

[2]  Amos Tversky,et al.  Studies of similarity , 1978 .

[3]  Marco A. Casanova,et al.  Database Conceptual Schema Matching , 2007, Computer.

[4]  L. Stein,et al.  OWL Web Ontology Language - Reference , 2004 .

[5]  Marco A. Casanova,et al.  Adaptative Matching of Database Web Services Export Schemas , 2008, ICEIS.

[6]  Ruy Luiz Milidiú,et al.  Mediation as Recommendation: An Approach to Design Mediators for Object Catalogs , 2006, OTM Workshops.

[7]  Zohra Bellahsene,et al.  XBenchMatch: a Benchmark for XML Schema Matching Tools , 2007, VLDB.

[8]  Ruy Luiz Milidiú,et al.  Towards Gazetteer Integration Through an Instance-based Thesauri Mapping Approach , 2006, GEOINFO.

[9]  Wei-Ying Ma,et al.  Instance-based Schema Matching for Web Databases by Domain-specific Query Probing , 2004, VLDB.

[10]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[11]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[12]  Marco A. Casanova,et al.  A Mediator for Heterogeneous Gazetteers , 2007 .

[13]  Antonio L. Furtado,et al.  Evaluation of Similarity Measures and Heuristics for Simple RDF Schema Matching , 2008 .

[14]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[15]  Felix Naumann,et al.  Schema matching using duplicates , 2005, 21st International Conference on Data Engineering (ICDE'05).

[16]  Marco A. Casanova,et al.  Semantic Web: Concepts, Technologies and Applications , 2007, NASA Monographs in Systems and Software Engineering.

[17]  AnHai Doan,et al.  Corpus-based schema matching , 2005, 21st International Conference on Data Engineering (ICDE'05).

[18]  Antonio L. Furtado,et al.  Database Mediation Using Multi-agent Systems , 2008, 2008 32nd Annual IEEE Software Engineering Workshop.

[19]  Marco A. Casanova,et al.  An Instance-based Approach for Matching Export Schemas of Geographical Database Web Services , 2007, GEOINFO.

[20]  Ruy Luiz Milidiú,et al.  Conceptual schema matching based on similarity heuristics , 2009 .

[21]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[22]  Marco A. Casanova,et al.  Matching object catalogues , 2008, Innovations in Systems and Software Engineering.