Semi-automatically Mapping Structured Sources into the Semantic Web

Linked data continues to grow at a rapid rate, but a limitation of a lot of the data that is being published is the lack of a semantic description. There are tools, such as D2R, that allow a user to quickly convert a database into RDF, but these tools do not provide a way to easily map the data into an existing ontology. This paper presents a semi-automatic approach to map structured sources to ontologies in order to build semantic descriptions (source models). Since the precise mapping is sometimes ambiguous, we also provide a graphical user interface that allows a user to interactively refine the models. The resulting source models can then be used to convert data into RDF with respect to a given ontology or to define a SPARQL end point that can be queried with respect to an ontology. We evaluated the overall approach on a variety of sources and show that it can be used to quickly build source models with minimal user interaction.

[1]  Timothy W. Finin,et al.  OWL as a Target for Information Extraction Systems (Statement of Interest) , 2008, OWLED.

[2]  Kristina Lerman,et al.  Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation , 2011, AAAI.

[3]  John Mylopoulos,et al.  A Semantic Approach to Discovering Schema Mapping Expressions , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[4]  Jérôme Euzenat,et al.  A Survey of Schema-Based Matching Approaches , 2005, J. Data Semant..

[5]  Asunción Gómez-Pérez,et al.  Upgrading relational legacy data to the semantic web , 2006, WWW '06.

[6]  Christian Becker,et al.  Extending SMW+ with a Linked Data Integration Framework , 2010, ISWC Posters&Demos.

[7]  Todd D. Millstein,et al.  Navigational Plans For Data Integration , 1999, AAAI/IAAI.

[8]  Nikolas Mitrou,et al.  Bringing relational databases into the Semantic Web: A survey , 2012, Semantic Web.

[9]  Christian Bizer,et al.  The R2R Framework: Publishing and Discovering Mappings on the Web , 2010, COLD.

[10]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[11]  Laura M. Haas,et al.  Clio: Schema Mapping Creation and Data Exchange , 2009, Conceptual Modeling: Foundations and Applications.

[12]  Craig A. Knoblock,et al.  Building Mashups by Demonstration , 2011, TWEB.

[13]  Marcelo Arenas,et al.  Relational and XML Data Exchange , 2010, Relational and XML Data Exchange.

[14]  Phokion G. Kolaitis,et al.  Designing and refining schema mappings via data examples , 2011, SIGMOD '11.

[15]  C. Bizer,et al.  Enabling Tailored Therapeutics with Linked Data , 2009 .

[16]  Erhard Rahm,et al.  Evolution of the COMA match system , 2011, OM.

[17]  Erhard Rahm,et al.  Schema Matching and Mapping , 2013, Schema Matching and Mapping.

[18]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .

[19]  Kristina Lerman,et al.  Semantic Labeling of Online Information Sources , 2007, Int. J. Semantic Web Inf. Syst..

[20]  George Markowsky,et al.  A fast algorithm for Steiner trees , 1981, Acta Informatica.

[21]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[22]  Pedro M. Domingos,et al.  Learning Source Description for Data Integration , 2000, WebDB.

[23]  Pedro M. Domingos,et al.  Learning Source Descriptions for Data Integration , 2000 .