Putting Hybrid Cultural Data on the Semantic Web

A prerequisite for joining the rapidly growing Semantic Web is to expose data as RDF triples. In the cultural heritage world the data in question is very often a mixture of structured database fields and associated textual documents. Transforming relational database (RDB) content to RDF is not altogether straightforward and the issues are examined as a preliminary to the much more difficult step of augmenting the RDB content by extracting structured RDF triples directly from natural language text, using a specially designed txt2rdf process. This opens the way to a true integration of the hybrid data so common in heritage management. Finally we lead up to experimental results showing structured queries (using SPARQL) that cannot be answered from the RDB material alone, but which are satisfied against the augmented graph. In this domain there are potentially vast amounts of textual material available for linking to structured records, so the future possibilities of the techniques described are exciting.

[1]  Zhisheng Huang,et al.  MultimediaN E-Culture Demonstrator , 2006, International Semantic Web Conference.

[2]  Kate Byrne,et al.  Populating the Semantic Web: Combining Text and Relational Databases as RDF , 2010 .

[3]  Nigel Shadbolt,et al.  Resource Description Framework (RDF) , 2009 .

[4]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[5]  Kate Byrne Nested Named Entity Recognition in Historical Archive Text , 2007, International Conference on Semantic Computing (ICSC 2007).

[6]  Kokou Yétongnon,et al.  DB2OWL : A Tool for Automatic Database-to-Ontology Mapping , 2007, SEBD.

[7]  Jeremy J. Carroll,et al.  Named graphs, provenance and trust , 2005, WWW '05.

[8]  Satya S. Sahoo,et al.  A Survey of Current Approaches for Mapping of Relational Databases to RDF , 2009 .

[9]  Asunción Gómez-Pérez,et al.  R2O, an extensible and semantically based database-to-ontology mapping language , 2004 .

[10]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .

[11]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .

[12]  Lee Feigenbaum,et al.  The Semantic Web in action. , 2007, Scientific American.

[13]  Roy Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation , 2000 .

[14]  Leo Sauermann,et al.  Cool URIs for the semantic web , 2007 .

[15]  Yuxin Mao,et al.  Dartgrid : a Semantic Web Toolkit for Integrating Heterogeneous Relational Databases , 2006 .

[16]  John Riley,et al.  Tim Berners-Lee , 1998 .

[17]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[18]  Eero Hyvönen,et al.  CultureSampo-Finnish Culture on the Semantic Web: The Vision and First Results , 2007 .

[19]  Kate Byrne Having Triplets – Holding Cultural Data as RDF , 2008 .