Internationalization of Linked Data: The case of the Greek DBpedia edition

This paper describes the deployment of the Greek DBpedia and our contributions to the DBpedia information extraction framework with regard to internationalization (I18n) and multilingual support. We propose I18n filters as pluggable components that address issues arising when extracting knowledge from non-English Wikipedia editions. We report on our strategy for supporting Internationalized Resource Identifiers (IRIs) and introduce two new extractors that complement the I18n filters. Additionally, the paper discusses the definition of Transparent Content Negotiation (TCN) rules for IRIs to address dereferencing and IRI serialization problems. The aim of this research is to establish best practices, complemented by software, that allow the DBpedia community to easily generate, maintain, and properly interlink language-specific DBpedia editions. Furthermore, these best practices can be applied to the publication of Linked Data in non-Latin languages in general.
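
To make the IRI serialization issue concrete, the following minimal Python sketch shows how a resource identifier containing Greek characters could be mapped to a plain ASCII URI by percent-encoding its UTF-8 bytes, and back again. It is an illustration only, not the extraction framework's implementation: the function names `iri_to_uri` and `uri_to_iri` and the example identifier are assumptions made for this sketch, and the code does not handle non-ASCII host names, which would need separate IDNA treatment.

```python
from urllib.parse import quote, unquote

# URI delimiters that must stay unescaped, plus "%" so that
# already percent-encoded sequences are not encoded a second time.
_SAFE = ":/?#[]@!$&'()*+,;=%"


def iri_to_uri(iri: str) -> str:
    """Percent-encode every non-ASCII character as its UTF-8 bytes."""
    return quote(iri, safe=_SAFE, encoding="utf-8")


def uri_to_iri(uri: str) -> str:
    """Decode percent-escapes back to Unicode.

    Simplification: this also decodes escaped reserved characters
    (for example %2F), which a strict converter would leave alone.
    """
    return unquote(uri, encoding="utf-8")


# Illustrative identifier (an assumption for this sketch); the escapes
# spell the Greek name of Athens.
example_iri = "http://el.dbpedia.org/resource/\u0391\u03b8\u03ae\u03bd\u03b1"
example_uri = iri_to_uri(example_iri)
print(example_uri)
# prints http://el.dbpedia.org/resource/%CE%91%CE%B8%CE%AE%CE%BD%CE%B1
```

The two forms name the same resource, but only the IRI is readable to a Greek speaker; the encoded URI is what many older tools and serializations expect.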
