LODE: Linking digital humanities content to the web of data

Numerous digital libraries projects maintain their data collections in the form of text, images, and metadata. While data may be stored in many formats, from plain text to XML to relational databases, the use of the resource description framework (RDF) as a standardized representation has gained considerable traction during the last five years. Almost every digital humanities meeting has at least one session concerned with the topic of digital humanities, RDF, and linked data, including JCDL. While most existing work in linked data has focused on improving algorithms for entity matching, the aim of our Linked Open Data Enhancer Lode is to work “out of the box”, enabling their use by humanities scholars, computer scientists, librarians, and information scientists alike. With Lode we enable non-technical users to enrich a local RDF repository with high-quality data from the Linked Open Data cloud. Lode links and enhances the local RDF repository without reducing the quality of the data. In particular, we support the user in the enhancement and linking process by providing intuitive user-interfaces and by suggesting high quality linking candidates using state of the art matching algorithms. We hope that the Lode framework will be useful to digital humanities scholars complementing other digital humanities tools.

[1]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[2]  Annabel Pollock,et al.  What''s Wrong with Internet Searching , 1997 .

[3]  Craig A. Knoblock,et al.  Learning domain-independent string transformation weights for high accuracy object identification , 2002, KDD.

[4]  Ahmed K. Elmagarmid,et al.  TAILOR: a record linkage toolbox , 2002, Proceedings 18th International Conference on Data Engineering.

[5]  Raymond J. Mooney,et al.  Adaptive duplicate detection using learnable string similarity measures , 2003, KDD '03.

[6]  Dieter Fensel,et al.  Towards Semantic Web Portals , 2004, WWW Workshop on Application Design, Development and Implementation Issues in the Semantic Web.

[7]  Jeffrey P. Bigham,et al.  Organizing and Searching the World Wide Web of Facts - Step One: The One-Million Fact Extraction Challenge , 2006, AAAI.

[8]  Heiner Stuckenschmidt,et al.  Results of the Ontology Alignment Evaluation Initiative 2007 , 2006, OM.

[9]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[10]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[11]  Heiner Stuckenschmidt,et al.  Results of the Ontology Alignment Evaluation Initiative , 2007 .

[12]  Mathias Niepert,et al.  A dynamic ontology for a dynamic reference work , 2007, JCDL '07.

[13]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[14]  Daniel Hahn,et al.  Talia: A Framework for Philosophy Scholars , 2007, SWAP.

[15]  Marius Pasca,et al.  Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds , 2007, WWW '07.

[16]  Daniel Hahn,et al.  A Semantic Web Powered Distributed Digital Library System , 2008, ELPUB.

[17]  Mathias Niepert,et al.  Answer Set Programming on Expert Feedback to Populate and Extend Dynamic Ontologies , 2008, FLAIRS Conference.

[18]  Ivan Janciak,et al.  UK e-Science All Hands Meeting , 2009 .

[19]  Jens Lehmann,et al.  DBpedia - A crystallization point for the Web of Data , 2009, J. Web Semant..

[20]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[21]  Yi Li,et al.  RiMOM: A Dynamic Multistrategy Ontology Alignment Framework , 2009, IEEE Transactions on Knowledge and Data Engineering.

[22]  Mathias Niepert,et al.  From encyclopedia to ontology: toward dynamic representation of the discipline of philosophy , 2011, Synthese.

[23]  Raghav Kaushik,et al.  On active learning of record matching packages , 2010, SIGMOD Conference.

[24]  Heiner Stuckenschmidt,et al.  A Probabilistic-Logical Framework for Ontology Matching , 2010, AAAI.

[25]  Heiner Stuckenschmidt,et al.  Leveraging Terminological Structure for Object Reconciliation , 2010, ESWC.

[26]  Mathias Niepert A Delayed Column Generation Strategy for Exact k-Bounded MAP Inference in Markov Logic Networks , 2010, UAI.

[27]  Gisele L. Pappa,et al.  Active Learning Genetic programming for record deduplication , 2010, IEEE Congress on Evolutionary Computation.

[28]  Robert Isele,et al.  Silk Server - Adding missing Links while consuming Linked Data , 2010, COLD.

[29]  Estevam R. Hruschka,et al.  Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.

[30]  Heiner Stuckenschmidt,et al.  Coherent Top-k Ontology Alignment for OWL EL , 2011, SUM.

[31]  Heiner Stuckenschmidt,et al.  Probabilistic-Logical Web Data Integration , 2011, Reasoning Web.

[32]  Christian Meilicke,et al.  Alignment incoherence in ontology matching , 2011 .

[33]  Heiner Stuckenschmidt,et al.  Ontology Alignment Evaluation Initiative: Six Years of Experience , 2011, J. Data Semant..

[34]  Jens Lehmann,et al.  RAVEN - active learning of link specifications , 2011, OM.

[35]  Heiner Stuckenschmidt,et al.  Interactive Data Integration with MappingAssistant , 2011 .

[36]  Jan Nößner,et al.  CODI: Combinatorial Optimization for Data Integration: results for OAEI 2011 , 2010, OM.

[37]  Antoine Isaac,et al.  data.europeana.eu: The Europeana Linked Open Data Pilot , 2011, Dublin Core Conference.

[38]  Axel-Cyrille Ngonga Ngomo,et al.  EAGLE: Efficient Active Learning of Link Specifications Using Genetic Programming , 2012, ESWC.

[39]  Ian Horrocks,et al.  Large-scale Interactive Ontology Matching: Algorithms and Implementation , 2012, ECAI.

[40]  G. Schreiber,et al.  Key choices in the design of Simple Knowledge Organization System (SKOS) , 2013, J. Web Semant..

[41]  Kai Eckert Provenance and Annotations for Linked Data , 2013, Dublin Core Conference.

[42]  Alois Pichler,et al.  Sharing and debating Wittgenstein by using an ontology , 2013, Lit. Linguistic Comput..

[43]  Simone Paolo Ponzetto,et al.  Integrating Open and Closed Information Extraction: Challenges and First Steps , 2013, NLP-DBPEDIA@ISWC.

[44]  Robert Isele,et al.  Active learning of expressive linkage rules using genetic programming , 2013, J. Web Semant..

[45]  Francesco Piazza,et al.  Pundit: augmenting web contents with semantics , 2013, Lit. Linguistic Comput..

[46]  Alois Pichler,et al.  Overlapping and competing ontologies , 2013, DH-CASE '13.

[47]  S. Gradmann,et al.  Modellierung und Ontologien im Wissensmanagement , 2014 .

[48]  Kai Eckert,et al.  RESTful open workflows for data provenance and reuse , 2014, WWW '14 Companion.