Information extraction and integration for enriching cultural heritage collections

Cultural heritage plays an important role in preserving social characteristics and knowledge for future generations. To provide long-term access to these resources, many cultural materials are now archived digitally. A problem arises because each cultural archive maintains its own large database, collected for the same types of cultural material but for different purposes. As a result, the archives follow various metadata standards, which makes it difficult to enhance or refine the raw data. This calls for a framework that integrates various subjects and metadata standards and extracts relationships among archives to enrich information retrieval. In this paper, we propose a new approach for discovering semantic relations between entities in articles, using Wikipedia and various cultural heritage archives as resources. The approach consists of (1) dictionary extraction patterns, used to extract terms and their meanings in order to build a cultural heritage dictionary, and (2) semantic relation extraction, which extracts relations according to question words. We also present a method that enriches cultural heritage information with the results of semantic relation extraction by means of semantic string similarity matching. An evaluation on different domains shows high performance of the proposed approach.
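
The matching step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the similarity measure, so this example substitutes plain character-based similarity from Python's standard difflib module for the semantic string similarity of the paper. The function names, the threshold of 0.8, and the sample labels are all hypothetical and chosen only for illustration.

```python
from difflib import SequenceMatcher


def normalize(label: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not affect the score."""
    return " ".join(label.lower().split())


def similarity(a: str, b: str) -> float:
    """Character-based similarity in [0, 1]; a stand-in, not the paper's semantic measure."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_match(entity: str, candidates: list[str], threshold: float = 0.8):
    """Return the candidate most similar to `entity`, or None if none reaches the threshold.

    The threshold value is an assumption for this sketch, not a value from the paper.
    """
    scored = [(similarity(entity, c), c) for c in candidates]
    score, label = max(scored, default=(0.0, None))
    return label if score >= threshold else None


if __name__ == "__main__":
    # Hypothetical labels from two archives that describe the same entity differently.
    archive_labels = ["wat  phra kaew", "Wat Arun"]
    print(best_match("Wat Phra Kaew", archive_labels))
```

A measure of this kind only compares surface forms. Matching labels that differ in wording but share a meaning would need an additional resource such as the cultural heritage dictionary the abstract describes.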
