We are developing a knowledge base that integrates complementary archaeological information sources. Our source data comprise complementary scientific databases and corpora describing finds with inscriptions and iconography of the Roman era. The integration of such complementary information is innovative and of immense potential value for the cultural heritage domain. Integration is achieved by intellectually interpreting each source schema in terms of the CIDOC CRM model and storing it in an RDF knowledge base, thus creating a body of unique archaeological knowledge in digital form. Our main objective is to provide procedures for information extraction and global querying over all the contents of the complementary resources. Additionally we aim at performing reliable statistical evaluation of the integrated data. In order to ensure that the methods used converge towards the best state of knowledge available and that the results are of high quality, we apply data cleaning procedures both at the individual sources and at the integrated knowledge base.
[1]
Diego Calvanese,et al.
Description Logic Framework for Information Integration
,
1998,
KR.
[2]
Erhard Rahm,et al.
Data Cleaning: Problems and Current Approaches
,
2000,
IEEE Data Eng. Bull..
[3]
Amit P. Sheth,et al.
Complex relationships and knowledge discovery support in the InfoQuilt system
,
2003,
The VLDB Journal.
[4]
Dennis Shasha,et al.
Declaratively Cleaning your Data with AJAX
,
2000,
BDA.
[5]
M. Doerr,et al.
The CIDOC CRM – an Ontological Approach to Semantic Interoperability of Metadata
,
2003
.