Issues in an inference platform for generating deductive knowledge: a case study in cultural heritage digital libraries using the CIDOC CRM

Most information retrieval research focuses on collecting documents that match the same set of concepts. This study considers a more advanced problem: how to discover knowledge that is not contained in any single source by combining historical facts. Using a well-designed core ontology for the cultural domain (CIDOC CRM, ISO 21127), this study discusses the requirements for a robust inference platform for real-life knowledge discovery and integration over distributed sources. The methodology and design are justified in detail through functional requirements for an inference service capable of inferring new knowledge from combinations of facts distributed over different sources. A number of critical issues in developing such a robust inference platform are identified, namely (1) systematic accumulation of common concepts and inference rules; (2) extending the ontology with metaclasses; (3) accumulation of factual and categorical knowledge; (4) incorporation of fuzzy inference into the inference engine; and (5) improvement of performance and scalability in the inference engine.
