Semantic Querying of Data Guided by Formal Concept Analysis

In this paper we present a novel approach to querying over a concept lattice of documents and annotations. We focus on the problem of "non-matching documents": documents that are semantically relevant to the user query but do not contain the query's elements, and hence cannot be retrieved by typical string-matching approaches. To find these documents, we modify the initial user query using the concept lattice as a guide. We first identify the formal concept in the lattice that represents the user query, and then find potentially relevant concepts through the proposed notion of cousin concepts. Finally, we use a concept semantic similarity metric to rank and present the retrieved documents. The main contribution of this paper is the notion of cousin concepts of a given formal concept, together with a discussion of how this notion is useful for lattice-based information indexing and retrieval.
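
As an illustration of the first step only, the following minimal sketch shows one standard way to obtain the formal concept that represents a query, using the usual derivation operators of Formal Concept Analysis. The toy context, the document and term names, and the helper functions `extent`, `intent` and `query_concept` are assumptions made for this example and are not taken from the paper; cousin concepts and the similarity-based ranking are not modelled here.

```python
from typing import Dict, FrozenSet, Iterable, Set, Tuple

# Hypothetical toy formal context: each document maps to its annotation terms.
CONTEXT: Dict[str, Set[str]] = {
    "d1": {"lattice", "retrieval"},
    "d2": {"lattice", "retrieval", "ranking"},
    "d3": {"lattice", "ontology"},
}


def extent(terms: Iterable[str], context: Dict[str, Set[str]]) -> FrozenSet[str]:
    """Documents annotated with every term in `terms` (derivation of a term set)."""
    wanted = set(terms)
    return frozenset(doc for doc, attrs in context.items() if wanted <= attrs)


def intent(docs: Iterable[str], context: Dict[str, Set[str]]) -> FrozenSet[str]:
    """Terms shared by every document in `docs` (derivation of a document set)."""
    doc_list = list(docs)
    if not doc_list:
        # By convention, the empty document set shares all terms of the context.
        return frozenset(set().union(*context.values()))
    return frozenset(set.intersection(*(context[d] for d in doc_list)))


def query_concept(
    query: Iterable[str], context: Dict[str, Set[str]]
) -> Tuple[FrozenSet[str], FrozenSet[str]]:
    """Return the formal concept (extent, intent) generated by the query terms."""
    docs = extent(query, context)
    return docs, intent(docs, context)


if __name__ == "__main__":
    docs, terms = query_concept({"retrieval"}, CONTEXT)
    print(sorted(docs), sorted(terms))
```

In this toy context, a query whose terms never co-occur in any document (for example `{"ranking", "ontology"}`) yields a concept with an empty extent, which is the kind of situation where exact matching alone returns nothing and some form of query modification becomes necessary.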
