Revisiting Exhaustivity and Specificity Using Propositional Logic and Lattice Theory

Exhaustivity and Specificity in logical Information Retrieval framework were introduced by Nie [16]. However, even with some attempts, they are still theoretical notions without a clear idea of how to be implemented. In this study, we present a new approach to deal with them. We use propositional logic and lattice theory in order to redefine the two implications and their uncertainty P(d → q) and P(q → d). We also show how to integrate the two notions into a concrete IR model for building a new effective model. Our proposal is validated against six corpora, and using two types of terms (words and concepts). The experimental results showed the validity of our viewpoint, which state: the explicit integration of Exhaustivity and Specificity into IR models will improve the retrieval performance of these models. Moreover, there should be a type of balance between the two notions.

[1]  C. J. van Rijsbergen,et al.  A Non-Classical Logic for Information Retrieval , 1997, Comput. J..

[2]  Donna K. Harman,et al.  Overview of the Sixth Text REtrieval Conference (TREC-6) , 1997, Inf. Process. Manag..

[3]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[4]  James Allan,et al.  A comparison of statistical significance tests for information retrieval evaluation , 2007, CIKM '07.

[5]  C. Berrut,et al.  A New Lattice-Based Information Retrieval Theory , 2013 .

[6]  Jean-Pierre Chevallet,et al.  About Retrieval Models and Logic , 1992, Comput. J..

[7]  Jian-Yun Nie An outline of a general model for information retrieval systems , 1988, SIGIR '88.

[8]  Tao Tao,et al.  A formal study of information retrieval heuristics , 2004, SIGIR '04.

[9]  Donna K. Harman,et al.  Overview of the Eighth Text REtrieval Conference (TREC-8) , 1999, TREC.

[10]  Kevin H. Knuth,et al.  Deriving Laws from Ordering Relations , 2004, physics/0403031.

[11]  Joo-Hwee Lim,et al.  Domain knowledge conceptual inter-media indexing: application to multilingual multimedia medical reports , 2007, CIKM '07.

[12]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[13]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[14]  Yves Chiaramella,et al.  A Model for Multimedia Information Retrieval , 1996 .

[15]  David E. Losada,et al.  A Logical Model for Information Retrieval based on Propositional Logic and Belief Revision , 2001, Comput. J..

[16]  W. Bruce Croft,et al.  A language modeling approach to information retrieval , 1998, SIGIR '98.

[17]  Jean-Pierre Chevallet,et al.  The Effective Relevance Link between a Document and a Query , 2012, DEXA.

[18]  Jean-Pierre Chevallet,et al.  Is uncertain logical-matching equivalent to conditional probability? , 2013, SIGIR.

[19]  Kevin H. Knuth,et al.  Lattice duality: The origin of probability and entropy , 2013, Neurocomputing.

[20]  S. Robertson The probability ranking principle in IR , 1997 .

[21]  Jean-Pierre Chevallet,et al.  MRIM at ImageCLEF2012. From Words to Concepts: A New Counting Approach , 2012, CLEF.

[22]  Fabio Crestani,et al.  Exploiting the Similarity of Non-Matching Terms at Retrieval Time , 2000, Information Retrieval.

[23]  Amit Singhal,et al.  Pivoted document length normalization , 1996, SIGIR 1996.

[24]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[25]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.