Graded-Inclusion-Based Information Retrieval Systems

This paper investigates the use of fuzzy logic mechanisms coming from the database community, namely graded inclusions, to model the information retrieval process. In this framework, documents and queries are represented by fuzzy sets, which are paired with operations like fuzzy implications and T-norms. Through different experiments, it is shown that only some among the wide range of fuzzy operations are relevant for information retrieval. When appropriate settings are chosen, it is possible to mimic classical systems, thus yielding results rivaling those of state-of-the-art systems. These positive results validate the proposed approach, while negative ones give some insights on the properties needed by such a model. Moreover, this paper shows the added-value of this graded inclusion-based model, which gives new and theoretically grounded ways for a user to easily weight his query terms, to include negative information in his queries, or to expand them with related terms.

[1]  Donald H. Kraft,et al.  Threshold values and Boolean retrieval systems , 1981, Inf. Process. Manag..

[2]  Didier Dubois,et al.  Flexible Queries in Relational Databases - The Example of the Division Operator , 1997, Theor. Comput. Sci..

[3]  Edward A. Fox,et al.  Research Contributions , 2014 .

[4]  Donald H. Kraft,et al.  A mathematical model of a weighted boolean retrieval system , 1979, Inf. Process. Manag..

[5]  Enrique Herrera-Viedma,et al.  Modeling the retrieval process for an information retrieval system using an ordinal fuzzy linguistic approach , 2001, J. Assoc. Inf. Sci. Technol..

[6]  Mohand Boughanem,et al.  Improving Document Ranking in Information Retrieval Using Ordered Weighted Aggregation and Leximin Refinement , 2005, EUSFLAT Conf..

[7]  Duncan A. Buell,et al.  An analysis of some fuzzy subset applications to information retrieval systems , 1982 .

[8]  Enrique Herrera-Viedma,et al.  A Fuzzy Linguistic IRS Model Based on a 2-Tuple Fuzzy Linguistic Approach , 2007, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[9]  D. Dubois,et al.  Fundamentals of fuzzy sets , 2000 .

[10]  Wojciech Rytter,et al.  Extracting Powers and Periods in a String from Its Runs Structure , 2010, SPIRE.

[11]  R. Yager,et al.  Fuzzy Set-Theoretic Operators and Quantifiers , 2000 .

[12]  Christiane Fellbaum,et al.  Using Wordnet for Text Retrieval , 1998 .

[13]  Patrick Bosc,et al.  On the use of tolerant graded inclusions in information retrieval , 2008, CORIA.

[14]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[15]  Mohand Boughanem,et al.  A Model for Information Retrieval Based on Possibilistic Networks , 2005, SPIRE.

[16]  Donald H. Kraft,et al.  Vagueness and uncertainty in information retrieval: how can fuzzy sets help? , 2006, IWRIDL '06.

[17]  Mounia Lalmas,et al.  Logical Models in Information Retrieval: Introduction and Overview , 1998, Inf. Process. Manag..

[18]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[19]  Abraham Bookstein,et al.  Fuzzy requests: An approach to weighted boolean searches , 1980, J. Am. Soc. Inf. Sci..

[20]  Samia Nefti-Meziani,et al.  Personalized Information Retrieval system in the Framework of Fuzzy Logic , 2008, EUSFLAT Conf..

[21]  Patrick Bosc,et al.  On a Parameterized Antidivision Operator for Database Flexible Querying , 2008, DEXA.