Semi-fuzzy Quantifiers for Information Retrieval

Recent research on fuzzy quantification for information retrieval has proposed the application of semi-fuzzy quantifiers for improving query languages. Fuzzy quantified sentences are useful as they allow additional restrictions to be imposed on the retrieval process unlike more popular retrieval approaches, which lack the facility to accurately express information needs. For instance, fuzzy quantification supplies a variety of methods for combining query terms whereas extended boolean models can only handle extended boolean-like operators to connect query terms. Although some experiments validating these advantages have been reported in recent works, a comparison against state-of-the-art techniques has not been addressed. In this work we provide empirical evidence on the adequacy of fuzzy quantifiers to enhance information retrieval systems. We show that our fuzzy approach is competitive with respect to models such as the vector-space model with pivoted document-length normalization, which is at the heart of some high-performance web search systems. These empirical results strengthen previous theoretical works that suggested fuzzy quantification as an appropriate technique for modeling information needs. In this respect, we demonstrate here the connection between the retrieval framework based on the concept of semi-fuzzy quantifier and the seminal proposals for modeling linguistic statements through Ordered Weighted Averaging operators (OWA).

[1]  Edward A. Fox,et al.  Research Contributions , 2014 .

[2]  Amit Singhal,et al.  AT&T at TREC-9 , 2000, TREC.

[3]  R. Yager Connectives and quantifiers in fuzzy sets , 1991 .

[4]  Fabio Crestani,et al.  Soft computing in information retrieval: techniques and applications , 2000 .

[5]  Fabio Crestani,et al.  Soft Computing in Web Information Retrieval - Models and Applications , 2006, Studies in Fuzziness and Soft Computing.

[6]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[7]  Ronald R. Yager,et al.  On ordered weighted averaging aggregation operators in multicriteria decisionmaking , 1988, IEEE Trans. Syst. Man Cybern..

[8]  Senén Barro,et al.  A framework for fuzzy quantification models analysis , 2003, IEEE Trans. Fuzzy Syst..

[9]  Tetsuya Morita,et al.  A fuzzy document retrieval system using the keyword connection matrix and a learning method , 1991 .

[10]  D. Kraft,et al.  Fuzzy Sets and Generalized Boolean Retrieval Systems , 1983, Int. J. Man Mach. Stud..

[11]  Gloria Bordogna,et al.  Modeling Vagueness in Information Retrieval , 2000, ESSIR.

[12]  R. Yager Families of OWA operators , 1993 .

[13]  Myoung-Ho Kim,et al.  On the evaluation of Boolean operators in the extended Boolean retrieval framework , 1993, SIGIR.

[14]  Peter Bailey,et al.  Overview of the TREC-8 Web Track , 2000, TREC.

[15]  Enrique Herrera-Viedma,et al.  Fuzzy approaches to access information on the web: recent developments and research trends , 2003, EUSFLAT Conf..

[16]  Senén Barro,et al.  Experiments on using fuzzy quantified sentences in adhoc retrieval , 2004, SAC '04.

[17]  Lotfi A. Zadeh,et al.  A COMPUTATIONAL APPROACH TO FUZZY QUANTIFIERS IN NATURAL LANGUAGES , 1983 .

[18]  Tadeusz Radecki Outline of a fuzzy logic approach to information retrieval , 1981 .

[19]  Fabio Crestani,et al.  Soft Computing in Information Retrieval , 2000 .

[20]  Senén Barro,et al.  Voting-model based evaluation of fuzzy quantified sentences: a general framework , 2004, Fuzzy Sets Syst..

[21]  H. Ritter,et al.  A Framework for Evaluating Approaches to Fuzzy Quantification , 1999 .

[22]  Donna K. Harman,et al.  Overview of the Third Text REtrieval Conference (TREC-3) , 1995, TREC.

[23]  Chris Buckley,et al.  Pivoted Document Length Normalization , 1996, SIGIR Forum.

[24]  Ronald R. Yager,et al.  On ordered weighted averaging aggregation operators in multicriteria decision-making , 1988 .

[25]  Ingo Glöckner Fuzzy quantifiers in natural language:: semantics and computational models , 2004 .

[26]  Daniel Sánchez,et al.  Fuzzy cardinality based evaluation of quantified sentences , 2000, Int. J. Approx. Reason..

[27]  Michael Collins,et al.  AT&T at TREC-8 , 1999, TREC.

[28]  Gloria Bordogna,et al.  Linguistic aggregation operators of selection criteria in fuzzy information retrieval , 1995, Int. J. Intell. Syst..

[29]  Patrick Bosc,et al.  Quantified Statements and Database Fuzzy Querying , 1995 .

[30]  A. Knoll,et al.  A formal theory of fuzzy natural language quantification and its role in granular computing , 2001 .

[31]  Joon Ho Lee,et al.  Properties of extended Boolean models in information retrieval , 1994, SIGIR '94.

[32]  Ronald R. Yager,et al.  A general approach to rule aggregation in fuzzy logic control , 1992, Applied Intelligence.

[33]  D. K. Harmon,et al.  Overview of the Third Text Retrieval Conference (TREC-3) , 1996 .

[34]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[35]  Donald H. Kraft,et al.  A model for a weighted retrieval system , 1981, J. Am. Soc. Inf. Sci..