Information Retrieval: From Language Models to Fuzzy Logic

This internship is concerned in modeling and testing an information retrieval (IR) system. These systems search in the content of a large collection of documents and retrieve the ones that are relevant to a user's demand, which is usually represented in the form of a query. The content of a document is generally indexed by terms, users' queries are also indexed by terms that are used to identify topics of interest. Terms in documents and queries are usually given specic weights in accordance to their signicance. An information retrieval system has a matching function that assigns scores to documents, each score represents a degree of relevance between a document and a query. Documents are then ranked according to their scores, and top n-documents are retrieved to the user. The subject of the internship aims to exploit fuzzy logic concepts as well as language modeling approaches for IR in order to build and test an information retrieval model.

[1]  H. Larsen,et al.  Importance weighted OWA aggregation of multicriteria queries , 1999, 18th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.99TH8397).

[2]  Didier Dubois,et al.  Knowledge-Driven versus Data-Driven Logics , 2000, J. Log. Lang. Inf..

[3]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[4]  W. Bruce Croft,et al.  A general language model for information retrieval , 1999, CIKM '99.

[5]  Patrick Bosc,et al.  On the use of tolerant graded inclusions in information retrieval , 2008, CORIA.

[6]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[7]  Stanley F. Chen,et al.  An empirical study of smoothing techniques for language modeling , 1999 .

[8]  W. Bruce Croft,et al.  Statistical language modeling for information retrieval , 2006, Annu. Rev. Inf. Sci. Technol..

[9]  Ellen M. Voorhees,et al.  The Philosophy of Information Retrieval Evaluation , 2001, CLEF.

[10]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[11]  Gaspar Mayor,et al.  Aggregation Operators , 2002 .

[12]  Vincent Claveau Acquisition automatique de lexiques sémantiques pour la recherche d'information. (Automatic acquisition of semantic lexicons for information retrieval) , 2003 .

[13]  Didier Dubois,et al.  The three semantics of fuzzy sets , 1997, Fuzzy Sets Syst..

[14]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[15]  Jean-Luc Marichal,et al.  On Sugeno integral as an aggregation function , 2000, Fuzzy Sets Syst..

[16]  Vincent Claveau,et al.  Implication-based and cardinality-based inclusions in information retrieval , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[17]  W. Bruce Croft,et al.  A general language model for information retrieval (poster abstract) , 1999, SIGIR '99.