Fuzzy Association Rules for Query Refinement in Web Retrieval

In this paper, we present an application for helping users to find new query terms in web retrieval via fuzzy association rules. Once the user has made an initial query, a set of documents is retrieved from the web. Representing these documents as text transactions, each item in the transaction means the presence of the term in the document. From the set of transactions, fuzzy association rules are extracted. Based on the thresholds of support and certainty factor, a selection of rules is carried out and the terms in those rules are offered to the user to be added to the query and to improve the retrieval.

[1]  Edward H. Shortliffe,et al.  The application of certainty factors to neural computing for rule discovery , 2000, IEEE Trans. Neural Networks Learn. Syst..

[2]  C. J. van Rijsbergen,et al.  The selection of good search terms , 1981, Inf. Process. Manag..

[3]  Ron Weiss,et al.  Fast and effective query refinement , 1997, SIGIR '97.

[4]  Chris Buckley,et al.  Improving automatic query expansion , 1998, SIGIR '98.

[5]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[6]  Keith C. C. Chan,et al.  An effective algorithm for discovering fuzzy rules in relational databases , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[7]  Shian-Shyong Tseng,et al.  Intelligent query agent for structural document databases , 1999 .

[8]  Yehuda Lindell,et al.  Text Mining at the Term Level , 1998, PKDD.

[9]  Hsinchun Chen,et al.  A concept space approach to addressing the vocabulary problem in scientific information retrieval: an experiment on the worm community system , 1997 .

[10]  Gloria Bordogna,et al.  Fuzzy Approaches to Extend Boolean Information Retrieval , 1995 .

[11]  Donald H. Kraft,et al.  Rules and fuzzy rules in text: concept, extraction and usage , 2003, Int. J. Approx. Reason..

[12]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[13]  Daniel Sánchez,et al.  Fuzzy cardinality based evaluation of quantified sentences , 2000, Int. J. Approx. Reason..

[14]  Miguel Delgado,et al.  Mining Fuzzy Association Rules: An Overview , 2005 .

[15]  Man Hon Wong,et al.  Mining fuzzy association rules in databases , 1998, SGMD.

[16]  Susan Gauch,et al.  An expert system for automatic query reformation , 1993 .

[17]  Aviezri S. Fraenkel,et al.  Local Feedback in Full-Text Retrieval Systems , 1977, JACM.

[18]  G. Pasi,et al.  A Fuzzy Linguistic Approach Generalizing Boolean Information Retrieval: a Model and its Evaluation , 1993 .

[19]  James Allan,et al.  Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[20]  W. Bruce Croft,et al.  I 3 R: a new approach to the design of document retrieval systems , 1987 .

[21]  Chi-Sheng Shih,et al.  Extracting classification knowledge of Internet documents with mining term associations: a semantic approach , 1998, SIGIR '98.

[22]  L. Zadeh A COMPUTATIONAL APPROACH TO FUZZY QUANTIFIERS IN NATURAL LANGUAGES , 1983 .

[23]  Fei Song,et al.  Knowledge-Based Approaches to Query Expansion in Information Retrieval , 1996, Canadian Conference on AI.

[24]  Peter Willett,et al.  The Limitations of Term Co-Occurrence Data for Query Expansion in Document Retrieval Systems , 1991 .

[25]  Daniel Sánchez,et al.  Fuzzy association rules: general model and applications , 2003, IEEE Trans. Fuzzy Syst..

[26]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[27]  Martin Rajman,et al.  Text Mining: Natural Language techniques and Text Mining applications , 1998 .

[28]  Gabriella Pasi,et al.  A fuzzy representation of HTML documents for information retrieval systems , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[29]  Tzung-Pei Hong,et al.  Mining association rules from quantitative data , 1999, Intell. Data Anal..

[30]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[31]  Daniel Sánchez,et al.  Mining Text Data: Special Features and Patterns , 2002, Pattern Detection and Discovery.

[32]  Hans-Peter Frei,et al.  Concept based query expansion , 1993, SIGIR.

[33]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[34]  Donald H. Kraft,et al.  Vocabulary mining for information retrieval: rough sets and fuzzy sets , 2001, Inf. Process. Manag..