Enhancing the power of Web search engines by means of fuzzy query

Commercial Web search engines such as Yahoo!, Google, etc., have been defined which manage information only in a crisp way (i.e., keyword-based). Their query languages do not allow the expression of preferences or vagueness. They generally return many Web pages irrelevant to user's query. In order to handle these problems, we propose the Perception Index (PI) that contains attributes associated with a focal keyword restricted by fuzzy term(s) used in fuzzy queries on the Internet. The PI assists the user to reflect his/her perception in the process of query. If we integrate the Document Index (DI) used in commercial Web search engines with the proposed PI, we can handle both crisp terms (keyword-based) and fuzzy terms (perception-based). In this respect, the proposed approach is softer than the keyword-based approach. The PI brings somewhat closer to natural language. It is a further step toward a human-friendly, natural language-based interface for Web searching. Consequently, Internet users can narrow thousands of hits to the few that users really want. In this respect, the PI provides a new tool for targeting queries that users really want. In this paper, we also present a personalized search and ranking based on the PI.

[1]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[2]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[3]  Lotfi A. Zadeh,et al.  From Computing with Numbers to Computing with Words - from Manipulation of Measurements to Manipulation of Perceptions , 2005, Logic, Thought and Action.

[4]  Tran Cao Son,et al.  The semantic web: a brain for humankind , 2001 .

[5]  Ronald R. Yager,et al.  On Linguistic Summaries of Data , 1991, Knowledge Discovery in Databases.

[6]  Nicholas J. Belkin,et al.  Helping people find what they don't know , 2000, CACM.

[7]  Donald H. Kraft,et al.  Fuzzy information systems: managing uncertainty in databases and information retrieval systems , 1997, Fuzzy Sets Syst..

[8]  John McCarthy Phenomenal data mining , 2000, CACM.

[9]  Patrick Bosc,et al.  Fuzzy databases : principles and applications , 1996 .

[10]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[11]  Lotfi A. Zadeh,et al.  A New Direction in AI: Toward a Computational Theory of Perceptions , 2001, AI Mag..

[12]  Koichi Takeda,et al.  Information retrieval on the web , 2000, CSUR.

[13]  C. V. Negoiţă,et al.  Fuzzy logic in knowledge engineering , 1986 .

[14]  Michael David Williams,et al.  What Makes RABBIT Run? , 1984, Int. J. Man Mach. Stud..

[15]  Richard E. Blake,et al.  Indices and Data Structures in Information Systems , 1999, Informatica.

[16]  Ronald R. Yager,et al.  Finding fuzzy and gradual functional dependencies with SummarySQL , 1999, Fuzzy Sets Syst..

[17]  Philippe Martin,et al.  Knowledge Retrieval and the World Wide Web , 2000, IEEE Intell. Syst..

[18]  Arbee L. P. Chen,et al.  Enabling personalized recommendation on the Web based on user interests and behaviors , 2001, Proceedings Eleventh International Workshop on Research Issues in Data Engineering. Document Management for Data Intensive Business and Scientific Applications. RIDE 2001.

[19]  L. Zadeh A COMPUTATIONAL APPROACH TO FUZZY QUANTIFIERS IN NATURAL LANGUAGES , 1983 .

[20]  Rick Kazman,et al.  WebQuery: Searching and Visualizing the Web Through Connectivity , 1997, Comput. Networks.

[21]  David Wai-Lok Cheung,et al.  Anchor point indexing in Web document retrieval , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[22]  Ronald R. Yager Database discovery using fuzzy sets , 1996 .

[23]  Henri Prade,et al.  Fuzzy Logic Techniques in Multimedia Database Querying: A Preliminary Investigation of the Potentials , 2001, IEEE Trans. Knowl. Data Eng..