Information granulation for web-based information support systems

In this paper, we discuss potential applications of data mining techniques to the design of Web-based information retrieval support systems (IRSS). In particular, we apply clustering methods to granulate the different entities involved in IRSS. Two types of granulation, single-level and multi-level, are investigated. We study in detail the granulation of the document space, the query space, the term space, and the retrieval results. We show that each type of granulation supports a different user task.
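As a rough illustration of the distinction between single-level and multi-level granulation, the following sketch clusters a toy document space. It is not the method of the paper: the Jaccard similarity, the single-link grouping rule, the thresholds, the sample documents, and the names `jaccard`, `granulate`, and `multi_level` are all assumptions made for this example.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two term sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def granulate(docs, threshold):
    """Single-level granulation: partition document ids into granules.

    Two documents share a granule when a chain of document pairs, each with
    similarity >= threshold, connects them (single-link clustering).
    """
    parent = {d: d for d in docs}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(docs, 2):
        if jaccard(docs[a], docs[b]) >= threshold:
            parent[find(a)] = find(b)

    granules = {}
    for d in docs:
        granules.setdefault(find(d), set()).add(d)
    # Sort granules by their members so the result is deterministic.
    return sorted((frozenset(g) for g in granules.values()), key=sorted)


def multi_level(docs, thresholds):
    """Multi-level granulation: one partition per threshold, fine to coarse."""
    return [granulate(docs, t) for t in sorted(thresholds, reverse=True)]


# Hypothetical document space: each document is reduced to a set of index terms.
docs = {
    "d1": {"rough", "set", "approximation"},
    "d2": {"rough", "set", "granulation"},
    "d3": {"query", "log", "clustering"},
    "d4": {"query", "clustering", "web"},
    "d5": {"document", "clustering", "web", "search"},
}

levels = multi_level(docs, [0.3, 0.45])
for level, partition in enumerate(levels):
    print(level, [sorted(g) for g in partition])
```

Because lowering the threshold only adds links between documents, every granule at a finer level is contained in some granule at the next coarser level, so the levels form a hierarchy. The same sketch would apply to queries or terms by changing what the sets contain.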
