As a featured function of search engine, clustering display of search results has been proved an efficient way to organize the web resource. However, for a given query, clustering results reached by any user are totally identical. In this paper, we explored a user-friendly clustering scheme that automatically learns users’ interests and accordingly generates interest-centric clustering. The basis of this personal clustering is a keyword based topic identifier. Trained by users’ individual search histories, the identifier provides most of personal topics. Each topic will be the clustering center of the retrieved pages. The scheme proposed distinguishes the functionality of clustering from that of topic identification, which makes the clustering more personal and flexible. To evaluate the proposed scheme, we experimented with sets of synthetic data. The experimental results prove it an effective scheme for search results clustering.
[1]
W. Scott Spangler,et al.
Clustering hypertext with applications to web searching
,
2000,
HYPERTEXT '00.
[2]
Yanchun Zhang,et al.
Utilizing Hyperlink Transitivity to Improve Web Page Clustering
,
2003,
ADC.
[3]
Martin Ester,et al.
Frequent term-based text clustering
,
2002,
KDD.
[4]
Doug Beeferman,et al.
Agglomerative clustering of a search engine query log
,
2000,
KDD '00.
[5]
Mark Hansen,et al.
Using navigation data to improve IR functions in the context of web search
,
2001,
CIKM '01.
[6]
Jian Pei,et al.
CLOSET+: searching for the best strategies for mining frequent closed itemsets
,
2003,
KDD '03.