Adaptive neural network clustering of Web users

The degree of personalization that a Web site offers in presenting its services to users is an important attribute contributing to the site's popularity, and Web server access logs contain substantial data about user access patterns that can inform it. One way to personalize a site is to group users on the basis of their Web interests and then organize the site's structure according to the needs of the different groups. Two main difficulties inhibit this approach: the essentially infinite diversity of user interests and the change in these interests over time. We have developed a clustering algorithm that groups users according to their Web access patterns. The algorithm is based on the ART1 version of adaptive resonance theory. In our ART1-based algorithm, a prototype vector represents each user cluster by generalizing the URLs most frequently accessed by all cluster members. We compared our algorithm's performance with that of the traditional k-means clustering algorithm. The ART1-based technique performed better in terms of intracluster distances. We also applied the technique in a prefetching scheme that predicts future user requests.
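To make the clustering idea concrete, the following is a minimal sketch of textbook fast-learning ART1 applied to binary URL-access vectors. It is not the authors' implementation: the function name `art1_cluster`, the parameters `vigilance` and `beta`, the intersection-based prototype update, and the four-URL toy data are all assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def art1_cluster(
    patterns: Sequence[Sequence[int]],
    vigilance: float = 0.5,   # assumed similarity threshold in [0, 1]
    beta: float = 1.0,        # assumed tie-breaking constant in the choice function
    max_epochs: int = 10,
) -> Tuple[List[int], List[List[int]]]:
    """Cluster binary vectors with a simplified fast-learning ART1.

    Each row of `patterns` is one user; each column is 1 if the user
    requested that (hypothetical) URL and 0 otherwise.
    Returns (cluster index per user, prototype vector per cluster).
    Users with an all-zero vector keep the index -1.
    """
    prototypes: List[List[int]] = []
    assignments = [-1] * len(patterns)

    for _ in range(max_epochs):
        changed = False
        for i, x in enumerate(patterns):
            norm_x = sum(x)
            if norm_x == 0:
                continue  # nothing to match on

            # Rank existing clusters by the choice function |x AND w| / (beta + |w|).
            ranked = []
            for j, w in enumerate(prototypes):
                overlap = sum(a & b for a, b in zip(x, w))
                ranked.append((overlap / (beta + sum(w)), j, overlap))
            ranked.sort(key=lambda t: (-t[0], t[1]))

            # Vigilance test: accept the first cluster whose prototype
            # covers a large enough fraction of the user's accessed URLs.
            winner = None
            for _, j, overlap in ranked:
                if overlap / norm_x >= vigilance:
                    winner = j
                    break

            if winner is None:
                # No cluster resonates: start a new one from this user.
                prototypes.append(list(x))
                winner = len(prototypes) - 1
            else:
                # Fast learning: the prototype keeps only the shared URLs.
                prototypes[winner] = [a & b for a, b in zip(x, prototypes[winner])]

            if assignments[i] != winner:
                assignments[i] = winner
                changed = True

        if not changed:
            break  # assignments are stable

    return assignments, prototypes


# Toy data: 4 users x 4 hypothetical URLs.
users = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
]
assignments, prototypes = art1_cluster(users, vigilance=0.5)
print(assignments)  # [0, 0, 1, 1]
print(prototypes)   # [[1, 1, 0, 0], [0, 0, 0, 1]]
```

In this sketch, raising `vigilance` yields more, tighter clusters, while lowering it merges users into fewer, broader groups. Because a new cluster is created whenever no existing prototype passes the vigilance test, the number of clusters does not have to be fixed in advance, unlike k-means.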
