Mining client-side activity for personalization

"Garbage in. garbage out" is a well-known phrase in computer analysis, and one that comes to mind when mining Web data to draw conclusions about Web users. The challenge is that data analysts wish to infer patterns of client-side behavior from server-side data. However, because only a fraction of the user's actions ever reach the Web server, analysts must rely on incomplete data. In this paper, we propose a client-side monitoring system that is unobtrusive and supports flexible data collection. Moreover, the proposed framework encompasses client-side applications beyond the Web browser. Expanding monitoring beyond the browser to incorporate standard office productivity tools enables analysts to derive a much richer and more accurate picture of user behavior on the Web.

[1]  Pedro Antunes,et al.  Beyond formal processes: augmenting workflow with group interaction techniques , 1995, COCS '95.

[2]  Mark Ginsburg,et al.  PATTERN ACQUISITION TO IMPROVE ORGANIZATIONAL KNOWLEDGE AND WORKFLOW MANAGEMENT , 2001 .

[3]  Andreas Abecker,et al.  Enterprise Information Infrastructure for Active, Context-Sensitive Knowledge Delivery , 1999, European Conference on Information Systems.

[4]  Jasmine Schwartz Giving the Web a Memory Cost Its Users Privacy , 2001 .

[5]  Mark Ginsburg,et al.  A Lightweight Framework for Cross-Application User Monitoring , 2002, Computer.

[6]  A. Tuzhilin,et al.  Extending Recommender Systems : A Multidimensional Approach , 2001 .

[7]  Brian A. LaMacchia Internet fish , 1996 .

[8]  Barbara Oliboni,et al.  Modeling users' navigation history , 2001, IJCAI 2001.

[9]  Abraham Bernstein,et al.  How can cooperative work tools support dynamic group process? bridging the specificity frontier , 2000, CSCW '00.

[10]  Bamshad Mobasher,et al.  Improving the Effectiveness of Collaborative Filtering on Anonymous Web Usage Data , 2001 .

[11]  David F. Redmiles,et al.  Extracting usability information from user interface events , 2000, CSUR.

[12]  Steffen Staab,et al.  A Proactive Inferencing Agent for Desk Support , 2000 .

[13]  Balaji Padmanabhan,et al.  On Usage Metrics for Determining Authoritative Sites , 2000 .

[14]  Mark Ginsburg,et al.  HTML and CGI unleashed , 1995 .

[15]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[16]  Zhiqiang Zheng,et al.  Personalization from incomplete data: what you don't know can hurt , 2001, KDD '01.

[17]  Mark Ginsburg,et al.  Annotate: a Web-based knowledge management support system for document collections , 1998, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[18]  Roger M. Stein,et al.  Analysis of Web Site Usage Data: How Much Can We Learn About the Consumer from Web Logfiles? , 1996 .