How current web generation affects prediction algorithms performance

Web prefetching techniques have been widely used to reduce the latency perceived by users. An important part of these techniques is the prediction algorithm whose main aim is to predict the future users’ accesses based on the previous visiting patterns. For this purpose, the algorithms proposed in the open literature usually consider a training period before making predictions. The length of this period may impact on performance, either improving or degrading it. In addition, this length may involve a high amount of information and therefore important computer resources are consumed. In this work we analyze how the training affects the prediction performance using current and old web traces, still used in the recent literature. Our results show that while in old traces the training, in general, improves performance, when using recent traces this training may degrade the performance. This fact happens because those users’ visiting patterns have changed, what means that old traces are not appropriate for prediction studies in current web environments.

[1]  M. Tamer Özsu,et al.  A Web page prediction model based on click-stream tree representation of user behavior , 2003, KDD '03.

[2]  Christos Bouras,et al.  Predictive Prefetching on the Web and Its Potential Impact in the Wide Area , 2004, World Wide Web.

[3]  Themistoklis Palpanas,et al.  Web prefetching using partial match prediction , 1998 .

[4]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.

[5]  Ana Pont,et al.  Modeling continuous changes of the user's dynamic behavior in the WWW , 2005, WOSP '05.

[6]  D. M. Hutton,et al.  Web Dynamics - Adapting to Change in Content, Size, Topology and Use , 2006 .

[7]  Yannis Manolopoulos,et al.  A Data Mining Algorithm for Generalized Web Prefetching , 2003, IEEE Trans. Knowl. Data Eng..

[8]  Yiu-Kai Ng,et al.  A Client-Based Web Prefetching Management System Based on Detection Theory , 2004, WCW.

[9]  Ana Pont,et al.  An experimental framework for testing Web prefetching techniques , 2004, Proceedings. 30th Euromicro Conference, 2004..

[10]  Brian D. Davison Learning Web Request Patterns , 2004, Web Dynamics.

[11]  Junyi Shen,et al.  A new Markov model for Web access prediction , 2002, Comput. Sci. Eng..

[12]  Jeffrey C. Mogul,et al.  Using predictive prefetching to improve World Wide Web latency , 1996, CCRV.

[13]  Ana Pont,et al.  About the Heterogeneity of Web Prefetching Performance Key Metrics , 2004, INTELLCOMM.

[14]  Xin Chen,et al.  A Popularity-Based Prediction Model for Web Prefetching , 2003, Computer.