Analyzing the browse patterns of mobile clients

We study the dynamics of a large popular commercial Web site designed specifically for users who access it via their cell-phones and PDAs. Unlike most previous Web studies that have analyzed accesses seen by proxies and servers from clients connected via the wired network, we focus primarily on client accesses made over wireless channels and made for downloading content on small devices for offline browsing. We carry out user-behavior analysis as users authenticate themselves before accessing and then every access is logged with a unique user identifier. Using browser traces gathered over a period of 12 days, we perform detailed content analysis, document popularity analysis and server load analysis. We answer questions like what sorts of content wireless users are most interested in, when and how much load they put on the servers, and how much time they spend on the channel while accessing the Web wirelessly. We discuss the implications of our findings for techniques such as query caching, server scheduling, channel use and TCP optimization.

[1]  Thomas Kunz,et al.  WAP traffic: description and comparison to WWW traffic , 2000, MSWIM '00.

[2]  Yin Zhang,et al.  Speeding Up Short Data Transfers: Theory, Architectural Support, and Simulation Results , 2000 .

[3]  Terence Kelly,et al.  Thin-client Web access patterns: Measurements from a cache-busting proxy , 2002, Comput. Commun..

[4]  Lili Qiu,et al.  The content and access dynamics of a busy Web site: findings and implications , 2000 .

[5]  Randy H. Katz,et al.  TCP Fast Start: A Technique For Speeding Up Web Transfers , 1998 .

[6]  Martin Arlitt,et al.  Workload Characterization of the 1998 World Cup Web Site , 1999 .

[7]  Hiroshi Tsuji,et al.  Memory-Based Architecture for Distributed WWW Caching Proxy , 1998, Comput. Networks.

[8]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[9]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[10]  Paul Barford,et al.  Changes in Web Client Access Patterns , 1998, The Web Conference.

[11]  Mark Crovella,et al.  Characteristics of WWW Client-based Traces , 1995 .

[12]  Steven Glassman,et al.  A Caching Relay for the World Wide Web , 1994, Comput. Networks ISDN Syst..

[13]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[14]  G. Voelker,et al.  On the scale and performance of cooperative Web proxy caching , 2000, OPSR.