The WWW is currently the hottest testbed for future interactive digital systems. While much is understood technically about how the WWW functions, substantially less is known about how the technology is used, both collectively and by individuals. This disparity in knowledge is largely a direct consequence of the decentralized nature of the Web. Because users of the Web are not uniquely identifiable across the system, and because the system employs various levels of caching, measuring actual usage is problematic. This paper establishes terminology to frame the problem of reliably determining usage of WWW resources, and reviews current practices and their shortcomings. A review of the various metrics and analyses that can be performed to determine usage is then presented. This is followed by a discussion of the strengths and weaknesses of the hit-metering proposal [8] currently under consideration by the HTTP working group. Lastly, new proposals based upon server-side sampling are introduced and assessed against the hit-metering proposal. It is argued that server-side sampling provides more reliable and useful usage data while requiring no change to the current HTTP protocol and enhancing user privacy.
[1] James E. Pitkow, et al. Characterizing Browsing Behaviors on the World-Wide Web, 1995.
[2] P. J. Green, et al. Probability and Statistical Inference, 1978.
[3] Bernard P. Zajac, et al. Pretty good privacy, 1994.
[4] Roy Fielding. RFC 2068: Hypertext Transfer Protocol-HTTP/1.1, 1997.
[5] Mark Guzdial, et al. Characterizing Process Change Using Log File Data, 1993.
[6] Umeshwar Dayal, et al. From User Access Patterns to Dynamic Hypertext Linking, 1996, Comput. Networks.
[7] Donna L. Hoffman, et al. New metrics for new media: toward the development of Web measurement standards, 1997, World Wide Web J.
[8] Roger W. Schvaneveldt, et al. Pathfinder associative networks: studies in knowledge organization, 1990.
[9] Ramana Rao, et al. Silk from a sow's ear: extracting usable structures from the Web, 1996, CHI.
[10] Linda Marie Tauscher. Evaluating History Mechanisms: An Empirical Study of Reuse Patterns in World Wide Web Navigation, 1996.
[11] James E. Pitkow, et al. Results from the Third WWW User Survey, 1996, World Wide Web J.