SpeedTracer, a World Wide Web usage mining and analysis tool, was developed to understand user surfing behavior by exploring the Web server log files with data mining techniques. As the popularity of the Web has exploded, there is a strong desire to understand user surfing behavior. However, it is difficult to perform user-oriented data mining and analysis directly on the server log files because they tend to be ambiguous and incomplete. With innovative algorithms, SpeedTracer first identifies user sessions by reconstructing user traversal paths. It does not require “cookies” or user registration for session identification. User privacy is protected. Once user sessions are identified, data mining algorithms are then applied to discover the most common traversal paths and groups of pages frequently visited together. Important user browsing patterns are manifested through the frequent traversal paths and page groups, helping the understanding of user surfing behavior. Three types of reports are prepared: user-based reports, path-based reports and group-based reports. In this paper, we describe the design of SpeedTracer and demonstrate some of its features with a few sample reports.
[1]
Philip S. Yu,et al.
An effective hash-based algorithm for mining association rules
,
1995,
SIGMOD '95.
[2]
Ramakrishnan Srikant,et al.
Fast Algorithms for Mining Association Rules in Large Databases
,
1994,
VLDB.
[3]
Ramakrishnan Srikant,et al.
Mining sequential patterns
,
1995,
Proceedings of the Eleventh International Conference on Data Engineering.
[4]
W. Gropp,et al.
Does increased aromatase activity in adipose fibroblasts cause low sexual desire in patients with HIV lipodystrophy?
,
2002,
Sexually transmitted infections.
[5]
Philip S. Yu,et al.
Data mining for path traversal patterns in a web environment
,
1996,
Proceedings of 16th International Conference on Distributed Computing Systems.
[6]
Ramana Rao,et al.
Silk from a sow's ear: extracting usable structures from the Web
,
1996,
CHI.
[7]
Jiawei Han,et al.
Discovery of Multiple-Level Association Rules from Large Databases
,
1995,
VLDB.
[8]
Tomasz Imielinski,et al.
Mining association rules between sets of items in large databases
,
1993,
SIGMOD Conference.