WebFrame: In Pursuit of Computationally and Cognitively Efficient Web Mining

The goal of web mining is relatively simple: provide both computationally and cognitively efficient methods for improving the value of information to users of the WWW. The need for computational efficiency is well-recognized by the data mining community, which sprung from the database community concern for efficient manipulation of large datasets. The motivation for cognitive efficiency is more elusive but at least as important. In as much as cognitive efficiency can be informally construed as ease of understanding, then what is important is any tool or technique that presents cognitively manageable abstractions of large datasets.We present our initial development of a framework for gathering, analyzing, and redeploying web data. Not dissimilar to conventional data mining, the general idea is that good use of web data first requires the careful selection of data (both usage and content data), the deployment of appropriate learning methods, and the evaluation of the results of applying the results of learning in a web application. Our framework includes tools for building, using, and visualizing web abstractions.We present an example of the deployment of our framework to navigation improvement. The abstractions we develop are called Navigation Compression Models (NCMs), and we show a method for creating them, using them, and visualizing them to aid in their understanding.

[1]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[2]  Edward R. Tufte,et al.  The Visual Display of Quantitative Information , 1986 .

[3]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[4]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[5]  B. Marx The Visual Display of Quantitative Information , 1985 .

[6]  Piotr Kaczmarek,et al.  Applications of isometric projection for visualizing web sites , 2001 .

[7]  Padhraic Smyth,et al.  Visualization of navigation patterns on a Web site using model-based clustering , 2000, KDD '00.

[8]  Michael D. Smith,et al.  Using Path Profiles to Predict HTTP Requests , 1998, Comput. Networks.

[9]  Rick Stout Web Site Stats: Tracking Hits and Analyzing Web Traffic , 1996 .

[10]  Jock D. Mackinlay,et al.  Visualizing the evolution of Web ecologies , 1998, CHI.

[11]  Ed Huai-hsin Chi,et al.  Sensemaking of evolving Web sites using visualization spreadsheets , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).

[12]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[13]  Ingrid Zukerman,et al.  Predicting users' requests on the WWW , 1999 .

[14]  Oren Etzioni,et al.  Adaptive Web Sites: Automatically Synthesizing Web Pages , 1998, AAAI/IAAI.

[15]  Tamara Munzner,et al.  Visualizing the structure of the World Wide Web in 3D hyperbolic space , 1995, VRML '95.

[16]  Download Book,et al.  Information Visualization in Data Mining and Knowledge Discovery , 2001 .