A Web Usage Mining for Modeling Buying Behavior at a Web Store using Network Analysis

Understanding visitors’ invisible behaviors and responding with appropriate answers are important issues in continually increasing online market. To promote online transactions, customers’ behavior should be predicted correctly to keep low purchase conversion rate. In this study, we suggest an approach based on the idea that customers’ sessions in a web store can be transformed into the structure of a graph, which are represented as density of a session based on a graph theory. Online users visit lots of sites and their activities include information acquisition and browsing. The history of these activities can be used to construct a relationship network among web sites. This study analyzes this visit history made by website visitors with graph theory. The density of a network refers to the differentiated degrees of relationship among objects. In this study, we dichotomize into “purchase” and “no purchase group” since predicting whether a customer will buy or not buy our products is an important issue in web stores. We collect data on sessions which are a sequence of page views or a period of sustained web browsing. We model the sessions on the basis of density of a graph, which resulted in DOS (Density of a Session). The performance of other predictors including DOS is compared to that of suggested method in this study. Predictors are TVT (Total Visit Time during a period of a visit), AVT (The Average Time per Page Viewed), TNC (Total Number of Clicks), TPP (Total Number of Product-Related Pages Viewed), and DOS (Density of a Session Based on Graph Analysis). The study found that all predictors except total visit time are useful to differentiate between “purchase” and “no purchase” group. And we conducted Logit Analysis to examine the performance of each purchase prediction method. The results from Logit Analysis show that DOS predicts purchase behavior better in comparison with other predictors. It means understanding customers’ sessions with respect to a graph structure is useful to predict whether a customer will buy or not buy products in a web store.

[1]  Dirk Van den Poel,et al.  Predicting online-purchasing behaviour , 2005, Eur. J. Oper. Res..

[2]  Mark S. Granovetter Economic Action and Social Structure: The Problem of Embeddedness , 1985, American Journal of Sociology.

[3]  Kathy Hammond,et al.  Internet usage: Predictors of Active Users and Frequency of Use , 2000 .

[4]  Limsoon Wong,et al.  DATA MINING TECHNIQUES , 2003 .

[5]  R. Bucklin,et al.  Modeling Purchase Behavior at an E-Commerce Web Site: A Task-Completion Approach , 2004 .

[6]  Gerald L. Lohse,et al.  Predictors of online buying behavior , 1999, CACM.

[7]  C. Narasimhan,et al.  Customer Profitability in a Supply Chain , 2001 .

[8]  Chris Janiszewski,et al.  The Influence of Display Characteristics on Visual Exploratory Search Behavior , 1998 .

[9]  Gerald L. Lohse,et al.  Consumer Buying Behavior on the Internet: Findings from Panel Data , 2000 .

[10]  Sunil Gupta,et al.  Choice and the Internet: From Clickstream to Research Stream , 2002 .

[11]  John Riedl,et al.  E-Commerce Recommendation Applications , 2004, Data Mining and Knowledge Discovery.

[12]  K. Shimizu,et al.  The effect of pfl gene knockout on the metabolism for optically pure d-lactate production by Escherichia coli , 2004, Applied Microbiology and Biotechnology.

[13]  B. H. Mayhew, Structuralism Versus Individualism: Part 1, Shadowboxing in the Dark , 1980 .

[14]  Kannan Srinivasan,et al.  Modeling Online Browsing and Path Analysis Using Clickstream Data , 2004 .

[15]  R. Burt,et al.  Applied Network Analysis: A Methodological Introduction , 1983 .

[16]  Catarina Sismeiro,et al.  A Model of Web Site Browsing Behavior Estimated on Clickstream Data , 2003 .

[17]  Malgorzata Sterna,et al.  A novel representation of graph structures in web mining and data analysis , 2005 .

[18]  Wendy W. Moe,et al.  The Influence of Goal‐Directed and Experiential Activities on Online Flow Experiences , 2003 .

[19]  A. K. Pujari,et al.  Data Mining Techniques , 2006 .

[20]  Zhiqiang Zheng,et al.  Personalization from incomplete data: what you don't know can hurt , 2001, KDD '01.

[21]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[22]  Paul D. Allison,et al.  Logistic Regression Using the SAS System : Theory and Application , 1999 .

[23]  John Scott Social Network Analysis , 1988 .

[24]  M. Castells The rise of the network society , 1996 .

[25]  Peter S. Fader,et al.  Dynamic Conversion Behavior at E-Commerce Sites , 2004, Manag. Sci..

[26]  S. Raghavan,et al.  A visualization model based on adjacency data , 2002, Decision Support Systems.

[27]  Shibo Li,et al.  Modeling Category Viewership of Web Users with Multivariate Count Models , 2002 .