Using Sequence Analysis to Classify Web Usage Patterns across Websites

This study applies sequence analysis to identify the distinct web browsing patterns based on 200 China users' 30-days web usage. Our results reveal four key, unique web navigation behavior categories, namely search-information browsing, social-information browsing, ecommerce-information browsing, and direct browsing. Of these, the ratio of ecommerce activities in the social-information cluster is higher than the others, with the exception of the ecommerce-information cluster. To test the robustness of the proposed method based on our classification, we also summarize the characteristics of each category after they were segmented according to two demographic indicators, i.e. gender and occupation. Different online shopping behaviors are also discussed through the proposed classified groups. Complementing the extant methods which are based on within-website categorization of consumers, the demonstration of the sequence analysis application to e-commerce affords a deeper, integrated understanding of an individual's online activity and behavior (i.e., navigation across multiple websites).

[1]  MAGDALINI EIRINAKI,et al.  Web mining for web personalization , 2003, TOIT.

[2]  T. Joachims WebWatcher : A Tour Guide for the World Wide Web , 1997 .

[3]  Diana Gosálvez Prados,et al.  Six ways to make Web 2.0 work , 2009 .

[4]  Scott A. Neslin,et al.  Next-product-to-buy models for cross-selling applications , 2002 .

[5]  Maurice D. Mulvenna,et al.  Personalization on the Net using Web mining: introduction , 2000, CACM.

[6]  Rajiv Sabherwal,et al.  An Empirical Taxonomy of Implementation Processes Based on Sequences of Events in Information System Development , 1993 .

[7]  Andrew Abbott,et al.  A Primer on Sequence Methods , 1990 .

[8]  K. Sheehan,et al.  An investigation of gender differences in on-line privacy concerns and resultant behaviors , 1999 .

[9]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[10]  Thorsten Joachims,et al.  Web Watcher: A Tour Guide for the World Wide Web , 1997, IJCAI.

[11]  A. Abbott,et al.  Sequence Analysis and Optimal Matching Methods in Sociology , 2000 .

[12]  Osmar R. Zaïane,et al.  Web Usage Mining for a Better Web-Based Learning Environment , 2001 .

[13]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[14]  Anupam Joshi,et al.  On Mining Web Access Logs , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[15]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[16]  A. Kaplan,et al.  Users of the world, unite! The challenges and opportunities of Social Media , 2010 .

[17]  Thompson S. H. Teo Differential effects of occupation on Internet usage , 1998, Internet Res..

[18]  Phyllis Schumacher,et al.  Internet Use Among Female and Male College Students , 2000, Cyberpsychology Behav. Soc. Netw..

[19]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[20]  Sara A. Bly,et al.  Media spaces: bringing people together in a video, audio, and computing environment , 1993, CACM.

[21]  Mary Blair-Loy Career Patterns of Executive Women in Finance: An Optimal Matching Analysis1 , 1999, American Journal of Sociology.

[22]  Marshall Scott Poole,et al.  Decision Development in Computer-Assisted Group Decision Making , 1995 .

[23]  A. Abbott Sequence analysis: new methods for old ideas , 1995 .

[24]  Tao Luo,et al.  Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization , 2004, Data Mining and Knowledge Discovery.

[25]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[26]  Atreyi Kankanhalli,et al.  Customers’ preference of online store visit strategies: an investigation of demographic variables , 2010, Eur. J. Inf. Syst..

[27]  Peter S. Fader,et al.  Dynamic Conversion Behavior at E-Commerce Sites , 2004, Manag. Sci..

[28]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.

[29]  Eric B. Weiser,et al.  Gender Differences in Internet Use Patterns and Internet Application Preferences: A Two-Sample Comparison , 2000, Cyberpsychology Behav. Soc. Netw..

[30]  J. Devereux,et al.  A comprehensive set of sequence analysis programs for the VAX , 1984, Nucleic Acids Res..

[31]  A. Abbott,et al.  Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers , 1990, American Journal of Sociology.

[32]  Munmun De Choudhury Modeling and predicting group activity over time in online social media , 2009, HT '09.

[33]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[34]  M. Savage,et al.  Ascription into Achievement: Models of Career Systems at Lloyds Bank, 1890-1970 , 1996, American Journal of Sociology.

[35]  Clarke Wilson Analysis of Travel Behavior Using Sequence Alignment Methods , 1998 .

[36]  Dirk Van den Poel,et al.  Predicting home-appliance acquisition sequences: Markov/Markov for Discrimination and survival analysis for modeling sequential information in NPTB models , 2007, Decis. Support Syst..

[37]  Brendan Halpin,et al.  Class careers as sequences : An optimal matching analysis of work-life histories , 1998 .

[38]  France Bélanger,et al.  Gender differences in perceptions of web-based shopping , 2002, CACM.

[39]  L A Lillard,et al.  Simultaneous equations for hazards: marriage duration and fertility timing. , 1993, Journal of econometrics.

[40]  R. Dholakia,et al.  Factors Driving Consumer Intention to Shop Online: An Empirical Investigation , 2003 .

[41]  Martha G. Russell,et al.  The Impact of Perceived Channel Utilities, Shopping Orientations, and Demographics on the Consumer's Online Buying Behavior , 2006, J. Comput. Mediat. Commun..