Mining Customer Behavior in Trial Period of a Web Application Usage - Case Study

This paper proposes models for predicting customer conversion from trial account to full paid account of web application. Two models are proposed with focus on content of the application and time. In order to make a customer’s behavior prediction, data is extracted from web application’s usage log in trial period and processed with data mining techniques. For both models, content and time based, the same selected classification algorithms are used: decision trees, Naive Bayes, k-Nearest Neighbors and One Rule classification. Additionally, a cluster algorithm k-means is used to see if clustering by two clusters (for converted and not-converted users) can be formed and used for classification. Results showed high accuracy of classification algorithms in early stage of trial period which can serve as a basis for an identification of users that are likely to abandon the application and not convert.

[1]  Eric W. T. Ngai,et al.  Customer churn prediction using improved balanced random forests , 2009, Expert Syst. Appl..

[2]  Serhat Güden,et al.  Online Shopping Customer Data Analysis by Using Association Rules and Cluster Analysis , 2013, ICDM.

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  George Dimitoglou,et al.  Comparison of the C4.5 and a Naive Bayes Classifier for the Prediction of Lung Cancer Survivability , 2012, ArXiv.

[5]  Chin-Feng Lee,et al.  On-line personalized sales promotion in electronic commerce , 2004, Expert Syst. Appl..

[6]  D. Jayalatchumy Web Mining Research Issues and Future Directions - A Survey , 2013 .

[7]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[8]  Tina R. Patil,et al.  Performance Analysis of Naive Bayes and J 48 Classification Algorithm for Data Classification , 2013 .

[9]  Neeraj Bhargava,et al.  Decision Tree Analysis on J48 Algorithm for Data Mining , 2013 .

[10]  Jian Pei,et al.  2012- Data Mining. Concepts and Techniques, 3rd Edition.pdf , 2012 .

[11]  Opher Etzion,et al.  e-CLV: A Modeling Approach for Customer Lifetime Evaluation in e-Commerce Domains, with an Application and Case Study for Online Auction , 2005, Inf. Syst. Frontiers.

[12]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[13]  Mohammad Mehdi Sepehri,et al.  Applying Data Mining to Customer Churn Prediction in an Internet Service Provider , 2010 .

[14]  Michel Ballings,et al.  Customer event history for churn prediction: How long is long enough? , 2012, Expert Syst. Appl..

[15]  James A. M. McHugh,et al.  Mining the World Wide Web: An Information Search Approach , 2001 .

[16]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[17]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[18]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[19]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[20]  Ketul M. Patel,et al.  Process of Web Usage Mining to find Interesting Patterns from Web Usage Data , 2012, BIOINFORMATICS 2012.

[21]  R. J. Kuo,et al.  Integration of ART2 neural network and genetic K-means algorithm for analyzing Web browsing paths in electronic commerce , 2005, Decis. Support Syst..