Real-time prediction of online shoppers’ purchasing intention using multilayer perceptron and LSTM recurrent neural networks

In this paper, we propose a real-time online shopper behavior analysis system consisting of two modules which simultaneously predicts the visitor’s shopping intent and Web site abandonment likelihood. In the first module, we predict the purchasing intention of the visitor using aggregated pageview data kept track during the visit along with some session and user information. The extracted features are fed to random forest (RF), support vector machines (SVMs), and multilayer perceptron (MLP) classifiers as input. We use oversampling and feature selection preprocessing steps to improve the performance and scalability of the classifiers. The results show that MLP that is calculated using resilient backpropagation algorithm with weight backtracking produces significantly higher accuracy and F1 Score than RF and SVM. Another finding is that although clickstream data obtained from the navigation path followed during the online visit convey important information about the purchasing intention of the visitor, combining them with session information-based features that possess unique information about the purchasing interest improves the success rate of the system. In the second module, using only sequential clickstream data, we train a long short-term memory-based recurrent neural network that generates a sigmoid output showing the probability estimate of visitor’s intention to leave the site without finalizing the transaction in a prediction horizon. The modules are used together to determine the visitors which have purchasing intention but are likely to leave the site in the prediction horizon and take actions accordingly to improve the Web site abandonment and purchase conversion rates. Our findings support the feasibility of accurate and scalable purchasing intention prediction for virtual shopping environment using clickstream and session information data.

[1]  Ahmad Taher Azar,et al.  Fast neural network learning algorithms for medical applications , 2012, Neural Computing and Applications.

[2]  Brad Warner,et al.  Understanding Neural Networks as Statistical Tools , 1996 .

[3]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[4]  Payal,et al.  Predicting User’s Web Navigation Behavior Using Hybrid Approach , 2015 .

[5]  P. Cimiano,et al.  Comparing Hidden Markov Models and Long Short Term Memory Neural Networks for Learning Action Representations , 2016, MOD.

[6]  Alok Gupta,et al.  GIST: A Model for Design and Management of Content and Interactivity of Customer-Centric Web Sites , 2004, MIS Q..

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[9]  A. Kau,et al.  Typology of online shoppers , 2003 .

[10]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[11]  Issa M. Khalil,et al.  Prediction of User's Web-Browsing Behavior: Application of Markov Model , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Ricardo Filipe Fernandes e Costa Magalhães Teixeira,et al.  Using clickstream data to analyze online purchase intentions , 2015 .

[13]  Grzegorz Chodak,et al.  Using association rules to assess purchase probability in online stores , 2016, Information Systems and e-Business Management.

[14]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[15]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[16]  José Luis Rojo-Álvarez,et al.  Support vector machines in engineering: an overview , 2014, WIREs Data Mining Knowl. Discov..

[17]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[18]  Chee Peng Lim,et al.  A review of online learning in supervised neural networks , 2014, Neural Computing and Applications.

[19]  Wendy W. Moe,et al.  The Influence of Goal‐Directed and Experiential Activities on Online Flow Experiences , 2003 .

[20]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[21]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[22]  Tao Luo,et al.  Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization , 2004, Data Mining and Knowledge Discovery.

[23]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[24]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[25]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[26]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Amy Wenxuan Ding,et al.  Learning User Real-Time Intent for Optimal Dynamic Web Page Transformation , 2015, Inf. Syst. Res..

[28]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[29]  Muhammad Muazzem Hossain,et al.  Why do shoppers abandon shopping cart? Perceived waiting time, risk, and transaction inconvenience , 2009 .

[30]  Zachary Chase Lipton A Critical Review of Recurrent Neural Networks for Sequence Learning , 2015, ArXiv.

[31]  Alexandros Karatzoglou,et al.  Session-based Recommendations with Recurrent Neural Networks , 2015, ICLR.

[32]  Brian Clifton,et al.  Advanced Web Metrics with Google Analytics , 2008 .

[33]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[34]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[35]  Shuai Li,et al.  Distributed Recurrent Neural Networks for Cooperative Control of Manipulators: A Game-Theoretic Perspective , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Hong Gu,et al.  Imbalanced classification using support vector machine ensemble , 2011, Neural Computing and Applications.

[37]  M. J. del Jesus,et al.  Web usage mining to improve the design of an e-commerce website: OrOliveSur.com , 2012, Expert Syst. Appl..

[38]  Wolfram Schiffmann,et al.  Optimization of the Backpropagation Algorithm for Training Multilayer Perceptrons , 1994 .

[39]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[40]  YuQing Miao,et al.  Predicting the Next Scenic Spot a User Will Browse on a Tourism Website Based on Markov Prediction Model , 2013, 2013 IEEE 25th International Conference on Tools with Artificial Intelligence.

[41]  Grażyna Suchacka,et al.  A k-Nearest Neighbors Method for Classifying User Sessions in E-Commerce Scenario , 2015 .

[42]  Grazyna Suchacka,et al.  Classification Of E-Customer Sessions Based On Support Vector Machine , 2015, ECMS.

[43]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[44]  Jaewon Kang,et al.  Online Shopping Hesitation , 2006, Cyberpsychology Behav. Soc. Netw..

[45]  Fikret S. Gürgen,et al.  A feature selection method based on kernel canonical correlation analysis and the minimum Redundancy-Maximum Relevance filter method , 2012, Expert Syst. Appl..

[46]  Jordi Torres,et al.  Web Customer Modeling for Automated Session Prioritization on High Traffic Sites , 2007, User Modeling.

[47]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[48]  Germanas Budnikas,et al.  Computerised Recommendations on E-Transaction Finalisation by Means of Machine Learning , 2015 .

[49]  Stefan Fritsch,et al.  neuralnet: Training of Neural Networks , 2010, R J..

[50]  Shuai Li,et al.  Kinematic Control of Redundant Manipulators Using Neural Networks , 2017, IEEE Transactions on Neural Networks and Learning Systems.