Distributed model for customer churn prediction using convolutional neural network

Purpose The purpose of the proposed model is to assist the e-business to predict the churned users using machine learning. This paper aims to monitor the customer behavior and to perform decision-making accordingly. Design/methodology/approach The proposed model uses the 2-D convolutional neural network (CNN; a technique of deep learning). The proposed model is a layered architecture that comprises two different phases that are data load and preprocessing layer and 2-D CNN layer. In addition, the Apache Spark parallel and distributed framework is used to process the data in a parallel environment. Training data is captured from Kaggle by using Telco Customer Churn. Findings The proposed model is accurate and has an accuracy score of 0.963 out of 1. In addition, the training and validation loss is extremely less, which is 0.004. The confusion matric results show the true-positive values are 95% and the true-negative values are 94%. However, the false-negative is only 5% and the false-positive is only 6%, which is effective. Originality/value This paper highlights an inclusive description of preprocessing required for the CNN model. The data set is addressed more carefully for the successful customer churn prediction.

[1]  M. Stephens,et al.  A simple new approach to variable selection in regression, with application to genetic fine mapping , 2020, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[2]  Hui Liu,et al.  A distributed computing framework for wind speed big data forecasting on Apache Spark , 2020 .

[3]  In Lee,et al.  Machine learning for enterprises: Applications, algorithm selection, and challenges , 2020 .

[4]  Donghyun Kim,et al.  Predicting User Purchase in E-commerce by Comprehensive Feature Engineering and Decision Boundary Focused Under-Sampling , 2015, RecSys Challenge.

[5]  T. Jung,et al.  Exploring Consumer Behavior in Virtual Reality Tourism Using an Extended Stimulus-Organism-Response Model , 2018, Journal of Travel Research.

[6]  Dirk Van den Poel,et al.  Improving Purchasing Behavior Predictions by Data Augmentation with Situational Variables , 2010, Int. J. Inf. Technol. Decis. Mak..

[7]  Jian Wang,et al.  Opportunity model for e-commerce recommendation: right product; right time , 2013, SIGIR.

[8]  Herbert Kimura,et al.  Literature review: Machine learning techniques applied to financial market prediction , 2019, Expert Syst. Appl..

[9]  Analytic-thinking predicts hoax beliefs and helping behaviors in response to the COVID-19 pandemic , 2020 .

[10]  Monica Khanna,et al.  Factors Influencing Online Buying Behavior of College Students: A Qualitative Analysis , 2016 .

[11]  Mariarosaria Taddeo,et al.  Artificial Intelligence and the ‘Good Society’: the US, EU, and UK approach , 2016, Sci. Eng. Ethics.

[12]  Basar Oztaysi,et al.  A decision model for information technology selection using AHP integrated TOPSIS-Grey , 2014 .

[13]  Ankur Taly,et al.  Explainable machine learning in deployment , 2020, FAT*.

[14]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[15]  Mohamed Medhat Gaber,et al.  Random forests: from early developments to recent advancements , 2014 .

[16]  Seung-won Hwang,et al.  Predicting Online Purchase Conversion for Retargeting , 2017, WSDM.

[17]  Dhruv Grewal,et al.  How artificial intelligence will change the future of marketing , 2019, Journal of the Academy of Marketing Science.

[18]  Laura Maria Badea Predicting Consumer Behavior with Artificial Neural Networks , 2014 .

[19]  Robin Singh Bhadoria,et al.  Predictive analytics in data science for business intelligence solutions , 2017, 2017 7th International Conference on Communication Systems and Network Technologies (CSNT).