A data modeling approach for classification problems: application to bank telemarketing prediction

In this paper, we present a new data modeling approach for five common classification algorithms to optimize the prediction of telemarketing target calls for selling bank long-term deposits. A Portuguese retail bank addressed, from 2008 until 2013, data on its clients, products and social-economic attributes including the effects of the financial crisis. An original set of 150 features has been explored and 21 features are retained for the proposed approach including label. This paper introduces a new modeling approach that preprocessed separately each type of features and normalize them to optimize prediction performance. To evaluate the proposed approach, this paper compares the results obtained with five most known machine learning techniques: Naïve Bayes (NB), Logistic Regression (LR), Decision Trees (DT), Artificial Neural Network (ANN) and Support Vector Machines (SVM) and it yielded better improved performances for all these algorithms in terms of accuracy and f-measure.

[1]  Z. Chu Bank Marketing with Machine Learning , 2015 .

[2]  Masao Ueki,et al.  SPARSE PREDICTIVE MODELING FOR BANK TELEMARKETING SUCCESS USING SMOOTH-THRESHOLD ESTIMATING EQUATIONS , 2015 .

[3]  Walid Cherif,et al.  Optimization of K-NN algorithm by clustering and reliability coefficients: application to breast-cancer diagnosis , 2018 .

[4]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[5]  Xiaoyuan Zhang,et al.  Support vector machine with parameter optimization by a novel hybrid method and its application to fault diagnosis , 2015, Neurocomputing.

[6]  B. Radhakrishnan,et al.  Application of Data Mining In Marketing , 2013, ArXiv.

[7]  Ana S. Camanho,et al.  Predicting direct marketing response in banking: comparison of class imbalance methods , 2017 .

[8]  Anshul Goyal,et al.  Performance Comparison of Naïve Bayes and J 48 Classification Algorithms , 2012 .

[9]  Paulo Cortez,et al.  Using data mining for bank direct marketing: an application of the CRISP-DM methodology , 2011 .

[10]  M. Sudheep Elayidom,et al.  An Efficient CRM-Data Mining Framework for the Prediction of Customer Behaviour☆ , 2015 .

[11]  A. Elsalamony,et al.  Bank Direct Marketing Based on Neural Network and C5.0 Models , 2013 .

[12]  Paulo Cortez,et al.  A data-driven approach to predict the success of bank telemarketing , 2014, Decis. Support Syst..

[13]  Songul Kakilli Acaravci,et al.  Using Data Mining Techniques for Detecting the Important Features of the Bank Direct Marketing Data , 2017 .

[14]  E. Venkatesan,et al.  Performance Analysis of Decision Tree Algorithms for Breast Cancer Classification , 2015 .

[15]  Daniel Grzonka,et al.  Application of selected supervised classification methods to bank marketing campaign , 2016 .

[16]  Walid Cherif,et al.  Hybrid Reliability-Similarity-Based Approach for Supervised Machine Learning , 2018 .

[17]  Sven F. Crone,et al.  The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing , 2006, Eur. J. Oper. Res..

[18]  Paulo Cortez,et al.  Using customer lifetime value and neural networks to improve the prediction of bank deposit subscription in telemarketing campaigns , 2014, Neural Computing and Applications.

[20]  Anirut Suebsing,et al.  Feature Selection with Data Balancing for Prediction of Bank Telemarketing , 2014 .

[21]  Anver Muhammed Application of Data Mining In Marketing Application of Data Mining In Marketing Application of Data Mining In Marketing Application of Data Mining In Marketing , 2013 .

[22]  Belete Biazen Bezabeh The application of data mining techniques to support customer relationship management: the case of ethiopian revenue and customs authority , 2017, ArXiv.

[23]  Hany A. Elsalamony,et al.  Bank Direct Marketing Analysis of Data Mining Techniques , 2014 .

[24]  D. Y. Sha,et al.  Linking innovative product development with customer knowledge: a data-mining approach , 2006 .