Customer Value Prediction in Direct Marketing Using Hybrid Support Vector Machine Rule Extraction Method

Data mining techniques can help companies identify the customers who generate the highest revenue in a direct marketing campaign. Most commonly, customer value is evaluated by uniformly segmenting customers (20% in each segment) according to buying behavior described by recency, frequency, and monetary (RFM) attributes, and the segments with the highest scores on these attributes are then subjectively selected for direct campaigns. This paper proposes k-means clustering on the RFM attributes, which allows customer value to be determined more objectively. The most valuable customers are, as a rule, the smallest cluster, so a class imbalance problem arises. To overcome this problem, a hybrid Support Vector Machine Rule Extraction (SVM-RE) method is proposed for predicting which cluster a customer belongs to, based on data on consumer characteristics and offered products. The SVM classifier is known to be a good predictor under class imbalance, but it does not produce an interpretable model. Therefore, a Decision Tree (DT) is used to generate rules from the predictions of the SVM classifier. The results of an empirical case study show that this hybrid method predicts the customer value level with good classification performance, so existing and new buyers can be targeted efficiently for direct marketing campaigns despite the class imbalance problem. The results also show that the hybrid SVM-RE method achieves significantly better prediction accuracy than the DT method.
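
As a rough illustration of the clustering step only (not the SVM-RE step), the sketch below runs k-means on min-max scaled RFM attributes and then picks out the high-value cluster. Everything in it is an assumption made for illustration: the customer data is synthetic, k = 3 is arbitrary, the deterministic farthest-first initialization is a convenience, and the `value_score` heuristic and all function names are hypothetical rather than taken from the paper.

```python
def min_max_scale(rows):
    """Rescale each column to [0, 1] so no single RFM attribute dominates."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        tuple((v - lo) / (hi - lo) if hi > lo else 0.0
              for v, lo, hi in zip(row, lows, highs))
        for row in rows
    ]


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_first_init(points, k):
    """Deterministic start: first point, then repeatedly the point farthest
    from all centroids chosen so far (an assumption, not the paper's choice)."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        )
    return centroids


def k_means(points, k, max_iter=100):
    """Lloyd's algorithm: alternate assignment and centroid update."""
    centroids = farthest_first_init(points, k)
    labels = []
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centroids


# Synthetic customers: (recency in days, frequency, monetary value).
customers = [
    (5, 22, 2200), (8, 20, 1900), (10, 18, 1800),               # few big spenders
    (40, 9, 700), (45, 8, 650), (50, 7, 600),
    (55, 7, 550), (60, 6, 500), (48, 8, 620),                   # mid-value buyers
    (150, 2, 90), (160, 1, 60), (170, 2, 80), (180, 1, 40),
    (190, 1, 50), (200, 1, 30), (165, 2, 70), (175, 1, 45),
    (185, 1, 35),                                               # occasional buyers
]

K = 3  # assumed number of clusters
scaled = min_max_scale(customers)
labels, centroids = k_means(scaled, K)


def value_score(centroid):
    """Hypothetical heuristic: recent, frequent, high-spending is better."""
    recency, frequency, monetary = centroid
    return frequency + monetary - recency


high_value = max(range(K), key=lambda j: value_score(centroids[j]))
sizes = [labels.count(j) for j in range(K)]
print("cluster sizes:", sizes)
print("high-value cluster:", high_value, "size:", sizes[high_value])
```

On this toy data the high-value cluster is also the smallest one, which is the class imbalance the abstract refers to. The resulting cluster labels would then serve as the target classes for a classifier trained on consumer characteristics and offered products.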
