Hybrid soft computing approach based on clustering, rule mining, and decision tree analysis for customer segmentation problem: Real case of customer-centric industries

Abstract This paper proposes a hybrid soft computing approach on the basis of clustering, rule extraction, and decision tree methodology to predict the segment of the new customers in customer-centric companies. In the first module, K-means algorithm is applied to cluster the past customers of company on the basis of their purchase behavior. In the second module, a hybrid feature selection method based on filtering and a multi-attribute decision making method is proposed. Finally, On the basis of customers’ characteristics and using decision tree analysis, IF–THEN rules are mined. The proposed approach is applied in two case studies in the field of insurance and telecommunication in order to predict potentially profitable leads and outline the most influential features available to customers in order to perform this prediction. The results validate the efficacy and applicability of proposed approach to handle real-life cases.

[1]  Xi Chen,et al.  Detecting the migration of mobile service customers using fuzzy clustering , 2015, Inf. Manag..

[2]  You-Shyang Chen,et al.  Classifying the segmentation of customer value via RFM model and RS theory , 2009, Expert Syst. Appl..

[3]  Abdulkadir Hiziroglu,et al.  Soft computing applications in customer segmentation: State-of-art review and critique , 2013, Expert Syst. Appl..

[4]  Stjepan Oreski,et al.  Genetic algorithm-based heuristic for feature selection in credit risk assessment , 2014, Expert Syst. Appl..

[5]  Yongtae Park,et al.  Design of convergent product concepts based on functionality: An association rule mining and decision tree approach , 2012, Expert Syst. Appl..

[6]  Mengjie Zhang,et al.  Differential evolution for filter feature selection based on information theory and feature ranking , 2018, Knowl. Based Syst..

[7]  Chee Peng Lim,et al.  A hybrid FAM–CART model and its application to medical data classification , 2015, Neural Computing and Applications.

[8]  Qiang Zeng,et al.  Rule extraction from an optimized neural network for traffic crash frequency modeling. , 2016, Accident; analysis and prevention.

[9]  Jian Jin,et al.  A biclustering-based method for market segmentation using customer pain points , 2016, Eng. Appl. Artif. Intell..

[10]  Andrew P. Bradley,et al.  Rule extraction from support vector machines: A review , 2010, Neurocomputing.

[11]  Fei Wang,et al.  Efficient tree classifiers for large scale datasets , 2018, Neurocomputing.

[12]  Yen-Liang Chen,et al.  Using decision trees to summarize associative classification rules , 2009, Expert Syst. Appl..

[13]  James J. H. Liou,et al.  A novel decision rules approach for customer relationship management of the airline market , 2009, Expert Syst. Appl..

[14]  Dianhui Wang,et al.  Building feedforward neural networks with random weights for large scale datasets , 2018, Expert Syst. Appl..

[15]  Nima Jafari Navimipour,et al.  Customer relationship management mechanisms: A systematic review of the state of the art literature and recommendations for future research , 2016, Comput. Hum. Behav..

[16]  You-Shyang Chen,et al.  Identifying patients in target customer segments using a two-stage clustering-classification approach: A hospital-based assessment , 2012, Comput. Biol. Medicine.

[17]  Ferani E. Zulvia,et al.  An application of a metaheuristic algorithm-based clustering ensemble method to APP customer segmentation , 2016, Neurocomputing.

[18]  Chih-Fong Tsai,et al.  Combining instance selection for better missing value imputation , 2016, J. Syst. Softw..

[19]  Stjepan Oreski,et al.  Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment , 2012, Expert Syst. Appl..

[20]  Mei Bai,et al.  An efficient algorithm for distributed density-based outlier detection on big data , 2016, Neurocomputing.

[21]  Taeho Hong,et al.  Segmenting customers in online stores based on factors that affect the customer's intention to purchase , 2012, Expert Syst. Appl..

[22]  Hossein Safari,et al.  A hybrid algorithm for feature subset selection in high-dimensional datasets using FICA and IWSSr algorithm , 2015, Appl. Soft Comput..

[23]  López Griselda,et al.  Using Decision Trees to Extract Decision Rules from Police Reports on Road Accidents , 2012 .

[24]  Griselda López,et al.  Extracting decision rules from police accident reports through decision trees. , 2013, Accident; analysis and prevention.

[25]  Malin Song,et al.  Customer profitability forecasting using Big Data analytics: A case study of the insurance industry , 2016, Comput. Ind. Eng..

[26]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Ricardo Tanscheit,et al.  Fuzzy rules extraction from support vector machines for multi-class classification , 2012, Neural Computing and Applications.

[28]  Mohammad Modarres,et al.  Developing an approach to evaluate stocks by forecasting effective features with data mining methods , 2015, Expert Syst. Appl..

[29]  Yoichi Hayashi,et al.  Neural network training and rule extraction with augmented discretized input , 2016, Neurocomputing.

[30]  Mohamed Limam,et al.  A hybrid feature selection method based on instance learning and cooperative subset search , 2016, Pattern Recognit. Lett..

[31]  Der-Chiang Li,et al.  A two-stage clustering method to analyze customer characteristics to build discriminative customer management: A case of textile manufacturing business , 2011, Expert Syst. Appl..

[32]  Chun-Che Huang,et al.  Autonomous rule induction from data with tolerances in customer relationship management , 2011, Expert Syst. Appl..

[33]  Daoliang Li,et al.  A two-stage feature selection method with its application , 2015, Comput. Electr. Eng..

[34]  Andrew W. H. Ip,et al.  Customer grouping for better resources allocation using GA based clustering technique , 2012, Expert Syst. Appl..

[35]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[36]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[37]  Carlos Soares,et al.  Customer segmentation in a large database of an online customized fashion business , 2015 .

[38]  R. Thakur,et al.  Customer portfolio management (CPM) for improved customer relationship management (CRM): Are your customers platinum, gold, silver, or bronze? , 2016 .

[39]  Chui-Yu Chiu,et al.  An intelligent market segmentation system using k-means and particle swarm optimization , 2009, Expert Syst. Appl..

[40]  Hsin-Hung Wu,et al.  A case study of applying LRFM model in market segmentation of a children's dental clinic , 2012, Expert Syst. Appl..

[41]  Francisco Herrera,et al.  Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems , 2009, Appl. Soft Comput..

[42]  Lale Özbakir,et al.  Fuzzy DIFACONN-miner: A novel approach for fuzzy rule extraction from neural networks , 2013, Expert Syst. Appl..

[43]  Hsuan-Kai Chen,et al.  Customer relationship management in the hairdressing industry: An application of data mining techniques , 2013, Expert Syst. Appl..

[44]  Veronica Oliveira de Carvalho,et al.  Combining K-Means and K-Harmonic with Fish School Search Algorithm for data clustering task on graphics processing units , 2016, Appl. Soft Comput..

[45]  Juan Ramón Rico-Juan,et al.  Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation , 2018, Pattern Recognit..

[46]  Bruno Agard,et al.  Forecasting Supply Chain Demand by Clustering Customers , 2015 .

[47]  Juan de Oña,et al.  Analysis of traffic accident severity using Decision Rules via Decision Trees , 2013, Expert Syst. Appl..

[48]  D. Wood Supplier selection for development of petroleum industry facilities, applying multi-criteria decision making techniques including fuzzy and intuitionistic fuzzy TOPSIS with flexible entropy weighting , 2016 .

[49]  Mehdi Mohammadzadeh,et al.  A data mining approach for modeling churn behavior via RFM model in specialized clinics Case study: A public sector hospital in Tehran , 2017, Procedia Computer Science.

[50]  Michalis Vazirgiannis,et al.  Cluster validity methods: part I , 2002, SGMD.

[51]  Katsutoshi Yada,et al.  Clustering of Customer Shopping Paths in Japanese Grocery Stores , 2016, KES.

[52]  Parham Moradi,et al.  OSFSMI: Online stream feature selection method based on mutual information , 2017, Appl. Soft Comput..

[53]  Meltem Caber,et al.  Using data mining techniques for profiling profitable hotel customers: An application of RFM analysis , 2016 .

[54]  Alfredo Núñez,et al.  Load estimation for microgrid planning based on a self-organizing map methodology , 2017, Appl. Soft Comput..

[55]  Qinghua Hu,et al.  Rule extraction from support vector machines based on consistent region covering reduction , 2013, Knowl. Based Syst..

[56]  Indry Dessy Nurpratami,et al.  Classification Rules for Hotspot Occurrences Using Spatial Entropy-based Decision Tree Algorithm , 2015 .

[57]  Bart Baesens,et al.  Identifying next relevant variables for segmentation by using feature selection approaches , 2015, Expert Syst. Appl..

[58]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[59]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[60]  Chun-Che Huang,et al.  Rough set based rule induction in decision making using credible classification and preference from medical application perspective , 2016, Comput. Methods Programs Biomed..

[61]  Uday V. Kulkarni,et al.  Extracting classification rules from modified fuzzy min-max neural network for data with mixed attributes , 2016, Appl. Soft Comput..

[62]  Arturo Molina,et al.  A segmentation study of cinema consumers based on values and lifestyle , 2018 .

[63]  Saïd Rechak,et al.  On the extraction of rules in the identification of bearing defects in rotating machinery using decision tree , 2010, Expert Syst. Appl..

[64]  M. Esmel ElAlami,et al.  Extracting rules from trained neural network using GA for managing E-business , 2004, Appl. Soft Comput..

[65]  Lan Bai,et al.  Stochastic Gradient Twin Support Vector Machine for Large Scale Problems , 2017, Inf. Sci..

[66]  Guo Chen,et al.  A new approach for rule extraction of expert system based on SVM , 2014 .

[67]  Vadlamani Ravi,et al.  Churn prediction using comprehensible support vector machine: An analytical CRM application , 2014, Appl. Soft Comput..

[68]  Peng Wang,et al.  A novel hybrid MCDM model combining the SAW, TOPSIS and GRA methods based on experimental design , 2016, Inf. Sci..