Customer value segmentation based on cost-sensitive learning Support Vector Machine

Two types of errors, 'rejecting true and accepting false' are inevitable in customer value segmentation. The traditional data mining method which is on the total accuracy rate cannot reflect the influence caused by the great difference of misclassification costs and unbalanced quantity distribution of customers who have various values. The thesis proposes two enhancements. The first is developing the cost-sensitive Support Vector Machine (SVM) classifier by presenting misclassification cost function based on customer value, which is evaluated by the function of the exceptional lost. The second enhancement is a method of multi-category classification based on the binary classification version of SVM. The data test result proves that the method can control the different types of errors distribution with various cost of misclassification accurately, reduce the total misclassification cost and distinguish the customer value effectively.

[1]  David R. Musicant,et al.  Data Discrimination via Nonlinear Generalized Support Vector Machines , 2001 .

[2]  Zou Peng Study on method of evaluating customer profitability based on data mining , 2004 .

[3]  Linda Kaufman,et al.  Solving the quadratic programming problem arising in support vector classification , 1999 .

[4]  Duen-Ren Liu,et al.  Integrating AHP and data mining for product recommendation based on customer lifetime value , 2005, Inf. Manag..

[5]  G. Lieberman,et al.  Introduction to Mathematical Programming , 1990 .

[6]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[7]  Marek Kretowski,et al.  An Evolutionary Algorithm for Cost-Sensitive Decision Rule Learning , 2001, ECML.

[8]  Darrell K. Rigby,et al.  Avoid the four perils of CRM. , 2002, Harvard business review.

[9]  James A. Bucklew,et al.  Support vector machines and the multiple hypothesis test problem , 2001, IEEE Trans. Signal Process..

[10]  Qiang Yang,et al.  Decision trees with minimal costs , 2004, ICML.

[11]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[12]  Clive W. J. Granger,et al.  Prediction with a generalized cost of error function , 1969 .

[13]  Bart Baesens,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Bayesian Network Classifiers for Identifying the Slope of the Customer Lifecycle of Long-life Customers Bayesian Network Classifiers for Identifying the Slope of the Customer Lifecycle of Long-life Customers , 2022 .

[14]  Dirk Van den Poel,et al.  Joint optimization of customer segmentation and marketing policy to maximize long-term profitability , 2002, Expert Syst. Appl..

[15]  Nan-Chen Hsieh,et al.  An integrated data mining and behavioral scoring model for analyzing bank customers , 2004, Expert Syst. Appl..

[16]  Suzan Burton,et al.  Determinants of customer loyalty and share of wallet in retail banking , 2005 .

[17]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .