A Retail Outlet Classification Model Based on AdaBoost

This paper proposes a framework to get a stable classification rule under unsupervised learning, and the term “stable” means that the rule remains unchanged when the sample set increases. This framework initially makes use of clustering analysis and then use the result of clustering analysis as a reference-studying sample. Secondly, AdaBoost integrated several classification methods is used to classify the samples and get a stable classification rule. To prove the method feasible, this paper shows an empirical study of classifying retail outlets of a tobacco market in a city of China. In this practice, k-means is used to make clustering analysis, and AdaBoost integrated RBF neural network, CART, and SVM is used in classification. In the empirical study, this method successfully divides retail outlets into different classes based on the sales ability.

[1]  Lijun Wei,et al.  Building Behavior Scoring Model Using Genetic Algorithm and Support Vector Machines , 2007, International Conference on Computational Science.

[2]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[3]  Yi Jiang,et al.  A Comparison Study of Credit Scoring Models , 2007, Third International Conference on Natural Computation (ICNC 2007).

[4]  Jih-Jeng Huang,et al.  Marketing segmentation using support vector clustering , 2007, Expert Syst. Appl..

[5]  D. Broomhead,et al.  Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[6]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[7]  Stephen C. H. Leung,et al.  Vertical bagging decision trees model for credit scoring , 2010, Expert Syst. Appl..

[8]  J. Aczel,et al.  On Measures of Information and Their Characterizations , 2012 .

[9]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[10]  Harald Hruschka,et al.  Comparing performance of feedforward neural nets and K-means for cluster-based market segmentation , 1999, Eur. J. Oper. Res..

[11]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[12]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[13]  Defu Zhang,et al.  A Decision Tree Scoring Model Based on Genetic Algorithm and K-Means Algorithm , 2008, 2008 Third International Conference on Convergence and Hybrid Information Technology.