Rule Extraction from Support Vector Machines and Its Applications

Support Vector Machines are the state-of-the-art tools in data mining. However, their strength are also their main weakness, as the generated nonlinear models are typically regarded as incomprehensible black-box models. Therefore, opening the black-boxor making SVMs explainable became more important and necessary in areas such as medical diagnosis and credit evaluation. Rule extraction from SVMs, which is in order to make SVMs more explainable has developed during recent years. However, existing rule extracted algorithms have limitations in real applications especially when the problems are large scale with high dimensions. In this paper, we combined two feature selection techniques with rule extraction from SVMs in order to deal with this case. And we also proposed a new criteria to evaluate the extracted rules in order to rich the evaluation standards. Numerical experiments show the efficiency of our method.

[1]  Bart Baesens,et al.  Rule Extraction from Support Vector Machines: An Overview of Issues and Application in Credit Scoring , 2008, Rule Extraction from Support Vector Machines.

[2]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[3]  Jude W. Shavlik,et al.  in Advances in Neural Information Processing , 1996 .

[4]  W. Marsden I and J , 2012 .

[5]  Nahla H. Barakat,et al.  Hybrid rule-extraction from support vector machines , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[6]  Zhengxin Chen,et al.  Classifying Credit Card Accounts for Business Intelligence and Decision Making: a Multiple-criteria Quadratic Programming Approach , 2005, Int. J. Inf. Technol. Decis. Mak..

[7]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[8]  Gang Kou,et al.  Classification of HIV-I-Mediated neuronal dendritic and synaptic damage using multiple criteria linear programming , 2007, Neuroinformatics.

[9]  Jianping Li,et al.  Optimization Based Data Mining: Theory and Applications , 2011, Advanced Information and Knowledge Processing.

[10]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[11]  Zhengxin Chen,et al.  A Descriptive Framework for the Field of Data Mining and Knowledge Discovery , 2008, Int. J. Inf. Technol. Decis. Mak..

[12]  Bart Baesens,et al.  Minerva: Sequential Covering for Rule Extraction , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Zhengxin Chen,et al.  Multiple criteria mathematical programming for multi-class classification and application in network intrusion detection , 2009, Inf. Sci..

[14]  Yi Peng,et al.  Multiple criteria linear programming approach to data mining: Models, algorithm designs and software development , 2003, Optim. Methods Softw..

[15]  Yi Peng,et al.  Discovering Credit Cardholders’ Behavior by Multiple Criteria Linear Programming , 2005, Ann. Oper. Res..