Incorporating support vector machines with multiple criteria decision making for financial crisis analysis

Feature selection is an essential pre-processing technique in data mining that eliminates redundant or unrepresentative attributes and improves the performance of classifiers. However, a classifier with different feature selection approaches results in diverse outcomes. Thus, determining how to integrate feature selection methods and yield an appropriate feature set is an issue worth further study. Based on ensemble learning, this investigation develops a SVMMCDM (support vector machines with multiple criteria decision making) model that employs various feature selection techniques as data preprocessing schemes and then uses SVM for financial crisis prediction. The study uses MCDM to determine the most suitable feature selection mechanism when many performance criteria are considered. After the feature selection mechanism has been determined, the study decomposes the SVM to obtain support vectors and predicted labels which are then fed into a decision tree to generate rules. The numerical results for the ex-ante and ex-post periods relative to the financial tsunami show that the proposed SVMMCDM model is an effective way to predict a financial crisis and can provide useful rules for decision makers.

[1]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  Samia Boukir,et al.  Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests , 2011 .

[4]  Uzay Kaymak,et al.  Fuzzy criteria for feature selection , 2012, Fuzzy Sets Syst..

[5]  Gaurav Kapoor,et al.  Detecting evolutionary financial statement fraud , 2011, Decis. Support Syst..

[6]  Gang Kou,et al.  An empirical study of classification algorithm evaluation for financial risk prediction , 2011, Appl. Soft Comput..

[7]  Boubakeur Boufama,et al.  A novel SVM+NDA model for classification with an application to face recognition , 2012, Pattern Recognit..

[8]  Ping-Feng Pai,et al.  Support Vector Machines with Simulated Annealing Algorithms in Electricity Load Forecasting , 2005 .

[9]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[10]  Thomas Marill,et al.  On the effectiveness of receptors in recognition systems , 1963, IEEE Trans. Inf. Theory.

[11]  Andrew P. Bradley,et al.  Rule extraction from support vector machines: A review , 2010, Neurocomputing.

[12]  Guray Kucukkocaoglu,et al.  IPO mechanism selection by using Classification and Regression Trees , 2012 .

[13]  Xiang Yu,et al.  Financial distress prediction based on SVM and MDA methods: the case of Chinese listed companies , 2011 .

[14]  Jean-Michel Poggi,et al.  Variable selection using random forests , 2010, Pattern Recognit. Lett..

[15]  A study on the modified components of Asian Currency Unit: an application of the Artificial Neural Network , 2011 .

[16]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[17]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[18]  Indranil Bose,et al.  Deciding the financial health of dot-coms using rough sets , 2006, Inf. Manag..

[19]  Chih-Fong Tsai,et al.  Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches , 2010, Decis. Support Syst..

[20]  I-Shuo Chen,et al.  Present and future: a trend forecasting and ranking of university types for innovative development from an intellectual capital perspective , 2011, Quality & Quantity.

[21]  Gwo-Hshiung Tzeng,et al.  Compromise solution by MCDM methods: A comparative analysis of VIKOR and TOPSIS , 2004, Eur. J. Oper. Res..

[22]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[23]  Gwo-Hshiung Tzeng,et al.  Multicriteria Planning of Post‐Earthquake Sustainable Reconstruction , 2002 .

[24]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  David Casasent,et al.  An improvement on floating search algorithms for feature subset selection , 2009, Pattern Recognit..

[26]  Howard Mark Schilit Financial Shenanigans: How to Detect Accounting Gimmicks and Fraud in Financial Reports , 1993 .