Rank Aggregation for Filter Feature Selection in Credit Scoring

The credit industry is growing fast: credit institutions collect data about their customers and use it to build credit scoring models. The collected data often contain irrelevant and redundant features that can slow down the learning process, so effective feature selection methods are needed for credit datasets. In general, filter feature selection methods outperform other feature selection techniques because they are effective and computationally fast. However, choosing an appropriate method from the wide variety of classical filters proposed in the literature is a crucial issue in machine learning. We therefore propose a feature selection fusion model that combines the results of different filter feature selection methods via rank aggregation techniques. Evaluations on four credit datasets show that the fusion model achieves good results.
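The fusion idea can be illustrated with a minimal sketch. It assumes mean-rank aggregation, which is only one of several possible aggregation techniques and is not necessarily the one used in the paper. The function names, feature names, and scores below are hypothetical and chosen purely for illustration.

```python
def scores_to_ranks(scores):
    """Turn one filter's relevance scores (higher = more relevant) into ranks.

    Rank 1 is the best feature; tied features share the average of the
    positions they occupy.
    """
    ordered = sorted(scores.items(), key=lambda kv: -kv[1])
    ranks = {}
    i = 0
    while i < len(ordered):
        # Extend j to cover the run of features tied with position i.
        j = i
        while j + 1 < len(ordered) and ordered[j + 1][1] == ordered[i][1]:
            j += 1
        avg_rank = ((i + 1) + (j + 1)) / 2
        for k in range(i, j + 1):
            ranks[ordered[k][0]] = avg_rank
        i = j + 1
    return ranks


def aggregate_mean_rank(score_tables):
    """Fuse several filters' scores by averaging each feature's rank.

    Assumes every table scores the same set of features. Returns the
    features ordered from best (lowest mean rank) to worst.
    """
    rank_tables = [scores_to_ranks(table) for table in score_tables]
    features = list(rank_tables[0])
    fused = {
        f: sum(ranks[f] for ranks in rank_tables) / len(rank_tables)
        for f in features
    }
    # Break ties in mean rank alphabetically so the output is deterministic.
    return sorted(features, key=lambda f: (fused[f], f))


# Hypothetical scores from three different filter methods.
filter_a = {"income": 0.9, "age": 0.4, "loan_amount": 0.7, "phone": 0.1}
filter_b = {"income": 0.6, "age": 0.5, "loan_amount": 0.8, "phone": 0.2}
filter_c = {"income": 0.8, "age": 0.3, "loan_amount": 0.2, "phone": 0.1}

fused_ranking = aggregate_mean_rank([filter_a, filter_b, filter_c])
print(fused_ranking)
```

Working on ranks rather than raw scores sidesteps the fact that different filters produce scores on incomparable scales. A final feature subset could then be taken as the top k entries of the fused ranking, with k chosen by the practitioner.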
