Feature Selection and Interpretable Feature Transformation: A Preliminary Study on Feature Engineering for Classification Algorithms

This paper explores the limitation of consistency-based measures in the context of feature selection. These kinds of filters are not very widespread in large-dimensionality problems. Typically, the number of selected of attributes is very small and the ability to do right predictions is a drawback. The principal contribution of this work is the introduction of a new approach within feature engineering to create new attributes after the feature selection stage. The experimentation on multi-class problems with a feature space in the order of tens of thousands shed light on that some improvements took place with the new proposal. As a final insight, some new relationships were discovered due to the combined application of feature selection and feature transformation. Additionally, a new measure for classification problems which relates the number of features and the number of classes or labels is also proposed.

[1]  David Leinweder Expert System in Space , 1987, IEEE Expert.

[2]  H. Motoda,et al.  Feature Transformation And Subset Selection , 1998, IEEE Intelligent Systems and their Applications.

[3]  Rüdiger Wirth,et al.  CRISP-DM: Towards a Standard Process Model for Data Mining , 2000 .

[4]  Huan Liu,et al.  Consistency Based Feature Selection , 2000, PAKDD.

[5]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[6]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[7]  Ian H. Witten,et al.  Weka-A Machine Learning Workbench for Data Mining , 2005, Data Mining and Knowledge Discovery Handbook.

[8]  José Manuel Benítez,et al.  Consistency measures for feature selection , 2008, Journal of Intelligent Information Systems.

[9]  Kilho Shin,et al.  Consistency-Based Feature Selection , 2009, KES.

[10]  Antonio J. Tallón-Ballesteros,et al.  Merging subsets of attributes to improve a hybrid consistency-based filter: a case of study in product unit neural networks , 2016, Connect. Sci..

[11]  Antonio J. Tallón-Ballesteros,et al.  Simplifying pattern recognition problems via a scatter search algorithm , 2016 .

[12]  Sung-Bae Cho,et al.  Visual Tools to Lecture Data Analytics and Engineering , 2017, IWINAC.

[13]  Tetsuji Kuboyama,et al.  sCwc/sLcc: Highly Scalable Feature Selection Algorithms , 2017, Inf..

[14]  Sung-Bae Cho,et al.  Stochastic and Non-Stochastic Feature Selection , 2017, IDEAL.

[15]  Antonio J. Tallón-Ballesteros,et al.  Low Dimensionality or Same Subsets as a Result of Feature Selection: An In-Depth Roadmap , 2017, IWINAC.

[16]  Kaicheng Li,et al.  Fuzzy systems and data mining IV , 2018 .

[17]  Huan Liu,et al.  Feature Engineering for Machine Learning and Data Analytics , 2018 .

[18]  Antonio J. Tallón-Ballesteros,et al.  Featuring the Attributes in Supervised Machine Learning , 2018, HAIS.