Towards scalable fuzzy-rough feature selection

Abstract Research in the area of fuzzy–rough set theory, and its application to feature or attribute selection in particular, has enjoyed much attention in recent years. Indeed, with the growth of larger and larger data dimensionality, the number of data objects required in order to generate accurate models increases exponentially. Thus, for model learning, feature selection has become increasingly necessary. The use of fuzzy–rough sets as dataset pre-processors offer much in the way of flexibility, however the underlying complexity of the subset evaluation metric often presents a problem and can result in a great deal of potentially unnecessary computational effort. This paper proposes two different novel ways to address this problem using a neighbourhood approximation step and attribute grouping in order to alleviate the processing overhead and reduce complexity. A series of experiments are conducted on benchmark datasets which demonstrate that much computational effort can be avoided, and as a result the efficiency of the feature selection process for fuzzy–rough sets can be improved considerably.

[1]  Qiang Shen,et al.  New Approaches to Fuzzy-Rough Feature Selection , 2009, IEEE Transactions on Fuzzy Systems.

[2]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[3]  Dominik Slezak,et al.  Utilization of attribute clustering methods for scalable computation of reducts from high-dimensional data , 2012, 2012 Federated Conference on Computer Science and Information Systems (FedCSIS).

[4]  Chris Cornelis,et al.  Feature grouping-based fuzzy-rough feature selection , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[5]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[6]  Chris Cornelis,et al.  Computing fuzzy rough approximations in large scale information systems , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[7]  Ivor W. Tsang,et al.  Towards ultrahigh dimensional feature selection for big data , 2012, J. Mach. Learn. Res..

[8]  John Langford,et al.  Scaling up machine learning: parallel and distributed approaches , 2011, KDD '11 Tutorials.

[9]  Robert Marti,et al.  A Novel Breast Tissue Density Classification Methodology , 2008, IEEE Transactions on Information Technology in Biomedicine.

[10]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[11]  Yichao Wu,et al.  Ultrahigh Dimensional Feature Selection: Beyond The Linear Model , 2009, J. Mach. Learn. Res..

[12]  Chris Cornelis,et al.  Attribute selection with fuzzy decision reducts , 2010, Inf. Sci..

[13]  David Ellsworth,et al.  Managing Big Data for Scientific Visualization , 2015 .

[14]  Richard Jensen,et al.  Nearest Neighbour-Based Fuzzy-Rough Feature Selection , 2014, RSCTC.

[15]  Jeremy Kubica,et al.  Parallel Large Scale Feature Selection for Logistic Regression , 2009, SDM.

[16]  Didier Dubois,et al.  Putting Rough Sets and Fuzzy Sets Together , 1992, Intelligent Decision Support.

[17]  Wei Wu,et al.  Fuzzy similarity-based nearest-neighbour classification as alternatives to their fuzzy-rough parallels , 2013, Int. J. Approx. Reason..

[18]  Qiang Shen,et al.  Finding rough and fuzzy-rough set reducts with SAT , 2014, Inf. Sci..

[19]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[20]  Da Ruan,et al.  A parallel method for computing rough set approximations , 2012, Inf. Sci..

[21]  Anna Maria Radzikowska,et al.  A comparative study of fuzzy rough sets , 2002, Fuzzy Sets Syst..

[22]  Chris Cornelis,et al.  Ordered Weighted Average Based Fuzzy Rough Sets , 2010, RSKT.

[23]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[24]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[25]  Qinghua Hu,et al.  On Robust Fuzzy Rough Set Models , 2012, IEEE Transactions on Fuzzy Systems.