Exploring the boundary region of tolerance rough sets for feature selection

Of all of the challenges which face the effective application of computational intelligence technologies for pattern recognition, dataset dimensionality is undoubtedly one of the primary impediments. In order for pattern classifiers to be efficient, a dimensionality reduction stage is usually performed prior to classification. Much use has been made of rough set theory for this purpose as it is completely data-driven and no other information is required; most other methods require some additional knowledge. However, traditional rough set-based methods in the literature are restricted to the requirement that all data must be discrete. It is therefore not possible to consider real-valued or noisy data. This is usually addressed by employing a discretisation method, which can result in information loss. This paper proposes a new approach based on the tolerance rough set model, which has the ability to deal with real-valued data whilst simultaneously retaining dataset semantics. More significantly, this paper describes the underlying mechanism for this new approach to utilise the information contained within the boundary region or region of uncertainty. The use of this information can result in the discovery of more compact feature subsets and improved classification accuracy. These results are supported by an experimental evaluation which compares the proposed approach with a number of existing feature selection techniques.

[1]  Bart Kosko,et al.  Fuzzy entropy and conditioning , 1986, Inf. Sci..

[2]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[3]  Qiang Shen,et al.  Tolerance-based and Fuzzy-Rough Feature Selection , 2007, 2007 IEEE International Fuzzy Systems Conference.

[4]  Selwyn Piramuthu,et al.  The Hausdorff distance measure for feature selection in learning applications , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[5]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[6]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[7]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[8]  Tsau Young Lin,et al.  Rough Set Methods and Applications , 2000 .

[9]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[10]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[11]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[12]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[13]  Andrzej Skowron,et al.  Tolerance Approximation Spaces , 1996, Fundam. Informaticae.

[14]  A. Kai Qin,et al.  Rapid and brief communication Uncorrelated heteroscedastic LDAbasedon theweightedpairwise Chernoff criterion , 2004 .

[15]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[16]  Qiang Shen,et al.  Centre for Intelligent Systems and Their Applications Fuzzy Rough Attribute Reduction with Application to Web Categorization Fuzzy Rough Attribute Reduction with Application to Web Categorization Fuzzy Sets and Systems ( ) – Fuzzy–rough Attribute Reduction with Application to Web Categorization , 2022 .

[17]  Andrzej Skowron,et al.  Searching for Relational Patterns in Data , 1997, PKDD.

[18]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[19]  Khairul A. Rasmani,et al.  Data-driven fuzzy rule generation and its application for student academic performance evaluation , 2006, Applied Intelligence.

[20]  Vijay V. Raghavan,et al.  Exploiting Upper Approximation in the Rough Set Methodology , 1995, KDD.

[21]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[22]  Wen-Xiu Zhang,et al.  Applying Indiscernibility Attribute Sets to Knowledge Reduction , 2005, Australian Conference on Artificial Intelligence.

[23]  Kim-Hui Yap,et al.  Fuzzy SVM for content-based image retrieval: a pseudo-label support vector machine framework , 2006, IEEE Computational Intelligence Magazine.

[24]  Didier Dubois,et al.  Putting Rough Sets and Fuzzy Sets Together , 1992, Intelligent Decision Support.

[26]  Qiang Shen,et al.  A rough-fuzzy approach for generating classification rules , 2002, Pattern Recognit..

[27]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[28]  Bl. Sendov Hausdorff distance and image processing , 2004 .

[29]  Andrzej Skowron,et al.  Rough set methods in feature selection and recognition , 2003, Pattern Recognit. Lett..

[30]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[31]  S. Lanteri,et al.  Chemometric analysis of Tuscan olive oils , 1989 .

[32]  Qiang Shen,et al.  Rough set-aided keyword reduction for text categorization , 2001, Appl. Artif. Intell..

[33]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[34]  Aboul Ella Hassanien,et al.  Rough set approach for attribute reduction and rule generation: A case of patients with suspected breast cancer , 2004, J. Assoc. Inf. Sci. Technol..

[35]  Kui Wu,et al.  A soft relevance framework in content-based image retrieval systems , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[36]  Robert I. Damper,et al.  A fast separability-based feature-selection method for high-dimensional remotely sensed image classification , 2008, Pattern Recognit..

[37]  Qiang Shen,et al.  Distance Measure Assisted Rough Set Feature Selection , 2007, 2007 IEEE International Fuzzy Systems Conference.

[38]  Ning Zhong,et al.  Using Rough Sets with Heuristics for Feature Selection , 1999, Journal of Intelligent Information Systems.

[39]  Dominik Ślęzak,et al.  Various approaches to reasoning with frequency based decision reducts: a survey , 2000 .

[40]  Adam C. Winstanley,et al.  Invariant optimal feature selection: A distance discriminant and feature ranking based solution , 2008, Pattern Recognit..

[41]  Daoqiang Zhang,et al.  Constraint Score: A new filter method for feature selection with pairwise constraints , 2008, Pattern Recognit..

[42]  Hiroshi Motoda,et al.  Book Review: Computational Methods of Feature Selection , 2007, The IEEE intelligent informatics bulletin.

[43]  Qiang Shen,et al.  Finding fuzzy-rough reducts with fuzzy entropy , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[44]  J. Franklin,et al.  The elements of statistical learning: data mining, inference and prediction , 2005 .

[45]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[46]  Kan Li,et al.  Rough set based attribute reduction approach in data mining , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[47]  Maciej Modrzejewski,et al.  Feature Selection Using Rough Sets Theory , 1993, ECML.

[48]  Huan Liu,et al.  Consistency Based Feature Selection , 2000, PAKDD.

[49]  William Rucklidge,et al.  Efficient Visual Recognition Using the Hausdorff Distance , 1996, Lecture Notes in Computer Science.