Feature selection in haptic-based handwritten signatures using rough sets

This paper explores the use of rough set theory for feature selection in high dimensional haptic-based handwritten signatures (exploited for user identification). Two rough set-based methods for feature selection are analyzed, the first is a greedy approach while the second relies on genetic algorithms to find minimal subsets of attributes. Also, to further reduce the haptic feature space while maximizing user identification accuracy, a method is proposed where feature vectors are subsampled prior to the feature selection procedure. Rough setgenerated minimal subsets are initially exploited to determine the importance of different haptic data types (e.g. force, position, torque and orientation) in discriminating between different users. In addition, a comparison between rough set-based methods and classical machine learning techniques in the selection of minimal information-preserving subsets of features in high dimensional haptic datasets, is provided. The criteria for comparison are the length of the selected subsets of features and their corresponding discrimination power. Support Vector Machine classifiers are used to evaluate the accuracy of the selected minimal feature vectors. The results demonstrated that the combination of rough set and genetic algorithm techniques can outperform well-established machine learning methods in the selection of minimal subsets of features present in haptic-based handwritten signatures.

[1]  Jakub Wroblewski,et al.  Ensembles of Classifiers Based on Approximate Reducts , 2001, Fundam. Informaticae.

[2]  Abdulmotaleb El-Saddik,et al.  Relevant Feature Selection and Generation in High Dimensional Haptic-based Biometric Data , 2009, DMIN.

[3]  Aleksander Øhrn,et al.  Discernibility and Rough Sets in Medicine: Tools and Applications , 2000 .

[4]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[5]  Abdulmotaleb El-Saddik,et al.  A Novel Biometric System for Identification and Verification of Haptic Users , 2007, IEEE Transactions on Instrumentation and Measurement.

[6]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Staal A. Vinterbo,et al.  Minimal approximate hitting sets and rule templates , 2000, Int. J. Approx. Reason..

[9]  Piotr Jedrzejowicz,et al.  Data Reduction Algorithm for Machine Learning and Data Mining , 2008, IEA/AIE.

[10]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[11]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[12]  Andrzej Skowron,et al.  Dynamic Reducts as a Tool for Extracting Laws from Decisions Tables , 1994, ISMIS.

[13]  Abdulmotaleb El-Saddik,et al.  Experiments in haptic-based authentication of humans , 2008, Multimedia Tools and Applications.

[14]  Abdulmotaleb El-Saddik,et al.  Feature selection and classification in genetic programming: Application to haptic-based biometric data , 2009, 2009 IEEE Symposium on Computational Intelligence for Security and Defense Applications.

[15]  Eibe Frank,et al.  Large-scale attribute selection using wrappers , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .