Application of Dimensionality Reduction Methods for Eye Movement Data Classification

In this paper we apply two data dimensionality reduction methods to eye movement dataset and analyse how the feature reduction method improves classification accuracy. Due to the specificity of the recording process, eye movement datasets are characterized by both big size and high-dimensionality that make them difficult to analyse and classify using standard classification approaches. Here, we analyse eye movement data from BioEye 2015 competition and to deal with the problem of high dimensionality we apply SVM combined with PCA feature extraction and random forests wrapper variable selection. Our results show that the reduction of the number of variables improves classification results. We also show that some of classes (participants) can be classified (recognised) with high accuracy while others are very difficult to be correctly identified.

[1]  Usman Saeed A Survey of Automatic Person Recognition using eye movements , 2014, Int. J. Pattern Recognit. Artif. Intell..

[2]  Bertrand Michel,et al.  Correlation and variable importance in random forests , 2013, Statistics and Computing.

[3]  Ting Wang,et al.  Application of Breiman's Random Forest to Modeling Structure-Activity Relationships of Pharmaceutical Molecules , 2004, Multiple Classifier Systems.

[4]  Marek Sikora Redefinition of Decision Rules Based on the Importance of Elementary Conditions Evaluation , 2013, Fundam. Informaticae.

[5]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[6]  Pawel Kasprowski,et al.  Eye Movements in Biometrics , 2004, ECCV Workshop BioAW.

[7]  Alexander J. Smola,et al.  Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[8]  Katarzyna Harezlak,et al.  The Second Eye Movements Verification and Identification Competition , 2014, IEEE International Joint Conference on Biometrics.

[9]  Marek Sikora,et al.  Quality improvement of rule-based gene group descriptions using information about GO terms importance occurring in premises of determined rules , 2010, Int. J. Appl. Math. Comput. Sci..

[10]  Pawel Kasprowski,et al.  First eye movement verification and identification competition at BTAS 2012 , 2012, 2012 IEEE Fifth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[11]  Jos Boekhorst,et al.  Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle? , 2012, Briefings Bioinform..

[12]  Witold R. Rudnicki,et al.  Boruta - A System for Feature Selection , 2010, Fundam. Informaticae.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Wolfgang Rosenstiel,et al.  Feature selection for high-dimensional industrial data , 2005, ESANN.

[15]  Robert P. W. Duin,et al.  Prototype selection for dissimilarity-based classifiers , 2006, Pattern Recognit..

[16]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[17]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[18]  Christoph F. Eick,et al.  A framework for spatial feature selection and scoping and its application to geo-targeting , 2011, Proceedings 2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services.

[19]  Nassir Navab,et al.  Efficient Learning of Linear Predictors Using Dimensionality Reduction , 2012, ACCV.

[20]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[21]  Tomi Kinnunen,et al.  Eye-Movements as a Biometric , 2005, SCIA.

[22]  Christopher J. C. Burges,et al.  Dimension Reduction: A Guided Tour , 2010, Found. Trends Mach. Learn..

[23]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.