Feature Selection and Combination for Stress Identification Using Correlation and Diversity

Using multiple physiological sensors to detect different stress level has become an important and popular task in improving human health and well-being. In the process, the selection of a smaller set of independent features is a necessary, yet challenging, step for feature combination, situation analysis and decision making. In this paper, we investigate feature selection methods using both concepts of correlation and diversity. Six feature combination methods (C4.5, Naïve Bayes, Linear Discriminant Function, Support Vector Machine, K Nearest Neighbors and Combinatorial Fusion) are applied to the selected features in the detection of the stress levels. Our results demonstrated that (a) diversity based feature selection is as good as correlation based selection across all six combination methods, and (b) combinatorial fusion method performs better than five other combination methods across all features selected by using both correlation and diversity.

[1]  D. Frank Hsu,et al.  Comparing Rank and Score Combination Methods for Data Fusion in Information Retrieval , 2005, Information Retrieval.

[2]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[3]  D. Frank Hsu,et al.  Combinatorial Fusion Analysis: Methods and Practices of Combining Multiple Scoring Systems , 2006 .

[4]  Yong Deng,et al.  Evaluating feature selection for stress identification , 2012, 2012 IEEE 13th International Conference on Information Reuse & Integration (IRI).

[5]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[6]  Frederic Angus Front-end Analog Pre-Processing for Real Time Psychophysiological Stress Measurements , 2007 .

[7]  Salvatore Ruggieri,et al.  Efficient C4.5 , 2002, IEEE Trans. Knowl. Data Eng..

[8]  Emil Jovanov,et al.  Stress monitoring using a distributed wireless intelligent sensor system. , 2003, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[9]  D. Frank Hsu,et al.  Rank-Score Characteristics (RSC) Function and Cognitive Diversity , 2010, Brain Informatics.

[10]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[11]  Lei Zhang,et al.  Hierarchical multiple sensor fusion using structurally learned Bayesian network , 2010, Wireless Health.

[12]  Jennifer Healey,et al.  Detecting stress during real-world driving tasks using physiological sensors , 2005, IEEE Transactions on Intelligent Transportation Systems.

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Mykola Pechenizkiy,et al.  What's Your Current Stress Level? Detection of Stress Patterns from GSR Sensor Data , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[15]  Martin L. Griss,et al.  Activity-Aware Mental Stress Detection Using Physiological Sensors , 2010, MobiCASE.

[16]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[17]  Jennifer A. Healey,et al.  Wearable and automotive systems for affect recognition from physiology , 2000 .