论文信息 - Facial Expression Recognition Utilizing Local Direction-Based Robust Features and Deep Belief Network

Facial Expression Recognition Utilizing Local Direction-Based Robust Features and Deep Belief Network

Emotional health plays very vital role to improve people’s quality of lives, especially for the elderly. Negative emotional states can lead to social or mental health problems. To cope with emotional health problems caused by negative emotions in daily life, we propose efficient facial expression recognition system to contribute in emotional healthcare system. Thus, facial expressions play a key role in our daily communications, and recent years have witnessed a great amount of research works for reliable facial expressions recognition (FER) systems. Therefore, facial expression evaluation or analysis from video information is very challenging and its accuracy depends on the extraction of robust features. In this paper, a unique feature extraction method is presented to extract distinguished features from the human face. For person independent expression recognition, depth video data is used as input to the system where in each frame, pixel intensities are distributed based on the distances to the camera. A novel robust feature extraction process is applied in this work which is named as local directional position pattern (LDPP). In LDPP, after extracting local directional strengths for each pixel such as applied in typical local directional pattern (LDP), top directional strength positions are considered in binary along with their strength sign bits. Considering top directional strength positions with strength signs in LDPP can differentiate edge pixels with bright as well as dark regions on their opposite sides by generating different patterns whereas typical LDP only considers directions representing the top strengths irrespective of their signs as well as position orders (i.e., directions with top strengths represent 1 and rest of them 0), which can generate the same patterns in this regard sometimes. Hence, LDP fails to distinguish edge pixels with opposite bright and dark regions in some cases which can be overcome by LDPP. Moreover, the LDPP capabilities are extended through principal component analysis (PCA) and generalized discriminant analysis (GDA) for better face characteristic illustration in expression. The proposed features are finally applied with deep belief network (DBN) for expression training and recognition.

[1] Matti Pietikäinen,et al. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2] Md. Zia Uddin,et al. An enhanced independent component-based human facial expression recognition from video , 2009, IEEE Transactions on Consumer Electronics.

[3] Min Chen,et al. iDoctor: Personalized and professionalized medical recommendations based on hybrid matrix factorization , 2017, Future Gener. Comput. Syst..

[4] Shaogang Gong,et al. Facial expression recognition based on Local Binary Patterns: A comprehensive study , 2009, Image Vis. Comput..

[5] Trevor Darrell,et al. Pose estimation using 3D view-based eigenspaces , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[6] J. Gregory Trafton,et al. Cognitive Architectures for social human-robot interaction , 2016, 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[7] Luc Van Gool,et al. Head Pose Estimation from Passive Stereo Images , 2009, SCIA.

[8] Tae-Seong Kim,et al. Human computer interface using the recognized finger parts of hand depth silhouette via random forests , 2013, 2013 13th International Conference on Control, Automation and Systems (ICCAS 2013).

[9] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[10] Ioannis Pitas,et al. ICA and Gabor representation for facial expression recognition , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[11] Andrew J. Calder,et al. PII: S0042-6989(01)00002-5 , 2001 .

[12] Stan Sclaroff,et al. Sign Language Spotting with a Threshold Model Based on Conditional Random Fields , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Erkki Oja,et al. Independent Component Analysis , 2001 .

[14] Ying Wang,et al. Human Activity Recognition Based on R Transform , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15] Andrew McCallum,et al. Maximum Entropy Markov Models for Information Extraction and Segmentation , 2000, ICML.

[16] Marian Stewart Bartlett,et al. Face recognition by independent component analysis , 2002, IEEE Trans. Neural Networks.

[17] Dong-Sun Kim,et al. Embedded face recognition based on fast genetic algorithm for intelligent digital photography , 2006, IEEE Transactions on Consumer Electronics.

[18] A. Young,et al. Configural information in facial expression perception. , 2000, Journal of experimental psychology. Human perception and performance.

[19] Stefan Müller,et al. Hand Gesture Recognition with a Novel IR Time-of-Flight Range Camera-A Pilot Study , 2007, MIRAGE.

[20] Xiaodong Yang,et al. EigenJoints-based action recognition using Naïve-Bayes-Nearest-Neighbor , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[21] Luc Van Gool,et al. Real-time face pose estimation from single range images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[22] Md. Zia Uddin. A Facial Expression Recognition System from Depth Video , 2014 .

[23] Tadeusz Szkodny. Application of vision information to planning trajectories of Adept Six-300 robot , 2016, MMAR.

[24] Limei Peng,et al. CADRE: Cloud-Assisted Drug REcommendation Service for Online Pharmacies , 2014, Mobile Networks and Applications.

[25] Antonis A. Argyros,et al. Tracking the articulated motion of two strongly interacting hands , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[26] Jiafu Wan,et al. A multimedia healthcare data sharing approach through cloud-based body area network , 2017, Future Gener. Comput. Syst..

[27] Seyed Kamaledin Setarehdan,et al. Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal , 2008, Artif. Intell. Medicine.

[28] Harry Wechsler,et al. The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[29] Ulrich Neumann,et al. Real-time Hand Pose Recognition Using Low-Resolution Depth Images , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[30] Zicheng Liu,et al. HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[31] Oksam Chae,et al. Local Directional Pattern (LDP) – A Robust Image Descriptor for Object Recognition , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[32] S. Mitra,et al. Gesture Recognition: A Survey , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33] Md. Zia Uddin. A depth video-based facial expression recognition system utilizing generalized local directional deviation-based binary pattern feature discriminant analysis , 2015, Multimedia Tools and Applications.

[34] Marian Stewart Bartlett,et al. Classifying Facial Actions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[35] Wanqing Li,et al. Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[36] Luc Van Gool,et al. Fast 3D Scanning with Automatic Motion Compensation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[37] Andrea Cavallaro,et al. 3-D Face Detection, Landmark Localization, and Registration Using a Point Distribution Model , 2009, IEEE Transactions on Multimedia.

[38] Rainer Stiefelhagen,et al. Head pose estimation using stereo vision for human-robot interaction , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[39] Xia Liu,et al. Hand gesture recognition using depth data , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[40] Chengjun Liu,et al. Enhanced independent component analysis and its application to content based face image retrieval , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[41] H. Seidel,et al. Pattern-aware Deformation Using Sliding Dockers , 2011, SIGGRAPH 2011.

[42] Yasue Mitsukura,et al. Classification of hand postures based on 3D vision model for human-robot interaction , 2010, 19th International Symposium in Robot and Human Interactive Communication.

[43] Garrison W. Cottrell,et al. Representing Face Images for Emotion Classification , 1996, NIPS.

[44] Christian Igel,et al. Training restricted Boltzmann machines: An introduction , 2014, Pattern Recognit..

[45] Surendra Ranganath,et al. Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[46] Joachim Hornegger,et al. Gesture recognition with a Time-Of-Flight camera , 2008, Int. J. Intell. Syst. Technol. Appl..

[47] Frank Y. Shih,et al. Recognizing facial action units using independent component analysis and support vector machine , 2006, Pattern Recognit..

[48] Luc Van Gool,et al. Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[49] Shaogang Gong,et al. Robust facial expression recognition using local binary patterns , 2005, IEEE International Conference on Image Processing 2005.

[50] Anil K. Jain,et al. Automatic feature extraction for multiview 3D face recognition , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[51] Yin Zhang,et al. GroRec: A Group-Centric Intelligent Recommender System Integrating Social, Mobile and Big Data Technologies , 2016, IEEE Transactions on Services Computing.

[52] Fan Chen,et al. Facial Expression Recognition by Supervised Independent Component Analysis Using MAP Estimation , 2008, IEICE Trans. Inf. Syst..

[53] Hema Swetha Koppula,et al. Learning human activities and object affordances from RGB-D videos , 2012, Int. J. Robotics Res..

[54] Dieter Fox,et al. Fine-grained kitchen activity recognition using RGB-D , 2012, UbiComp.

[55] Zhengyou Zhang,et al. 3D Deformable Face Tracking with a Commodity Depth Camera , 2010, ECCV.

[56] Dan Xu,et al. Comparison of PCA, LDA and GDA for palmprint verification , 2010, 2010 International Conference on Information, Networking and Automation (ICINA).

[57] M. Lewicki,et al. Learning higher-order structures in natural images , 2003, Network.

[58] Franck Davoine,et al. A solution for facial expression representation and recognition , 2002, Signal Process. Image Commun..

[59] Md. Zia Uddin. Facial expression recognition using depth information and spatiotemporal features , 2016 .

[60] Iven Van Mechelen,et al. Probabilistic feature analysis of facial perception of emotions , 2005 .

[61] Maurício Pamplona Segundo,et al. Automatic Face Segmentation and Facial Landmark Detection in Range Images , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[62] Mario Fernando Montenegro Campos,et al. STOP: Space-Time Occupancy Patterns for 3D Action Recognition from Depth Map Sequences , 2012, CIARP.

[63] Luc Van Gool,et al. Combining RGB and ToF cameras for real-time 3D hand gesture interaction , 2011, WACV.

[64] T. Sejnowski,et al. Face image analysis for expression measurement and detection of deceit , 1999 .

[65] Xiaodong Yang,et al. Recognizing actions using depth motion maps-based histograms of oriented gradients , 2012, ACM Multimedia.

[66] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[67] Zicheng Liu,et al. Expandable Data-Driven Graphical Modeling of Human Actions Based on Salient Postures , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[68] Lijun Yin,et al. Automatic pose estimation of 3D facial models , 2008, 2008 19th International Conference on Pattern Recognition.

[69] James M. Rehg,et al. Learning the basic units in American Sign Language using discriminative segmental feature selection , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[70] Joachim Hornegger,et al. Robust real-time 3D time-of-flight based gesture navigation , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[71] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry , 1969 .

[72] Mohammed Bennamoun,et al. Automatic 3D Face Detection, Normalization and Recognition , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[73] Tae-Seong Kim,et al. Daily Human Activity Recognition Using Depth Silhouettes and R\mathcal{R} Transformation for Smart Home , 2011, ICOST.

[74] Luc Van Gool,et al. Tracking a hand manipulating an object , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[75] Hao Li,et al. Realtime performance-based facial animation , 2011, ACM Trans. Graph..

[76] P. Ekman,et al. Facial action coding system: a technique for the measurement of facial movement , 1978 .

[77] Luc Van Gool,et al. An object-dependent hand pose prior from sparse training data , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[78] Ying Wu,et al. Robust 3D Action Recognition with Random Occupancy Patterns , 2012, ECCV.

[79] Zhi Li,et al. Real time Hand Gesture Recognition using a Range Camera , 2009, ICRA 2009.

[80] Hermann Ney,et al. The SignSpeak Project - Bridging the Gap Between Signers and Speakers , 2010, LREC.

[81] Bart Selman,et al. Unstructured human activity detection from RGBD images , 2011, 2012 IEEE International Conference on Robotics and Automation.

[82] Victor C. M. Leung,et al. CAP: community activity prediction based on big data analysis , 2014, IEEE Network.

[83] Aggelos K. Katsaggelos,et al. Automatic facial expression recognition using facial animation parameters and multistream HMMs , 2006, IEEE Transactions on Information Forensics and Security.

[84] Joachim Hornegger,et al. 3-D gesture-based scene navigation in medical imaging applications using Time-of-Flight cameras , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[85] Michael J. Lyons,et al. Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[86] Patrick J. Flynn,et al. Multiple Nose Region Matching for 3D Face Recognition under Varying Facial Expression , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[87] Joanna Marnik,et al. The Polish Finger Alphabet Hand Postures Recognition Using Elastic Graph Matching , 2008, Computer Recognition Systems 2.

[88] Nicu Sebe,et al. Facial expression recognition from video sequences: temporal and static modeling , 2003, Comput. Vis. Image Underst..