Measuring cues for stand-off deception detection based on full-body nonverbal features in body-worn cameras

Deception detection is valuable in the security domain to distinguish truth from lies. It is desirable in many security applications, such as suspect and witness interviews and airport passenger screening. Interviewers are constantly trying to assess the credibility of a statement, usually based on intuition without objective technical support. However, psychological research has shown that humans can hardly perform better than random guessing. Deception detection is a multi-disciplinary research area with an interest from different fields, such as psychology and computer science. In the last decade, several developments have helped to improve the accuracy of lie detection (e.g., with a concealed information test, increasing the cognitive load, or measurements with motion capture suits) and relevant cues have been discovered (e.g., eye blinking or fiddling with the fingers). With an increasing presence of mobile phones and bodycams in society, a mobile, stand-off, automatic deception detection methodology based on various cues from the whole body would create new application opportunities. In this paper, we study the feasibility of measuring these visual cues automatically on different parts of the body, laying the groundwork for stand-off deception detection in more flexible and mobile deployable sensors, such as body-worn cameras. We give an extensive overview of recent developments in two communities: in the behavioral-science community the developments that improve deception detection with a special attention to the observed relevant non-verbal cues, and in the computer-vision community the recent methods that are able to measure these cues. The cues are extracted from several body parts: the eyes, the mouth, the head and the fullbody pose. We performed an experiment using several state-of-the-art video-content-analysis (VCA) techniques to assess the quality of robustly measuring these visual cues.

[1]  Nicu Sebe,et al.  Combining Head Pose and Eye Location Information for Gaze Estimation , 2012, IEEE Transactions on Image Processing.

[2]  Václav Hlavác,et al.  Pose primitive based human action recognition in videos or still images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Heng Yang,et al.  Facial feature point detection: A comprehensive survey , 2014, Neurocomputing.

[4]  Yiying Tong,et al.  FaceWarehouse: A 3D Facial Expression Database for Visual Computing , 2014, IEEE Transactions on Visualization and Computer Graphics.

[5]  Leonid Sigal Human Pose Estimation , 2014, Computer Vision, A Reference Guide.

[6]  Cordelia Schmid,et al.  Mixing Body-Part Sequences for Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yuan Yao,et al.  Contour Model-Based Hand-Gesture Recognition Using the Kinect Sensor , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Yi Yang,et al.  Articulated Human Detection with Flexible Mixtures of Parts , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Klamer Schutte,et al.  Incremental concept learning with few training examples and hierarchical classification , 2015, SPIE Security + Defence.

[10]  Mubarak Shah,et al.  Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Samuel R. Gross,et al.  Convicting the Innocent , 2008 .

[12]  M. Cody,et al.  Gender and Vocal Stress Differences During Truthful and Deceptive Information Sequences , 1987 .

[13]  Lijun Yin,et al.  Static and dynamic 3D facial expression recognition: A comprehensive survey , 2012, Image Vis. Comput..

[14]  Gershon Ben-Shakhar,et al.  The validity of psychophysiological detection of information with the Guilty Knowledge Test: a meta-analytic review. , 2003, The Journal of applied psychology.

[15]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[17]  Rachel Taylor,et al.  Believed cues to deception: Judgments in self‐generated trivial and serious situations , 2007 .

[18]  Nassir Navab,et al.  3D Pictorial Structures for Multiple Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Mark Everingham,et al.  Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation , 2010, BMVC.

[20]  Andrea Cavallaro,et al.  Automatic Analysis of Facial Affect: A Survey of Registration, Representation, and Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Peter Robinson,et al.  3D Constrained Local Model for rigid and non-rigid facial tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Brandon L. Garrett Convicting the Innocent: Where Criminal Prosecutions Go Wrong , 2011 .

[23]  Klamer Schutte,et al.  Interactive detection of incrementally learned concepts in images with ranking and semantic query interpretation , 2015, 2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI).

[24]  Stefanos Zafeiriou,et al.  A comparison of different features for automatic eye blinking detection with an application to analysis of deceptive behavior , 2012, 2012 5th International Symposium on Communications, Control and Signal Processing.

[25]  Bernt Schiele,et al.  2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Yoichi Sato,et al.  Appearance-Based Gaze Estimation Using Visual Saliency , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  James J. Lindsay,et al.  Cues to deception. , 2003, Psychological bulletin.

[28]  Klamer Schutte,et al.  Instantaneous threat detection based on a semantic representation of activities, zones and trajectories , 2014, Signal, Image and Video Processing.

[29]  Vinay Bettadapura,et al.  Face Expression Recognition and Analysis: The State of the Art , 2012, ArXiv.

[30]  Tinne Tuytelaars Wide baseline matching , 2014 .

[31]  C. Heaps,et al.  Comparing recollective experience in true and false autobiographical memories. , 2001, Journal of experimental psychology. Learning, memory, and cognition.

[32]  Kun Guo,et al.  Automatic blush detection in “Concealed information” test using visual stimuli , 2010, 2010 International Conference of Soft Computing and Pattern Recognition.

[33]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[34]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Gwen Littlewort,et al.  The computer expression recognition toolbox (CERT) , 2011, Face and Gesture 2011.

[36]  Mohamed Abouelenien,et al.  Trimodal Analysis of Deceptive Behavior , 2015, WMDD@ICMI.

[37]  Jasper R. van Huis,et al.  Automatic detection of suspicious behavior of pickpockets with track-based features in a shopping mall , 2014, Security and Defence.

[38]  L. Manelis,et al.  Effect of Awareness on an Indicator of Cognitive Load , 1979 .

[39]  Andrew Zisserman,et al.  Flowing ConvNets for Human Pose Estimation in Videos , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[40]  Mario Fritz,et al.  Appearance-based gaze estimation in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  A. Vrij,et al.  Outsmarting the Liars: The Benefit of Asking Unanticipated Questions , 2009, Law and human behavior.

[42]  Jitendra Malik,et al.  Human Pose Estimation with Iterative Error Feedback , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Maja Pantic,et al.  The first facial expression recognition and analysis challenge , 2011, Face and Gesture 2011.

[44]  Reyer Zwiggelaar,et al.  Thermal Facial Analysis for Deception Detection , 2014, IEEE Transactions on Information Forensics and Security.

[45]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  B. Depaulo,et al.  Accuracy of Deception Judgments: Appendix A , 2006 .

[47]  Mohamed Abouelenien,et al.  Verbal and Nonverbal Clues for Real-life Deception Detection , 2015, EMNLP.

[48]  Timothy F. Cootes,et al.  Feature Detection and Tracking with Constrained Local Models , 2006, BMVC.

[49]  H. Emrah Tasli,et al.  Deep learning based FACS Action Unit occurrence and intensity estimation , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[50]  E. Loftus Planting misinformation in the human mind: a 30-year investigation of the malleability of memory. , 2005, Learning & memory.

[51]  Cristian Sminchisescu,et al.  Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Klamer Schutte,et al.  Selection of negative samples and two-stage combination of multiple features for action detection in thousands of videos , 2013, Machine Vision and Applications.

[53]  Robin R. Murphy,et al.  Hand gesture recognition with depth images: A review , 2012, 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication.

[54]  Léon J. M. Rothkrantz,et al.  Recognizing Stress Using Semantics and Modulation of Speech and Gestures , 2016, IEEE Transactions on Affective Computing.

[55]  Fernando De la Torre,et al.  Spatio-Temporal Matching for Human Pose Estimation in Video , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  L. Padma Suresh,et al.  on Circuit , Power and Computing Technologies [ ICCPCT ] Literature survey on Face and Face Expression Recognition , 2016 .

[57]  Lin Su,et al.  Does "lie to me" lie to you? An evaluation of facial clues to high-stakes deception , 2016, Comput. Vis. Image Underst..

[58]  Zoran Duric,et al.  Using Image Flow to Detect Eye Blinks in Color Videos , 2007, 2007 IEEE Workshop on Applications of Computer Vision (WACV '07).

[59]  Erik Cambria,et al.  Deep Convolutional Neural Network Textual Features and Multiple Kernel Learning for Utterance-level Multimodal Sentiment Analysis , 2015, EMNLP.

[60]  Yimin Zhou,et al.  A novel finger and hand pose estimation technique for real-time hand gesture recognition , 2016, Pattern Recognit..

[61]  Juan Carlos Niebles,et al.  A Hierarchical Model of Shape and Appearance for Human Action Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[62]  Mircea Nicolescu,et al.  Vision-based hand pose estimation: A review , 2007, Comput. Vis. Image Underst..

[63]  Martial Hebert,et al.  Event Detection in Crowded Videos , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[64]  Paul Ekman,et al.  Lying and nonverbal behavior: Theoretical issues and new findings , 1988 .

[65]  M. Zuckerman Verbal and nonverbal communication of deception , 1981 .

[66]  K. Fiedler,et al.  Training lie detectors to use nonverbal cues instead of global heuristics , 1993 .

[67]  Xuelong Li,et al.  A deep structure for human pose estimation , 2015, Signal Process..

[68]  Venu Govindaraju,et al.  Lie to Me: Deceit detection via online behavioral learning , 2011, Face and Gesture 2011.

[69]  Pawel Strumillo,et al.  Eye-blink detection system for human–computer interaction , 2011, Universal Access in the Information Society.

[70]  Verónica Pérez-Rosas,et al.  Utterance-Level Multimodal Sentiment Analysis , 2013, ACL.

[71]  E. Granholm,et al.  Differentiation of deception using pupillary responses as an index of cognitive processing. , 2001, Psychophysiology.

[72]  Mohamed Abouelenien,et al.  Deception Detection using Real-life Trial Data , 2015, ICMI.

[73]  Paola Campadelli,et al.  Precise Eye Localization through a General-to-specific Model Definition , 2006, BMVC.

[74]  Qiang Ji,et al.  In the Eye of the Beholder: A Survey of Models for Eyes and Gaze , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[75]  Matthew L. Jensen,et al.  Deception detection through automatic, unobtrusive analysis of nonverbal behavior , 2005, IEEE Intelligent Systems.

[76]  Paul Ekman,et al.  Why lies fail and what behaviors betray a lie. , 1989 .

[77]  G. Gudjonsson,et al.  The Psychology of Confessions , 2004, Psychological science in the public interest : a journal of the American Psychological Society.

[78]  Cordelia Schmid,et al.  Estimating Human Pose with Flowing Puppets , 2013, 2013 IEEE International Conference on Computer Vision.

[79]  Peter Robinson,et al.  OpenFace: An open source facial behavior analysis toolkit , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[80]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[81]  Darrin J. Griffin,et al.  Who Told You That? Uncovering the Source of Believed Cues to Deception , 2014 .

[82]  Cristian Sminchisescu,et al.  Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[83]  J. Gross,et al.  Emotional suppression: physiology, self-report, and expressive behavior. , 1993, Journal of personality and social psychology.

[84]  Peter Robinson,et al.  Learning an appearance-based gaze estimator from one million synthesised images , 2016, ETRA.

[85]  Michel Valstar,et al.  Advances, Challenges, and Opportunities in Automatic Facial Expression Recognition , 2016 .

[86]  Paul J. Taylor,et al.  AMAB: Automated measurement and analysis of body motion , 2013, Behavior research methods.

[87]  Aldert Vrij,et al.  A cognitive approach to lie detection: A meta‐analysis , 2017 .

[88]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[89]  Paul J. Taylor,et al.  To freeze or not to freeze A motion-capture approach to detecting deceit , 2015 .

[90]  Aldert Vrij,et al.  Saccadic eye movement rate as a cue to deceit , 2015 .

[91]  Jonathan Tompson,et al.  Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[92]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[93]  J. Shaw,et al.  Constructing Rich False Memories of Committing Crime , 2015, Psychological science.

[94]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[95]  Anupam Agrawal,et al.  Vision based hand gesture recognition for human computer interaction: a survey , 2012, Artificial Intelligence Review.

[96]  P. Ekman,et al.  What the face reveals : basic and applied studies of spontaneous expression using the facial action coding system (FACS) , 2005 .

[97]  R. Johansson,et al.  Hand Movements , 2001 .

[98]  Lijun Yin,et al.  FERA 2015 - second Facial Expression Recognition and Analysis challenge , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[99]  A. Vrij,et al.  Creating suspects in police interviews , 1999 .

[100]  J. Burgoon,et al.  Interpersonal Deception Theory , 2015 .

[101]  Ray Bull,et al.  Increasing Cognitive Load to Facilitate Lie Detection: The Benefit of Recalling an Event in Reverse Order , 2008, Law and human behavior.

[102]  B. Depaulo,et al.  Accuracy of Deception Judgments , 2006, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[103]  Fei Yang,et al.  Is Interactional Dissynchrony a Clue to Deception? Insights From Automated Analysis of Nonverbal Visual Cues , 2015, IEEE Transactions on Cybernetics.

[104]  Pinar Duygulu Sahin,et al.  Recognizing actions from still images , 2008, 2008 19th International Conference on Pattern Recognition.

[105]  Elizabeth F. Loftus,et al.  Current Issues and Advances in Misinformation Research , 2011 .

[106]  Liang-Tien Chia,et al.  Motion Context: A New Representation for Human Action Recognition , 2008, ECCV.

[107]  Leif A. Strömwall,et al.  Repeated interrogations: verbal and non‐verbal cues to deception , 2002 .

[108]  Dacheng Tao,et al.  A Comprehensive Survey on Pose-Invariant Face Recognition , 2015, ACM Trans. Intell. Syst. Technol..