Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection

Automated systems that detect deception in high-stakes situations can enhance societal well-being across medical, social work, and legal domains. Existing models for detecting high-stakes deception in videos have been supervised, but labeled datasets to train models can rarely be collected for most real-world applications. To address this problem, we propose the first multimodal unsupervised transfer learning approach that detects real-world, high-stakes deception in videos with-out using high-stakes labels. Our subspace-alignment (SA) approach adapts audio-visual representations of deception in lab-controlled low-stakes scenarios to detect deception in real-world, high-stakes situations. Our best unsupervised SA models outperform models without SA, outperform human ability, and perform comparably to a number of existing supervised models. Our research demonstrates the potential for introducing subspace-based transfer learning to model high-stakes deception and other social behaviors in real-world contexts with a scarcity of labeled behavioral data.

[1]  Hugo Jair Escalante,et al.  High-Level Features for Multimodal Deception Detection in Videos , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[2]  Rajiv Bajpai,et al.  The Truth and Nothing But the Truth: Multimodal Analysis for Deception Detection , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[3]  Zhiwu Lu,et al.  Face-Focused Cross-Stream Network for Deception Detection in Videos , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Tinne Tuytelaars,et al.  Unsupervised Visual Domain Adaptation Using Subspace Alignment , 2013, 2013 IEEE International Conference on Computer Vision.

[5]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[6]  Björn W. Schuller,et al.  The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing , 2016, IEEE Transactions on Affective Computing.

[7]  Leena Mathur,et al.  Introducing Representations of Facial Affect in Automated Multimodal Deception Detection , 2020, ICMI.

[8]  Louis-Philippe Morency,et al.  Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Md. Kamrul Hasan,et al.  Automated Dyadic Data Recorder (ADDR) Framework and Analysis of Facial Cues in Deceptive Communication , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[10]  Mohamed Abouelenien,et al.  Multimodal deception detection , 2018, The Handbook of Multimodal-Multisensor Interfaces, Volume 2.

[11]  W. Dupont,et al.  Power and sample size calculations. A review and computer program. , 1990, Controlled clinical trials.

[12]  Rahul Gupta,et al.  Transfer Learning Between Concepts for Human Behavior Modeling: An Application to Sincerity and Deception Prediction , 2017, INTERSPEECH.

[13]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[14]  Björn W. Schuller,et al.  Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.

[15]  Wenming Zheng,et al.  Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition , 2020, IEEE Transactions on Affective Computing.

[16]  Andreas W. Kempa-Liehr,et al.  Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python package) , 2018, Neurocomputing.

[17]  S. Porter,et al.  The truth about lies: What works in detecting high‐stakes deception? , 2010 .

[18]  Shrikanth S. Narayanan,et al.  Weighted geodesic flow kernel for interpersonal mutual influence modeling and emotion recognition in dyadic interactions , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[19]  Louis-Philippe Morency,et al.  OpenFace 2.0: Facial Behavior Analysis Toolkit , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[20]  Larry S. Davis,et al.  Deception Detection in Videos , 2017, AAAI.

[21]  Shrikanth S. Narayanan,et al.  Identifying Truthful Language in Child Interviews , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Luigi Cinque,et al.  Automatic Deception Detection in RGB videos using Facial Action Units , 2019, ICDSC.

[23]  Taylan Sen,et al.  Facial Expression Based Imagination Index and a Transfer Learning Approach to Detect Deception , 2019, 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII).

[24]  P. Ekman,et al.  The ability to detect deceit generalizes across different types of high-stake lies. , 1997, Journal of personality and social psychology.

[25]  B. Depaulo,et al.  Accuracy of Deception Judgments , 2006, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[26]  Jiliang Tang,et al.  Toward End-to-End Deception Detection in Videos , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[27]  D. Lakens,et al.  Why Psychologists Should by Default Use Welch's t-test Instead of Student's t-test with Unequal Group Sizes , 2017 .

[28]  Mohamed Abouelenien,et al.  Deception Detection using Real-life Trial Data , 2015, ICMI.