论文信息 - Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection

Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection

Automated systems that detect deception in high-stakes situations can enhance societal well-being across medical, social work, and legal domains. Existing models for detecting high-stakes deception in videos have been supervised, but labeled datasets to train models can rarely be collected for most real-world applications. To address this problem, we propose the first multimodal unsupervised transfer learning approach that detects real-world, high-stakes deception in videos with-out using high-stakes labels. Our subspace-alignment (SA) approach adapts audio-visual representations of deception in lab-controlled low-stakes scenarios to detect deception in real-world, high-stakes situations. Our best unsupervised SA models outperform models without SA, outperform human ability, and perform comparably to a number of existing supervised models. Our research demonstrates the potential for introducing subspace-based transfer learning to model high-stakes deception and other social behaviors in real-world contexts with a scarcity of labeled behavioral data.

Leena Mathur | Maja J Matari'c

[1] Hugo Jair Escalante,et al. High-Level Features for Multimodal Deception Detection in Videos , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[2] Rajiv Bajpai,et al. The Truth and Nothing But the Truth: Multimodal Analysis for Deception Detection , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[3] Zhiwu Lu,et al. Face-Focused Cross-Stream Network for Deception Detection in Videos , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Tinne Tuytelaars,et al. Unsupervised Visual Domain Adaptation Using Subspace Alignment , 2013, 2013 IEEE International Conference on Computer Vision.

[5] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[6] Björn W. Schuller,et al. The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing , 2016, IEEE Transactions on Affective Computing.

[7] Leena Mathur,et al. Introducing Representations of Facial Affect in Automated Multimodal Deception Detection , 2020, ICMI.

[8] Louis-Philippe Morency,et al. Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Md. Kamrul Hasan,et al. Automated Dyadic Data Recorder (ADDR) Framework and Analysis of Facial Cues in Deceptive Communication , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[10] Mohamed Abouelenien,et al. Multimodal deception detection , 2018, The Handbook of Multimodal-Multisensor Interfaces, Volume 2.

[11] W. Dupont,et al. Power and sample size calculations. A review and computer program. , 1990, Controlled clinical trials.

[12] Rahul Gupta,et al. Transfer Learning Between Concepts for Human Behavior Modeling: An Application to Sincerity and Deception Prediction , 2017, INTERSPEECH.

[13] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[14] Björn W. Schuller,et al. Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.

[15] Wenming Zheng,et al. Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition , 2020, IEEE Transactions on Affective Computing.

[16] Andreas W. Kempa-Liehr,et al. Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python package) , 2018, Neurocomputing.

[17] S. Porter,et al. The truth about lies: What works in detecting high‐stakes deception? , 2010 .

[18] Shrikanth S. Narayanan,et al. Weighted geodesic flow kernel for interpersonal mutual influence modeling and emotion recognition in dyadic interactions , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[19] Louis-Philippe Morency,et al. OpenFace 2.0: Facial Behavior Analysis Toolkit , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[20] Larry S. Davis,et al. Deception Detection in Videos , 2017, AAAI.

[21] Shrikanth S. Narayanan,et al. Identifying Truthful Language in Child Interviews , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22] Luigi Cinque,et al. Automatic Deception Detection in RGB videos using Facial Action Units , 2019, ICDSC.

[23] Taylan Sen,et al. Facial Expression Based Imagination Index and a Transfer Learning Approach to Detect Deception , 2019, 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII).

[24] P. Ekman,et al. The ability to detect deceit generalizes across different types of high-stake lies. , 1997, Journal of personality and social psychology.

[25] B. Depaulo,et al. Accuracy of Deception Judgments , 2006, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[26] Jiliang Tang,et al. Toward End-to-End Deception Detection in Videos , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[27] D. Lakens,et al. Why Psychologists Should by Default Use Welch's t-test Instead of Student's t-test with Unequal Group Sizes , 2017 .

[28] Mohamed Abouelenien,et al. Deception Detection using Real-life Trial Data , 2015, ICMI.