Lying Speech Characteristic Extraction Based on SSAE Deep Learning Model

Lie speech detection is a typical psychological calculation problem. As the lie information is hidden in speech flow and cannot be easily found, so lie speech is a complex research object. Lie speech detection is not only need to pay attention to the surface information such as words, symbols and sentence, it is more important to pay attention to the internal essence structure characteristics. Therefore, based on the study of speech signal sparse representation, this paper proposes a Stack Sparse Automatic Encoder (SSAE) deep learning model for lying speech characteristics extraction. The proposed method is an effective one, it can reflect people’s deep lying characteristics, and weaken lying person’s personality traits. The deep characteristics compensate the lack of lie expression of basic acoustic features. This improved the lying state correct recognition rate. The experimental results show that, due to the introduction of deep learning characteristics, the individual lying recognition rate has increased by 4%–10%. This result suggests that, the lie detection based on speech analysis method is feasible. Furthermore, the proposed lying state detection based on speech characteristic provides a new research way of psychological calculation.

[1]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[2]  Xiangang Li,et al.  A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition , 2013, Neurocomputing.

[3]  Dapeng Oliver Wu,et al.  Why Deep Learning Works: A Manifold Disentanglement Perspective , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Yi Jiang,et al.  Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[5]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[6]  Leslie M. Collins,et al.  Financial fraud detection using vocal, linguistic and financial cues , 2015, Decis. Support Syst..

[7]  Jun Du,et al.  A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[9]  Amy‐May Leach,et al.  You must be lying because I don't understand you: Language proficiency and lie detection. , 2016, Journal of experimental psychology. Applied.

[10]  Xiao-Lei Zhang,et al.  Deep Belief Networks Based Voice Activity Detection , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Meng Cai,et al.  Deep maxout neural networks for speech recognition , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.