Data Augmentation for Speaker Identification under Stress Conditions to Combat Gender-Based Violence

[1]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[2]  Iain R. Murray,et al.  Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. , 1993, The Journal of the Acoustical Society of America.

[3]  Carlos Busso,et al.  IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.

[4]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[5]  George Trigeorgis,et al.  Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Nengheng Zheng,et al.  Integration of Complementary Acoustic Features for Speaker Recognition , 2007, IEEE Signal Processing Letters.

[7]  S. Dandapat,et al.  Speaker recognition under stressed condition , 2010, Int. J. Speech Technol..

[8]  Rigas Kotsakis,et al.  Speech Emotion Recognition Adapted to Multimodal Semantic Repositories , 2018, 2018 13th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP).

[9]  J.H.L. Hansen,et al.  UT-Scope: Speech under Lombard Effect and Cognitive Stress , 2007, 2007 IEEE Aerospace Conference.

[10]  Gholamreza Anbarjafari,et al.  Supervised Vocal-Based Emotion Recognition Using Multiclass Support Vector Machine, Random Forests, and Adaboost , 2017 .

[11]  Ruili Wang,et al.  Speaker identification features extraction methods: A systematic review , 2017, Expert Syst. Appl..

[12]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[13]  Yun Lei,et al.  A novel scheme for speaker recognition using a phonetically-aware deep neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Goutam Saha,et al.  Speaker verification with short utterances: a review of challenges, trends and opportunities , 2017, IET Biom..

[15]  Douglas E. Sturim,et al.  Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[16]  Björn W. Schuller,et al.  Speech emotion recognition , 2018, Commun. ACM.

[17]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  John H. L. Hansen,et al.  Speech under stress conditions: overview of the effect on speech production and on system performance , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[19]  Carmen Peláez-Moreno,et al.  Speaker Recognition under Stress Conditions , 2018, IberSPEECH.

[20]  Ascensión Gallardo-Antolín,et al.  Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers , 2018, Neural Computing and Applications.

[21]  Rigas Kotsakis,et al.  Speech Emotion Recognition for Performance Interaction , 2018, Journal of the Audio Engineering Society.

[22]  Yang Liu,et al.  A novel method for the detection of R-peaks in ECG based on K-Nearest Neighbors and Particle Swarm Optimization , 2017, EURASIP Journal on Advances in Signal Processing.

[23]  Theodoros Iliou,et al.  Features and classifiers for emotion recognition from speech: a survey from 2000 to 2011 , 2012, Artificial Intelligence Review.

[24]  Christopher Joseph Pal,et al.  EmoNets: Multimodal deep learning approaches for emotion recognition in video , 2015, Journal on Multimodal User Interfaces.

[25]  Walid Mahdi,et al.  Improving speech recognition using data augmentation and acoustic model fusion , 2017, KES.

[26]  Wen Gao,et al.  Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching , 2018, IEEE Transactions on Multimedia.