论文信息 - Data Augmentation for Speaker Identification under Stress Conditions to Combat Gender-Based Violence - 字舞流文

Data Augmentation for Speaker Identification under Stress Conditions to Combat Gender-Based Violence

Carmen Peláez-Moreno | Ascensión Gallardo-Antolín | Esther Rituerto-González | Alba Mínguez-Sánchez | Carmen Peláez-Moreno | A. Gallardo-Antolín | Esther Rituerto-González | Alba Mínguez-Sánchez

[1] Jr. J.P. Campbell,et al. Speaker recognition: a tutorial , 1997, Proc. IEEE.

[2] Iain R. Murray,et al. Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. , 1993, The Journal of the Acoustical Society of America.

[3] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.

[4] Nitesh V. Chawla,et al. SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[5] George Trigeorgis,et al. Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6] Nengheng Zheng,et al. Integration of Complementary Acoustic Features for Speaker Recognition , 2007, IEEE Signal Processing Letters.

[7] S. Dandapat,et al. Speaker recognition under stressed condition , 2010, Int. J. Speech Technol..

[8] Rigas Kotsakis,et al. Speech Emotion Recognition Adapted to Multimodal Semantic Repositories , 2018, 2018 13th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP).

[9] J.H.L. Hansen,et al. UT-Scope: Speech under Lombard Effect and Cognitive Stress , 2007, 2007 IEEE Aerospace Conference.

[10] Gholamreza Anbarjafari,et al. Supervised Vocal-Based Emotion Recognition Using Multiclass Support Vector Machine, Random Forests, and Adaboost , 2017 .

[11] Ruili Wang,et al. Speaker identification features extraction methods: A systematic review , 2017, Expert Syst. Appl..

[12] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[13] Yun Lei,et al. A novel scheme for speaker recognition using a phonetically-aware deep neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14] Goutam Saha,et al. Speaker verification with short utterances: a review of challenges, trends and opportunities , 2017, IET Biom..

[15] Douglas E. Sturim,et al. Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[16] Björn W. Schuller,et al. Speech emotion recognition , 2018, Commun. ACM.

[17] Patrice Y. Simard,et al. Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18] John H. L. Hansen,et al. Speech under stress conditions: overview of the effect on speech production and on system performance , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[19] Carmen Peláez-Moreno,et al. Speaker Recognition under Stress Conditions , 2018, IberSPEECH.

[20] Ascensión Gallardo-Antolín,et al. Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers , 2018, Neural Computing and Applications.

[21] Rigas Kotsakis,et al. Speech Emotion Recognition for Performance Interaction , 2018, Journal of the Audio Engineering Society.

[22] Yang Liu,et al. A novel method for the detection of R-peaks in ECG based on K-Nearest Neighbors and Particle Swarm Optimization , 2017, EURASIP Journal on Advances in Signal Processing.

[23] Theodoros Iliou,et al. Features and classifiers for emotion recognition from speech: a survey from 2000 to 2011 , 2012, Artificial Intelligence Review.

[24] Christopher Joseph Pal,et al. EmoNets: Multimodal deep learning approaches for emotion recognition in video , 2015, Journal on Multimodal User Interfaces.

[25] Walid Mahdi,et al. Improving speech recognition using data augmentation and acoustic model fusion , 2017, KES.

[26] Wen Gao,et al. Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching , 2018, IEEE Transactions on Multimedia.