Whispered Speech Database: Design, Processing and Application

This paper presents creation of a whispered speech database Whi-Spe for Serbian language. The database has been collected in order to investigate how well the whisper is used by humans in intelligible verbal communication and how well whispered information can be used in human-computer communication. The database consists of 50 isolated words. They are generated by ten speakers (five male and five female). Each of them pronounced this vocabulary ten times in two modes: normal and whispered. So, the database contains 5.000 pairs of normal/whispered pronunciations. Database evaluation was performed by an analysis of specific manifestations in whispered articulation. Finally, the preliminary results in whispering recognition by using of HMM, ANN and DTW techniques are presented.

[1]  S. Jovi Serbian emotional speech database : design , processing and evaluation , 2004 .

[2]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[3]  Kazuya Takeda,et al.  Analysis and recognition of whispered speech , 2005, Speech Commun..

[4]  N. Jakovljevic,et al.  Description of Training Procedure for AlfaNum Continuous Speech Recognition System , 2005, EUROCON 2005 - The International Conference on "Computer as a Tool".

[5]  Shirley Gherson,et al.  Laryngeal hyperfunction during whispering: reality or myth? , 2006, Journal of voice : official journal of the Voice Foundation.

[6]  Chi Zhang,et al.  Whisper-Island Detection Based on Unsupervised Segmentation With Entropy-Based Speech Feature Processing , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  P. Ladefoged,et al.  Fundamental problems in phonetics , 1977 .

[8]  Hamid Reza,et al.  Voiced Speech from Whispers for Post-Laryngectomised Patients , 2009 .

[9]  Hideki Kasuya,et al.  Acoustic nature of the whisper , 1999, EUROSPEECH.

[10]  Mark Beale,et al.  Neural Network Toolbox™ User's Guide , 2015 .

[11]  Tanja Schultz,et al.  Whispery speech recognition using adapted articulatory features , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[12]  Johan Sundberg,et al.  Whispering--a single-subject study of glottal configuration and aerodynamics. , 2010, Journal of voice : official journal of the Voice Foundation.

[13]  John H. L. Hansen,et al.  Speaker Identification Within Whispered Speech Audio Streams , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  John H. L. Hansen,et al.  Analysis and classification of speech mode: whispered through shouted , 2007, INTERSPEECH.

[15]  S. Jovicic,et al.  Acoustic analysis of consonants in whispered speech. , 2008, Journal of voice : official journal of the Voice Foundation.