Unsupervised detection of whispered speech in the presence of normal phonation

The results of an investigation into unsupervised detection of whispered speech segments in the presence of normally phonated speech are presented. The Whispered Speech Detection system presented here extracts features which exploit both waveform energy and periodicity. Unsupervised classification of these features was performed to identify and label long segments (approx. 2 2.5 seconds) of whispered speech which is typically an indication of criminal activity over telephone networks, for instance, in a correctional facility environment. Experiments indicate that it is possible to automatically detect long segments of whispering in the presence of normally phonated speech; testing of the algorithm presented in this paper yields promising results in correct identification of whispered speech segments.

[1]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[2]  Kazuya Takeda,et al.  Acoustic analysis and recognition of whispered speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[4]  Stanley J. Wenndt,et al.  A study on the classification of whispered and normally phonated speech , 2002, INTERSPEECH.

[5]  Johnny B Wilson,et al.  A Comparative Analysis of Whispered and Normally Phonated Speech Using an LPC-10 Vocoder, , 1985 .