论文信息 - Multivariate speech activity dector based on the syllable rate

Multivariate speech activity dector based on the syllable rate

Computationally efficient algorithms which perform speech activity detection have significant potential economic and labor saving benefit, by automating an extremely tedious manual process. In many applications, it is desirable to extract intervals of speech which are obtained by segments of other signal types. In the past, algorithms which successfully discriminate between speech and one specific other signal type have been developed. Frequently, these algorithms fail when the specific non-speech signal is replaced by a different non-speech discrimination problem. Typically, several signal specific discriminators are blindly combined with predictable negative results. Moreover, when a large number of discriminators are involved, dimensions reduction is achieved using Principal Components, which optimally compresses signal variance into the fewest number of dimensions. Unfortunately, these new coordinates are not necessarily optimal for discrimination. In this paper we apply graphical tools to determine a set of discriminators which produce excellent speech vs. non-clustering, thereby eliminating the guesswork in selecting good feature vectors. This cluster structure provides a basis for a general multivariate speech vs. non-speech discriminator, which compares very favorably with the TALKATIVE speech extraction algorithm.

Douglas J. Nelson | David C. Smith | Jeffrey N. Townsend

[1] Douglas J. Nelson,et al. Pitch-based methods for speech detection and automatic frequency recovery , 1995, Optics & Photonics.

[2] Douglas J. Nelson,et al. Assessing the Performance of Three Methods for Separating Non-Spontaneous and Spontaneous Speech Through Simulation , 1998, Simul..

[3] Douglas Nelson,et al. Special purpose correlation functions for improved signal detection and parameter estimation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4] Alvin F. Martin,et al. The DET curve in assessment of detection task performance , 1997, EUROSPEECH.