Segregation of stop consonants from acoustic interference

Speech segregation from acoustic interference is a very challenging task. Previous systems have dealt with voiced speech with success, but they cannot handle unvoiced speech. We study the segregation of stop consonants, which contain significant unvoiced signals. We propose a novel method that employs onset as a major cue to segregate stop consonants. Our system first detects stops through onset detection and Bayesian classification of acoustic-phonetic features, and then performs grouping based on onset coincidence. The system has been tested and performs well on utterances mixed with various types of interference.

[1]  R Meddis,et al.  Simulation of auditory-neural transduction: further studies. , 1988, The Journal of the Acoustical Society of America.

[2]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[3]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[4]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[5]  DeLiang Wang,et al.  Monaural Speech Separation , 2002, NIPS.

[6]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[7]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[8]  Jan Van der Spiegel,et al.  Acoustic-phonetic features for the automatic classification of stop consonants , 2001, IEEE Trans. Speech Audio Process..

[9]  Guy J. Brown,et al.  Computational auditory scene analysis , 1994, Comput. Speech Lang..