Method and apparatus for extracting the target sound signal from the mixed sound

A method and an apparatus for extracting a target sound signal from a mixed signal are provided. They extract a target audio signal with a high PESQ (Perceptual Evaluation of Speech Quality) score from the mixed signal by using a non-linear filter that adapts to the interference noise ratio. A suppressing signal beamformer (222) produces a signal whose directivity is suppressed toward the target sound source direction. An emphasizing signal beamformer emphasizes the sound pressure of a specific target sound source. The microphone array increases amplitude by applying a suitable weight to each received signal. A beamformer spatially reduces noise in the interference noise signal and the target signal. An adder sums the sound signals input through the microphone array. A delay-and-sum algorithm finds the location of the sound source from the relative delay with which a signal reaches each microphone.
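The delay-and-sum idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function names `estimate_delay` and `delay_and_sum`, the integer-sample delays, the equal channel weights, and the circular shift via `np.roll` are all assumptions made for illustration.

```python
import numpy as np


def estimate_delay(reference, other):
    """Estimate the integer sample delay of `other` relative to `reference`.

    Assumption: the delay is taken as the lag of the cross-correlation peak.
    """
    corr = np.correlate(other, reference, mode="full")
    # In "full" mode, index len(reference) - 1 corresponds to zero lag.
    return int(np.argmax(corr)) - (len(reference) - 1)


def delay_and_sum(signals, delays):
    """Align each microphone channel by its delay and average the channels.

    signals: array of shape (num_mics, num_samples)
    delays:  per-channel arrival delay in samples (hypothetical integer values)
    """
    signals = np.asarray(signals, dtype=float)
    num_mics, num_samples = signals.shape
    out = np.zeros(num_samples)
    for channel, delay in zip(signals, delays):
        # Advance the channel by its delay so the target lines up across mics.
        # np.roll is a circular shift, acceptable only for this short sketch.
        out += np.roll(channel, -int(delay))
    return out / num_mics  # equal weights; an array could weight channels


# Usage: a pulse reaching four microphones with different delays.
target = np.zeros(256)
target[100:110] = 1.0
true_delays = [0, 3, 6, 9]
mics = np.stack([np.roll(target, d) for d in true_delays])

estimated = [estimate_delay(mics[0], channel) for channel in mics]
enhanced = delay_and_sum(mics, estimated)
```

Signals arriving from the steered direction add coherently after alignment, while sound from other directions is misaligned and partly averaged out, which is the spatial emphasis the abstract attributes to the emphasizing beamformer.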

정재훈 | 김규홍 | 정소영 | 오광철