Method and apparatus for extracting the target sound signal from the mixed sound
A method and an apparatus for extracting a target sound signal from a mixed signal are provided. By applying a non-linear filter that adapts to an interference noise ratio, they extract from the mixed signal a target audio signal with a high PESQ (Perceptual Evaluation of Speech Quality) score. A suppressing-signal beamformer (222) produces a signal whose directivity is suppressed toward the direction of the target sound source. An emphasizing-signal beamformer emphasizes the sound pressure arriving from a specific target sound source. A microphone array improves the signal amplitude by applying an appropriate weight to each microphone signal. A beamformer spatially reduces the noise in the interference noise signal and the target signal. An adder sums the sound signals input through the microphone array. A delay-and-sum algorithm locates the sound source from the relative delays with which a signal reaches the individual microphones.
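The delay-and-sum idea described above can be sketched in a few lines. This is a minimal illustration and not the patented implementation: it assumes integer sample delays, a far-field source, and a uniform linear array. The function names, the synthetic test pulse, and the delay-and-subtract step used to stand in for a "suppressing" beam are all hypothetical choices made for this example.

```python
import math

def estimate_delay(ref, other, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation,
    i.e. the lag for which other[i + lag] best matches ref[i]."""
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = sum(ref[i] * other[i + lag]
                    for i in range(len(ref)) if 0 <= i + lag < len(other))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag

def align(channel, delay):
    """Advance a channel by `delay` samples, zero-padding at the edges."""
    n = len(channel)
    return [channel[i + delay] if 0 <= i + delay < n else 0.0 for i in range(n)]

def delay_and_sum(channels, delays):
    """Emphasizing beam: align every channel on the target, then average."""
    aligned = [align(ch, d) for ch, d in zip(channels, delays)]
    return [sum(col) / len(channels) for col in zip(*aligned)]

def delay_and_subtract(channels, delays):
    """Suppressing beam (assumed technique): align two channels on the
    target and subtract, which places a null in the target direction."""
    a, b = align(channels[0], delays[0]), align(channels[1], delays[1])
    return [x - y for x, y in zip(a, b)]

def angle_from_delay(lag, sample_rate, spacing, speed_of_sound=343.0):
    """Far-field arrival angle in degrees for one microphone pair.
    The ratio is clamped so rounding errors cannot break asin."""
    ratio = speed_of_sound * lag / (sample_rate * spacing)
    return math.degrees(math.asin(max(-1.0, min(1.0, ratio))))

# Synthetic demo: one smooth pulse reaching three microphones,
# each 3 samples later than the previous one (assumed values).
n, step = 64, 3
source = [math.exp(-((i - 20) / 3.0) ** 2) for i in range(n)]
channels = [[source[i - k * step] if i - k * step >= 0 else 0.0 for i in range(n)]
            for k in range(3)]

delays = [estimate_delay(channels[0], ch, max_lag=10) for ch in channels]
emphasized = delay_and_sum(channels, delays)
suppressed = delay_and_subtract(channels, delays)
angle = angle_from_delay(delays[1], sample_rate=16000, spacing=0.1)
```

Here `delays` holds the relative arrival delays recovered by cross-correlation, `emphasized` is the aligned average in which the target adds coherently, and `suppressed` is the aligned difference in which the target cancels. A real system would typically need fractional delays and per-channel weights, which this sketch leaves out.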