Paper information - Sound Source Separation: Azimuth Discrimination and Resynthesis

Sound Source Separation: Azimuth Discrimination and Resynthesis

In this paper we present a novel sound source separation algorithm that requires no prior knowledge and no learning, assisted or otherwise, and performs separation based purely on azimuth discrimination within the stereo field. The algorithm exploits the use of the pan pot as a means of achieving image localisation within stereophonic recordings. As such, only an interaural intensity difference exists between the left and right channels for a single source. We use gain scaling and phase cancellation techniques to expose frequency-dependent nulls across the azimuth domain, from which source separation and resynthesis are carried out. We present results obtained from real recordings and show that, for musical recordings, the algorithm improves upon the output quality of current source separation schemes.
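The gain scaling and phase cancellation idea can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `azimuth_nulls`, the resolution parameter `beta`, and the toy signal are assumptions made for illustration, and it only covers a source that is weaker in the left channel than in the right. For each frequency bin, the right channel is scaled by a set of candidate gains and subtracted from the left channel; the gain at which the magnitude reaches a null indicates the intensity ratio, and hence the pan position, of the source occupying that bin.

```python
import numpy as np

def azimuth_nulls(left, right, beta=100):
    """Hypothetical helper: per frequency bin, find the gain g in [0, 1]
    that minimises |L(k) - g * R(k)| for one frame of stereo audio."""
    L = np.fft.rfft(left)
    R = np.fft.rfft(right)
    gains = np.arange(beta + 1) / beta  # candidate gains 0, 1/beta, ..., 1
    # Frequency-azimuth plane: rows are frequency bins, columns are gains.
    plane = np.abs(L[:, None] - gains[None, :] * R[:, None])
    return gains[np.argmin(plane, axis=1)], plane

# Toy frame (assumption): one sinusoid at bin 64, panned so left = 0.5 * right.
n = 1024
t = np.arange(n)
source = np.sin(2 * np.pi * 64 * t / n)
right = source
left = 0.5 * source

est, plane = azimuth_nulls(left, right)
bin_gain = est[64]  # estimated intensity ratio for the bin holding the source
print(bin_gain)
```

In this toy case the null for bin 64 falls at the gain equal to the panning ratio. Bins with no energy have no meaningful null, so a practical version would need to ignore them and would process overlapping short-time frames rather than a single frame.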
