Phase and level difference fusion for robust multichannel source separation

Inter-channel phase (IPD) and level (ILD) differences are common features in multichannel source separation algorithms like DUET and MENUET. However, their utility depends strongly on the configuration of the array and what microphone pairs are used to calculate them. IPDs are most useful when extracted from microphones that are close together as this avoids spatial aliasing. In contrast, ILD clusters are only well separated for widely spaced microphones. We investigate this trade-off between IPD and ILD features and propose a method to best combine them for multichannel source separation. Experimental results demonstrate the utility of this approach.

[1]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[2]  Hiroshi Sawada,et al.  Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors , 2007, Signal Process..

[3]  Rob A. Rutenbar,et al.  Stereophonic spectrogram segmentation using Markov random fields , 2012, 2012 IEEE International Workshop on Machine Learning for Signal Processing.

[4]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[5]  John T. Scott,et al.  Fundamentals of musical acoustics , 1976 .

[6]  B. S. Manjunath,et al.  The multiRANSAC algorithm and its application to detect planar homographies , 2005, IEEE International Conference on Image Processing 2005.

[7]  Nikolaos Mitianoudis,et al.  A Generalized Directional Laplacian Distribution : Estimation, Mixture Models and Audio Source Separation , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  P. Smaragdis,et al.  Learning source trajectories using wrapped-phase hidden Markov models , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[10]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[11]  Kanti V. Mardia,et al.  Statistics of Directional Data , 1972 .

[12]  Arthur H. Benade Acoustical measurements: Methods and instrumentation , 1984 .

[13]  Richard M. Stern,et al.  Two-microphone source separation algorithm based on statistical modeling of angle distributions , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Paris Smaragdis,et al.  Blind multi-channel source separation by circular-linear statistical modeling of phase differences , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.