Supervised independent vector analysis through pilot dependent components

Unknown global permutation of the separated sources, time-varying source activity, and underdetermination are common problems affecting on-line Independent Vector Analysis (IVA) when applied to real-world speech enhancement. In this work we propose to extend the IVA signal model by introducing additional supervising components. Pilot signals, which are dependent on the sources, are injected into the multidimensional source representation and act as prior knowledge. The resulting adaptation still maximizes the multivariate source independence while simultaneously forcing the estimation of sources that are dependent on the pilot components. It is also shown that the resulting supervised IVA (S-IVA) is a generalization of the previously proposed weighted Natural Gradient. Numerical evaluations show the effectiveness of the proposed method in challenging real-world applications.
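The idea of injecting a pilot component into the multidimensional source representation can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a batch natural-gradient IVA update with a spherical score function, and it assumes the pilot enters as one extra term in the per-source norm taken across frequency bins. The names `pilot_score`, `siva_step`, the weight `gamma`, and the step size `mu` are hypothetical and chosen only for illustration.

```python
import numpy as np

def pilot_score(Y, P, gamma=1.0, eps=1e-8):
    """Spherical score function with an extra pilot term in the norm (assumed form).

    Y: (K, N, T) complex separated signals (K bins, N sources, T frames).
    P: (N, T) real nonnegative pilot signals, one per source (hypothetical input).
    """
    # Per-source, per-frame norm across all frequency bins plus the pilot component.
    norm = np.sqrt(np.sum(np.abs(Y) ** 2, axis=0) + gamma * P ** 2 + eps)  # (N, T)
    return Y / norm[None, :, :]

def siva_step(W, X, P, mu=0.1, gamma=1.0):
    """One batch natural-gradient update of the demixing matrices.

    W: (K, N, N) complex demixing matrices, X: (K, N, T) complex mixtures.
    Returns the updated matrices and the separated signals computed with the old W.
    """
    K, N, T = X.shape
    Y = W @ X                                   # (K, N, T), batched over bins
    Phi = pilot_score(Y, P, gamma)              # (K, N, T)
    # Sample estimate of E[phi(y) y^H] for every frequency bin.
    R = Phi @ np.conj(np.transpose(Y, (0, 2, 1))) / T   # (K, N, N)
    W_new = W + mu * (np.eye(N)[None] - R) @ W
    return W_new, Y
```

With `P` set to zero, the score function in this sketch reduces to the usual spherical IVA score. With a nonzero pilot, the norm of a given output grows in the frames where its pilot is large, so the adaptation of that output becomes coupled to the pilot's activity.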
