Audio source separation using sparse representations

This is the author's final version of the article, first published as A. Nesbit, M. G. Jafari, E. Vincent and M. D. Plumbley. Audio Source Separation Using Sparse Representations. In W. Wang (Ed), Machine Audition: Principles, Algorithms and Systems. Chapter 10, pp. 246-264. IGI Global, 2011. ISBN 978-1-61520-919-4. DOI: 10.4018/978-1-61520-919-4.ch010

[1]  Rémi Gribonval,et al.  BSS_EVAL Toolbox User Guide -- Revision 2.0 , 2005 .

[2]  Wenwu Wang,et al.  Machine Audition: Principles, Algorithms and Systems , 2010 .

[3]  Barak A. Pearlmutter,et al.  Survey of sparse and non‐sparse methods in source separation , 2005, Int. J. Imaging Syst. Technol..

[4]  S. Mallat A wavelet tour of signal processing , 1998 .

[5]  Mark D. Plumbley,et al.  Application of Geometric Dependency Analysis to the Separation of Convolved Mixtures , 2004, ICA.

[6]  Emmanuel Vincent,et al.  Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[8]  Emmanuel Vincent,et al.  Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation , 2009, ICA.

[9]  Manjunath Ramachandra Iyer Differentially Fed Artificial Neural Networks for Speech Signal Prediction , 2007 .

[10]  Zhifeng Zhang,et al.  Adaptive Nonlinear Approximations , 1994 .

[11]  Laurent Daudet,et al.  REPRESENTATIONS OF AUDIO SIGNALS IN OVERCOMPLETE DICTIONARIES: WHAT IS THE LINK BETWEEN REDUNDANCY FACTOR AND CODING PROPERTIES? , 2006 .

[12]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[13]  Mark D. Plumbley,et al.  Probabilistic Modeling Paradigms for Audio Source Separation , 2010 .

[14]  Roger B. Dannenberg,et al.  Remixing Stereo Music with Score-Informed Source Separation , 2006, ISMIR.

[15]  Mark D. Plumbley,et al.  Separation of stereo speech signals based on a sparse dictionary algorithm , 2008, 2008 16th European Signal Processing Conference.

[16]  Rémi Gribonval,et al.  A survey of Sparse Component Analysis for blind source separation: principles, perspectives, and new challenges , 2006, ESANN.

[17]  Michael Elad,et al.  Double Sparsity: Learning Sparse Dictionaries for Sparse Signal Approximation , 2010, IEEE Transactions on Signal Processing.

[18]  Hiroshi Sawada,et al.  Geometrical Interpretation of the PCA Subspace Approach for Overdetermined Blind Source Separation , 2006, EURASIP J. Adv. Signal Process..

[19]  Rémi Gribonval,et al.  Oracle estimators for the benchmarking of source separation algorithms , 2007, Signal Process..

[20]  Emmanuel Vincent,et al.  An adaptive stereo basis method for convolutive blind audio source separation , 2008, Neurocomputing.

[21]  Charles A. Bouman,et al.  Best basis search in lapped dictionaries , 2006, IEEE Transactions on Signal Processing.

[22]  Christopher J. James,et al.  On Semi-Blind Source Separation Using Spatial Constraints With Applications in EEG Analysis , 2006, IEEE Transactions on Biomedical Engineering.

[23]  Andrzej Cichocki,et al.  Sequential blind source separation based exclusively on second-order statistics developed for a class of periodic signals , 2006, IEEE Transactions on Signal Processing.

[24]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[25]  Rayan Saab,et al.  UNDERDETERMINED SPARSE BLIND SOURCE SEPARATION WITH DELAYS , 2005 .

[26]  Remi Gribonval Piecewise linear source separation , 2003, SPIE Optics + Photonics.

[27]  Marek Domanski,et al.  Adaptive dictionaries for matching pursuit with separable decomposition , 2005, 2005 13th European Signal Processing Conference.

[28]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[29]  Arthur C. Graesser,et al.  Applied Natural Language Processing : Identification , Investigation , and Resolution , 2012 .

[30]  Michael T. Orchard,et al.  Flexible tree-structured signal expansions using time-varying wavelet packets , 1997, IEEE Trans. Signal Process..

[31]  Michael Zibulevsky,et al.  Underdetermined blind source separation using sparse representations , 2001, Signal Process..

[32]  C. Févotte,et al.  A STUDY OF THE EFFECT OF SOURCE SPARSITY FOR VARIOUS TRANSFORMS ON BLIND AUDIO SOURCE SEPARATION PERFORMANCE , 2005 .

[33]  Bruno Torrésani,et al.  Hybrid representations for audiophonic signal encoding , 2002, Signal Process..

[34]  Rémi Gribonval,et al.  Audio source separation with one sensor for robust speech recognition , 2003, NOLISP.

[35]  Barak A. Pearlmutter,et al.  Blind Source Separation by Sparse Decomposition in a Signal Dictionary , 2001, Neural Computation.

[36]  Hector Perez-Meana Advances in audio and speech signal processing : technologies and applications , 2007 .

[37]  Danielle S. McNamara,et al.  Applying NLP Metrics to Students’ Self-Explanations , 2012 .