Paper information - Scale-invariant probabilistic latent component analysis

Scale-invariant probabilistic latent component analysis

In this paper, we present a new method for decomposing musical spectrograms. This method is similar to shift-invariant Probabilistic Latent Component Analysis, but whereas the latter works with constant-Q spectrograms (i.e. with a logarithmic frequency resolution), our technique is designed to decompose standard short-time Fourier transform spectrograms (i.e. with a linear frequency resolution). This makes it possible to easily reconstruct the latent signals, which can be useful for source separation.
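
The difference between the two frequency resolutions can be illustrated with a small sketch. This is not the method of the paper, only a numerical illustration of why a change of pitch appears as a shift on a logarithmic frequency axis but as a scaling on a linear one. The fundamental frequency, the transposition interval, and the helper name `partials` are hypothetical values chosen for the example.

```python
import math

def partials(f0, n_partials=5):
    """Frequencies (Hz) of the first harmonic partials of a note with fundamental f0."""
    return [k * f0 for k in range(1, n_partials + 1)]

f0 = 220.0             # hypothetical fundamental frequency in Hz
ratio = 2 ** (7 / 12)  # hypothetical transposition: seven equal-tempered semitones up

low = partials(f0)
high = partials(f0 * ratio)

# Linear frequency axis (as in an STFT): every partial is multiplied by the
# same factor, so the harmonic pattern is stretched (scaled), not translated.
linear_ratios = [h / l for l, h in zip(low, high)]
linear_offsets = [h - l for l, h in zip(low, high)]

# Logarithmic frequency axis (as in a constant-Q transform): every partial
# moves by the same offset, so the harmonic pattern is translated (shifted).
log_offsets = [math.log2(h) - math.log2(l) for l, h in zip(low, high)]

print("ratio per partial (linear axis):  ", linear_ratios)
print("offset per partial (linear axis): ", linear_offsets)
print("offset per partial (log2 axis):   ", log_offsets)
```

On the linear axis the per-partial offsets grow with the partial index while the ratios stay constant; on the logarithmic axis the offsets themselves are constant. A model meant to share one spectral template across pitches therefore needs scale invariance rather than shift invariance when it operates on a linear frequency axis.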

Roland Badeau | Romain Hennequin | Bertrand David
