Parallel multichannel blind source separation using a spatial covariance model and nonnegative matrix factorization

[1]  Emmanuel Vincent,et al.  Multichannel Audio Source Separation With Deep Neural Networks , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[2]  Ivan Tashev,et al.  Sound Capture and Processing: Practical Approaches , 2009 .

[3]  Juha Merimaa,et al.  Spatial Impulse Response Rendering I: Analysis and Synthesis , 2005 .

[4]  Hirokazu Kameoka,et al.  Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Masataka Goto,et al.  Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation Based on Integrated Harmonic and Inharmonic Models , 2008, ISMIR.

[6]  Tuomas Virtanen,et al.  Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[7]  Nancy Bertin,et al.  Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[8]  Tuomas Virtanen,et al.  Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Tomohiro Nakatani,et al.  FastMNMF: Joint Diagonalization Based Accelerated Algorithms for Multichannel Nonnegative Matrix Factorization , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Paris Smaragdis,et al.  Extraction of Speech from Mixture Signals , 2012, Techniques for Noise Robustness in Automatic Speech Recognition.

[11]  Antoine Liutkus,et al.  An overview of informed audio source separation , 2013, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS).

[12]  Gaurav Sharma,et al.  Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications , 2016, IEEE Transactions on Multimedia.

[13]  Meinard Müller,et al.  Estimating note intensities in music recordings , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Tuomas Virtanen,et al.  Ieee Transactions on Audio, Speech and Language Processing Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation , 2022 .

[15]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Paris Smaragdis,et al.  Singing-voice separation from monaural recordings using robust principal component analysis , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Heping Ding,et al.  Combining Superdirective Beamforming and Frequency-Domain Blind Source Separation for Highly Reverberant Signals , 2010, EURASIP J. Audio Speech Music. Process..

[18]  Yannick Mahieux,et al.  Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering , 1998, IEEE Trans. Speech Audio Process..

[19]  Hirokazu Kameoka,et al.  Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20]  Kazuyoshi Yoshii,et al.  Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices , 2019, 2019 27th European Signal Processing Conference (EUSIPCO).

[21]  Gaël Richard,et al.  Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[22]  Francis Bach,et al.  Music Source Separation in the Waveform Domain , 2019, ArXiv.

[23]  Søren Holdt Jensen,et al.  Nonlinear Least Squares Methods for Joint DOA and Pitch Estimation , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Axel Röbel,et al.  Sound source separation based on non-negative tensor factorization incorporating spatial cue as prior knowledge , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[25]  Ivan Tashev,et al.  Sound Capture and Processing , 2009 .

[26]  G. Elwyn,et al.  One hundred years ago: Should milk be boiled? , 2002, BMJ : British Medical Journal.

[27]  John McDonough,et al.  Microphone Arrays , 2012, Techniques for Noise Robustness in Automatic Speech Recognition.

[28]  Hiroshi Saruwatari,et al.  Multichannel Non-Negative Matrix Factorization Using Banded Spatial Covariance Matrices in Wavenumber Domain , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[29]  Archontis Politis,et al.  Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CNMF , 2020, ArXiv.

[30]  Jürgen Herre,et al.  Interactive Teleconferencing Combining Spatial Audio Object Coding and DirAC Technology , 2010 .

[31]  Ville Pulkki,et al.  Spatial Sound Reproduction with Directional Audio Coding , 2007 .

[32]  Tatsuya Kawahara,et al.  Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.