Exploiting the directional coherence function for multichannel source extraction

Wenju Liu | Zhanlei Yang | Shan Liang | Guanjun Li | Shuai Nie | Jianhua Tao

[1] R.A. Kennedy, et al. Spatial correlation for general distributions of scatterers, 2002, IEEE Signal Processing Letters.

[2] Rémi Gribonval, et al. Performance measurement in blind audio source separation, 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[3] J. Capon. High-resolution frequency-wavenumber spectrum analysis, 1969.

[4] Stephen P. Boyd, et al. Applications of second-order cone programming, 1998.

[5] Jacob Benesty, et al. Microphone array beamforming based on maximization of the front-to-back ratio, 2018, The Journal of the Acoustical Society of America.

[6] Giovanni Del Galdo, et al. On the spatial coherence in mixed sound fields and its application to signal-to-diffuse ratio estimation, 2012, The Journal of the Acoustical Society of America.

[7] Hiroshi Sawada, et al. Sparse source separation based on simultaneous clustering of source locational and spectral features, 2011.

[8] Shefeng Yan, et al. Robust supergain beamforming for circular array via second-order cone programming, 2005.

[9] Ehud Weinstein, et al. Signal enhancement using beamforming and nonstationarity with applications to speech, 2001, IEEE Trans. Signal Process.

[10] O. L. Frost, et al. An algorithm for linearly constrained adaptive array processing, 1972.

[11] Karl-Dirk Kammeyer, et al. Theoretical noise reduction limits of the generalized sidelobe canceller (GSC) for speech enhancement, 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).

[12] DeLiang Wang, et al. Deep Learning Based Binaural Speech Separation in Reverberant Environments, 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13] Jacob Benesty, et al. On Robust and High Directive Beamforming With Small-Spacing Microphone Arrays for Scattered Sources, 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[14] Emanuel A. P. Habets, et al. MMSE-Based Blind Source Extraction in Diffuse Noise Fields Using a Complex Coherence-Based a Priori SAP Estimator, 2012, IWAENC.

[15] Sharon Gannot, et al. Adaptive Beamforming and Postfiltering, 2008.

[16] I. Cohen, et al. Generating nonstationary multisensor signals under a spatial coherence constraint, 2008, The Journal of the Acoustical Society of America.

[17] Tomohiro Nakatani, et al. Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR, 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[18] Emanuel A. P. Habets, et al. New Insights Into the MVDR Beamformer in Room Acoustics, 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[19] J. Flanagan, et al. Computer-steered microphone arrays for sound transduction in large rooms, 1985.

[20] Dong Yu, et al. Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information, 2019, INTERSPEECH.

[21] Jacob Benesty, et al. An Integrated Solution for Online Multichannel Noise Tracking and Reduction, 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[22] Daniel Povey, et al. The Kaldi Speech Recognition Toolkit, 2011.

[23] Emanuel A. P. Habets, et al. DOA-informed source extraction in the presence of competing talkers and background noise, 2017, EURASIP J. Adv. Signal Process.

[24] Tomohiro Nakatani, et al. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures, 2019, IEEE Journal of Selected Topics in Signal Processing.

[25] Walter Kellermann, et al. Coherent-to-Diffuse Power Ratio Estimation for Dereverberation, 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[26] Philipos C. Loizou, et al. A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function, 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[27] DeLiang Wang, et al. Supervised Speech Separation Based on Deep Learning: An Overview, 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[28] Zhuo Chen, et al. Deep clustering: Discriminative embeddings for segmentation and separation, 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29] E. Lehmann, et al. Prediction of energy decay in room impulse responses simulated with an image-source model, 2008, The Journal of the Acoustical Society of America.

[30] L. J. Griffiths, et al. An alternative approach to linearly constrained adaptive beamforming, 1982.

[31] Hiroshi Sawada, et al. A novel blind source separation method with observation vector clustering, 2005.

[32] Mark A. Poletti, et al. Spatially Robust Far-field Beamforming Using the von Mises(-Fisher) Distribution, 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[33] H. Cox. Resolving power and sensitivity to mismatch of optimum array processors, 1973.

[34] Wei Jiang, et al. The analysis of the simplification from the ideal ratio to binary mask in signal-to-noise ratio sense, 2014, Speech Commun.

[35] Jon Barker, et al. An analysis of environment, microphone and data simulation mismatches in robust speech recognition, 2017, Comput. Speech Lang.

[36] Wenju Liu, et al. Deep Learning Based Speech Separation via NMF-Style Reconstructions, 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.