Exploiting the directional coherence function for multichannel source extraction

[1]  R.A. Kennedy,et al.  Spatial correlation for general distributions of scatterers , 2002, IEEE Signal Processing Letters.

[2]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[4]  Stephen P. Boyd,et al.  Applications of second-order cone programming , 1998 .

[5]  Jacob Benesty,et al.  Microphone array beamforming based on maximization of the front-to-back ratio. , 2018, The Journal of the Acoustical Society of America.

[6]  Giovanni Del Galdo,et al.  On the spatial coherence in mixed sound fields and its application to signal-to-diffuse ratio estimation. , 2012, The Journal of the Acoustical Society of America.

[7]  Hiroshi Sawada,et al.  Sparse source separation based on simultaneous clustering of source locational and spectral features , 2011 .

[8]  Shefeng Yan,et al.  Robust supergain beamforming for circular array via second-order cone programming ☆ , 2005 .

[9]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[10]  O. L. Frost,et al.  An algorithm for linearly constrained adaptive array processing , 1972 .

[11]  Karl-Dirk Kammeyer,et al.  Theoretical noise reduction limits of the generalized sidelobe canceller (GSC) for speech enhancement , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[12]  DeLiang Wang,et al.  Deep Learning Based Binaural Speech Separation in Reverberant Environments , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13]  Jacob Benesty,et al.  On Robust and High Directive Beamforming With Small-Spacing Microphone Arrays for Scattered Sources , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[14]  Emanuel A. P. Habets,et al.  MMSE-Based Blind Source Extraction in Diffuse Noise Fields Using a Complex Coherence-Based a Priori SAP Estimator , 2012, IWAENC.

[15]  Sharon Gannot,et al.  Adaptive Beamforming and Postfiltering , 2008 .

[16]  I. Cohen,et al.  Generating nonstationary multisensor signals under a spatial coherence constraint. , 2008, The Journal of the Acoustical Society of America.

[17]  Tomohiro Nakatani,et al.  Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[18]  Emanuel A. P. Habets,et al.  New Insights Into the MVDR Beamformer in Room Acoustics , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  J. Flanagan,et al.  Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .

[20]  Dong Yu,et al.  Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information , 2019, INTERSPEECH.

[21]  Jacob Benesty,et al.  An Integrated Solution for Online Multichannel Noise Tracking and Reduction , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[22]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[23]  Emanuel A. P. Habets,et al.  DOA-informed source extraction in the presence of competing talkers and background noise , 2017, EURASIP J. Adv. Signal Process..

[24]  Tomohiro Nakatani,et al.  SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures , 2019, IEEE Journal of Selected Topics in Signal Processing.

[25]  Walter Kellermann,et al.  Coherent-to-Diffuse Power Ratio Estimation for Dereverberation , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[26]  Philipos C. Loizou,et al.  A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[27]  DeLiang Wang,et al.  Supervised Speech Separation Based on Deep Learning: An Overview , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[28]  Zhuo Chen,et al.  Deep clustering: Discriminative embeddings for segmentation and separation , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  E. Lehmann,et al.  Prediction of energy decay in room impulse responses simulated with an image-source model. , 2008, The Journal of the Acoustical Society of America.

[30]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[31]  Hiroshi Sawada,et al.  A NOVEL BLIND SOURCE SEPARATION METHOD WITH OBSERVATION VECTOR CLUSTERING , 2005 .

[32]  Mark A. Poletti,et al.  Spatially Robust Far-field Beamforming Using the von Mises(-Fisher) Distribution , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[33]  H. Cox Resolving power and sensitivity to mismatch of optimum array processors , 1973 .

[34]  Wei Jiang,et al.  The analysis of the simplification from the ideal ratio to binary mask in signal-to-noise ratio sense , 2014, Speech Commun..

[35]  Jon Barker,et al.  An analysis of environment, microphone and data simulation mismatches in robust speech recognition , 2017, Comput. Speech Lang..

[36]  Wenju Liu,et al.  Deep Learning Based Speech Separation via NMF-Style Reconstructions , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.