暂无分享,去创建一个
Ira Kemelmacher-Shlizerman | Steve Seitz | Teerapat Jenrungrot | Vivek Jayaram | S. Seitz | Ira Kemelmacher-Shlizerman | V. Jayaram | Teerapat Jenrungrot
[1] Xiong Xiao,et al. Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[2] Rémi Gribonval,et al. Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Daniel Johnson,et al. Latent Gaussian Activity Propagation: Using Smoothness and Structure to Separate and Localize Sounds in Large Noisy Environments , 2018, NeurIPS.
[4] Jonathan Le Roux,et al. Deep clustering and conventional networks for music separation: Stronger together , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Yong Xu,et al. Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Hong Wang,et al. Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources , 1985, IEEE Trans. Acoust. Speech Signal Process..
[7] Nicolas Usunier,et al. Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed , 2019, ArXiv.
[8] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Sharon Gannot,et al. Speaker localization and separation using incremental distributed expectation-maximization , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).
[10] Nima Mesgarani,et al. TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Don H. Johnson,et al. Array Signal Processing: Concepts and Techniques , 1993 .
[12] Archontis Politis,et al. Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks , 2018, IEEE Journal of Selected Topics in Signal Processing.
[13] Efthymios Tzinis,et al. Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures Using Spatial Information , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] James Glass,et al. Multiple Sound Source Localization with SVD-PHAT , 2019, INTERSPEECH.
[15] Athanasios Mouchtaris,et al. Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[16] Takuya Yoshioka,et al. End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Jean Rouat,et al. Robust sound source localization using a microphone array on a mobile robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).
[18] Emmanuel Vincent,et al. Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[19] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[20] Antoine Liutkus,et al. The 2018 Signal Separation Evaluation Campaign , 2018, LVA/ICA.
[21] Paris Smaragdis,et al. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[22] Naoya Takahashi,et al. Recursive speech separation for unknown number of speakers , 2019, INTERSPEECH.
[23] Hiroshi Sawada,et al. Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[24] DeLiang Wang,et al. Deep Learning Based Binaural Speech Separation in Reverberant Environments , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[25] Ivan Dokmanic,et al. Pyroomacoustics: A Python Package for Audio Room Simulation and Array Processing Algorithms , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Jont B. Allen,et al. Image method for efficiently simulating small‐room acoustics , 1976 .
[27] Chuang Gan,et al. The Sound of Pixels , 2018, ECCV.
[28] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[29] Francesco Nesta,et al. Convolutive BSS of Short Mixtures by ICA Recursively Regularized Across Frequencies , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[30] Yossi Adi,et al. Voice Separation with an Unknown Number of Multiple Speakers , 2020, ICML.
[31] Boaz Rafaely,et al. Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[32] Paris Smaragdis,et al. Multichannel Source Separation and Tracking With RANSAC and Directional Statistics , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[33] Daniel P. W. Ellis,et al. An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments , 2006, NIPS.
[34] Martin Vetterli,et al. FRIDA: FRI-based DOA estimation for arbitrary array layouts , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[35] Jonathan Le Roux,et al. SDR – Half-baked or Well Done? , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[36] Tomohiro Nakatani,et al. Deep Clustering-Based Beamforming for Separation with Unknown Number of Sources , 2017, INTERSPEECH.
[37] R. O. Schmidt,et al. Multiple emitter location and signal Parameter estimation , 1986 .
[38] Chuang Gan,et al. Self-supervised Audio-visual Co-segmentation , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[39] James H. McClellan,et al. TOPS: new DOA estimator for wideband signals , 2006, IEEE Transactions on Signal Processing.
[40] Guy J. Brown,et al. Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[41] Raffaele Parisi,et al. WAVES: weighted average of signal subspaces for robust wideband direction finding , 2001, IEEE Trans. Signal Process..
[42] DeLiang Wang,et al. On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis , 2005, Speech Separation by Humans and Machines.
[43] Dong Yu,et al. Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[44] Junichi Yamagishi,et al. SUPERSEDED - CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit , 2016 .
[45] John Thickstun,et al. Source Separation with Deep Generative Priors , 2020, ICML.
[46] Joseph H. DiBiase. A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .
[47] Hakan Erdogan,et al. Multi-Microphone Neural Speech Separation for Far-Field Multi-Talker Speech Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[48] Haizhou Li,et al. Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[49] Michael Vorlnder,et al. Auralization: Fundamentals of Acoustics, Modelling, Simulation, Algorithms and Acoustic Virtual Reality , 2020 .
[50] T. Ens,et al. Blind signal separation : statistical principles , 1998 .
[51] Daniel P. W. Ellis,et al. Model-Based Expectation-Maximization Source Separation and Localization , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[52] Tillman Weyde,et al. Singing Voice Separation with Deep U-Net Convolutional Networks , 2017, ISMIR.
[53] Emmanuel Vincent,et al. Multichannel Audio Source Separation With Deep Neural Networks , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[54] Rémi Gribonval,et al. Audio source separation with a single sensor , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[55] Yedid Hoshen,et al. Neural separation of observed and unobserved distributions , 2018, ICML.
[56] Andrea Cavallaro,et al. 3D audio-visual speaker tracking with an adaptive particle filter , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[57] Radu Horaud,et al. Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds , 2014, Int. J. Neural Syst..
[58] Antoine Liutkus,et al. Generalized Wiener filtering with fractional power spectrograms , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[59] Hamid Amiri,et al. Beamforming Techniques for Multichannel audio Signal Separation , 2012, ArXiv.
[60] Paris Smaragdis,et al. Directional NMF for joint source localization and separation , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
[61] Zhuo Chen,et al. Deep clustering: Discriminative embeddings for segmentation and separation , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[62] Nima Mesgarani,et al. Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[63] Fakheredine Keyrouz. Robotic Binaural Localization and Separation of Multiple Simultaneous Sound Sources , 2017, 2017 IEEE 11th International Conference on Semantic Computing (ICSC).
[64] Bhiksha Raj,et al. Non-negative matrix factorization based compensation of music for automatic speech recognition , 2010, INTERSPEECH.
[65] Petr Motlícek,et al. Deep Neural Networks for Multiple Speaker Detection and Localization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[66] Paris Smaragdis,et al. A Wrapped Kalman Filter for Azimuthal Speaker Tracking , 2013, IEEE Signal Processing Letters.
[67] Tatsuya Kawahara,et al. Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[68] Simon Dixon,et al. Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation , 2018, ISMIR.
[69] Nima Mesgarani,et al. Real-Time Binaural Speech Separation with Preserved Spatial Cues , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[70] Futoshi Asano,et al. Sound source localization and separation based on the EM algorithm , 2004, SAPA@INTERSPEECH.