Multitask Learning of Time-Frequency CNN for Sound Source Localization
暂无分享,去创建一个
Xiaofei Li | Hong Liu | Cheng Pang | Cheng Pang | Hong Liu | Xiaofei Li
[1] Haizhou Li,et al. A learning-based approach to direction of arrival estimation in noisy and reverberant environments , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Gökhan Ince,et al. Using binaural and spectral cues for azimuth and elevation localization , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[3] Volker Hohmann,et al. Sound source localization in real sound fields based on empirical statistics of interaural parameters. , 2006, The Journal of the Acoustical Society of America.
[4] C. Avendano,et al. The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).
[5] Zhengyou Zhang,et al. Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings , 2008, IEEE Transactions on Multimedia.
[6] L A JEFFRESS,et al. A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.
[7] Rich Caruana,et al. Multitask Learning , 1997, Machine-mediated learning.
[8] G. C. Carter,et al. The smoothed coherence transform , 1973 .
[9] DeLiang Wang,et al. Binaural Localization of Multiple Sources in Reverberant and Noisy Environments , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[10] Jean Rouat,et al. Localization of simultaneous moving sound sources for mobile robot using a frequency- domain steered beamformer approach , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.
[11] Stephan Gerlach,et al. On sound source localization of speech signals using deep neural networks , 2015 .
[12] Jean-Luc Zarader,et al. A learning-based approach to robust binaural sound localization , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[13] Hong Liu,et al. Robust Acoustic Localization Via Time-Delay Compensation and Interaural Matching Filter , 2015, IEEE Transactions on Signal Processing.
[14] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[15] DeLiang Wang,et al. Binaural Sound Localization , 2006 .
[16] Rodney W. Johnson,et al. Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy , 1980, IEEE Trans. Inf. Theory.
[17] Pasi Pertilä,et al. Microphone array post-filtering using supervised machine learning for speech enhancement , 2014, INTERSPEECH.
[18] Thushara D. Abhayapala,et al. Binaural localization of speech sources in 3-D using a composite feature vector of the HRTF , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Steven van de Par,et al. A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[20] Hong Liu,et al. Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[21] AG Armin Kohlrausch,et al. Binaural Localization and Detection of Speakers in Complex Acoustic Scenes , 2013 .
[22] Kazunori Komatani,et al. Sound source localization based on deep neural networks with directional activate function exploiting phase information , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] C. Faller,et al. Source localization in complex listening situations: selection of binaural cues based on interaural coherence. , 2004, The Journal of the Acoustical Society of America.
[24] Hong Kook Kim,et al. Direction-of-Arrival Based SNR Estimation for Dual-Microphone Speech Enhancement , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[25] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[26] John R. Hershey,et al. Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks , 2015, INTERSPEECH.
[27] Francesco Piazza,et al. Localizing speakers in multiple rooms by using Deep Neural Networks , 2018, Comput. Speech Lang..
[28] Stanley T. Birchfield,et al. Acoustic localization by interaural level difference , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[29] Michael S. Brandstein,et al. Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.
[30] Jean-Luc Zarader,et al. A binaural sound source localization method using auditive cues and vision , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[31] Sebastian Ruder,et al. An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.
[32] Carlo Drioli,et al. Exploiting CNNs for Improving Acoustic Source Localization in Noisy and Reverberant Conditions , 2018, IEEE Transactions on Emerging Topics in Computational Intelligence.
[33] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[34] Harald Viste,et al. Binaural Source Localization by Joint Estimation of ILD and ITD , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[35] Volker Hohmann,et al. Auditory model based direction estimation of concurrent speakers from binaural signals , 2011, Speech Commun..
[36] Daniel P. W. Ellis,et al. Model-Based Expectation-Maximization Source Separation and Localization , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[37] Guy J. Brown,et al. Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , 2006 .
[38] Raffaele Parisi,et al. Binaural sound source localization in the presence of reverberation , 2011, 2011 17th International Conference on Digital Signal Processing (DSP).
[39] G. Carter,et al. The generalized correlation method for estimation of time delay , 1976 .
[40] Stephen E. Levinson,et al. A Bayes-rule based hierarchical system for binaural sound source localization , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[41] Samuel W. Clapp,et al. A Binaural Model that Analyses Acoustic Spaces and Stereophonic Reproduction Systems by Utilizing Head Rotations , 2013 .
[42] Susanto Rahardja,et al. Indoor Sound Source Localization With Probabilistic Neural Network , 2017, IEEE Transactions on Industrial Electronics.
[43] Herman J. M. Steeneken,et al. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..
[44] Guy J. Brown,et al. Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[45] Michele Scarpiniti,et al. Cepstrum Prefiltering for Binaural Source Localization in Reverberant Environments , 2012, IEEE Signal Processing Letters.
[46] Volker Willert,et al. A Probabilistic Model for Binaural Sound Localization , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[47] R. O. Schmidt,et al. Multiple emitter location and signal Parameter estimation , 1986 .
[48] Jasha Droppo,et al. Improving speech recognition in reverberation using a room-aware deep neural network and multi-task learning , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[49] Archontis Politis,et al. Direction of Arrival Estimation for Multiple Sound Sources Using Convolutional Recurrent Neural Network , 2017, 2018 26th European Signal Processing Conference (EUSIPCO).
[50] DeLiang Wang,et al. Speech segregation based on sound localization , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).
[51] Carla Teixeira Lopes,et al. TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .
[52] Jean-Luc Zarader,et al. Towards a systematic study of binaural cues , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[53] Boaz Rafaely,et al. Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[54] Hiroshi Sawada,et al. Blind Speech Separation in a Meeting Situation with Maximum SNR Beamformers , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[55] Emanuel A. P. Habets,et al. Broadband doa estimation using convolutional neural networks trained with noise signals , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).