Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models
暂无分享,去创建一个
[1] John R. Hershey,et al. Super-human multi-talker speech recognition: A graphical modeling approach , 2010, Comput. Speech Lang..
[2] John R. Hershey,et al. Monaural speech separation and recognition challenge , 2010, Comput. Speech Lang..
[3] Yifan Gong,et al. An Overview of Noise-Robust Automatic Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[4] Stuart J. Russell,et al. Dynamic bayesian networks: representation, inference and learning , 2002 .
[5] Richard M. Stern,et al. A vector Taylor series approach for environment-independent speech recognition , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[6] Michael I. Jordan,et al. Factorial Hidden Markov Models , 1995, Machine Learning.
[7] Reinhold Häb-Umbach,et al. An analytic derivation of a phase-sensitive observation model for noise robust speech recognition , 2009, INTERSPEECH.
[8] Jonathan Le Roux,et al. Factorial Models for Noise Robust Speech Recognition , 2012, Techniques for Noise Robustness in Automatic Speech Recognition.
[9] John R. Hershey,et al. Single-Channel Multitalker Speech Recognition , 2010, IEEE Signal Processing Magazine.
[10] Dong Yu,et al. Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[11] Tuomas Virtanen,et al. Speech recognition using factorial hidden Markov models for separation in the feature space , 2006, INTERSPEECH.
[12] Sam T. Roweis,et al. Factorial models and refiltering for speech separation and denoising , 2003, INTERSPEECH.
[13] Brendan J. Frey,et al. Speech recognition in adverse environments: a probabilistic approach , 2002 .
[14] James R. Glass,et al. Developments and directions in speech recognition and understanding, Part 1 [DSP Education] , 2009, IEEE Signal Processing Magazine.
[15] Carla Teixeira Lopes,et al. TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .
[16] Mohammad Ali Keyvanrad,et al. A brief survey on deep belief networks and introducing a new object oriented toolbox ( DeeBNet V 3 . 0 ) , 2016 .
[17] John R. Hershey,et al. Hierarchical variational loopy belief propagation for multi-talker speech recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[18] Simon Haykin,et al. The Cocktail Party Problem , 2005, Neural Computation.
[19] van Dalen,et al. Statistical models for noise-robust speech recognition , 2011 .
[20] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .
[21] Yifan Gong,et al. A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions , 2009, Computer Speech and Language.
[22] Steve Young. A review of large-vocabulary continuous-speech , 1996 .
[23] Mark J. F. Gales,et al. Robust continuous speech recognition using parallel model combination , 1996, IEEE Trans. Speech Audio Process..
[24] Li Deng,et al. Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise , 2004, IEEE Transactions on Speech and Audio Processing.
[25] Jon Barker,et al. An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.