Feature mapping using far-field microphones for distant speech recognition
暂无分享,去创建一个
[1] Jonathan Le Roux,et al. The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[2] James L. Flanagan,et al. Robust distant-talking speech recognition , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[3] Petr Motlícek,et al. Feature and score level combination of subspace Gaussinas in LVCSR task , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[4] Steve Renals,et al. Hybrid acoustic models for distant and multichannel large vocabulary speech recognition , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[5] Steve Renals,et al. Multi-level adaptive networks in tandem and hybrid ASR systems , 2013, ICASSP.
[6] Michael S. Brandstein,et al. Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.
[7] Yu Zhang,et al. Extracting deep neural network bottleneck features using low-rank matrix factorization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Steve Renals,et al. Convolutional Neural Networks for Distant Speech Recognition , 2014, IEEE Signal Processing Letters.
[9] L. J. Griffiths,et al. An alternative approach to linearly constrained adaptive beamforming , 1982 .
[10] Antoine Liutkus,et al. Robust ASR using neural network based speech enhancement and feature simulation , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[11] Richard M. Stern,et al. Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[12] John W. McDonough,et al. Combining multi-source far distance speech recognition strategies: beamforming, blind channel and confusion network combination , 2005, INTERSPEECH.
[13] Thomas Hain,et al. An Analysis of Automatic Speech Recognition with Multiple Microphones , 2011, INTERSPEECH.
[14] Andrew W. Senior,et al. Improving DNN speaker independence with I-vector inputs , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Longbiao Wang,et al. Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[16] DeLiang Wang,et al. Ideal ratio mask estimation using deep neural networks for robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[17] G. Carter,et al. The generalized correlation method for estimation of time delay , 1976 .
[18] Tomohiro Nakatani,et al. Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition , 2012, IEEE Signal Process. Mag..
[19] B.D. Van Veen,et al. Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.
[20] Dong Yu,et al. Improved Bottleneck Features Using Pretrained Deep Neural Networks , 2011, INTERSPEECH.
[21] Ian Lane,et al. Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition , 2016, INTERSPEECH.
[22] Thomas Hain,et al. Using neural network front-ends on far field multiple microphones based speech recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] Jörg Meyer,et al. Multi-channel speech enhancement in a car environment using Wiener filtering and spectral subtraction , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[24] Björn W. Schuller,et al. Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR , 2015, LVA/ICA.
[25] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[26] Li-Rong Dai,et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[27] Tomohiro Nakatani,et al. Single-Microphone Blind Dereverberation , 2005 .
[28] Jan Cernocký,et al. Probabilistic and Bottle-Neck Features for LVCSR of Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[29] Yu Zhang,et al. Highway long short-term memory RNNS for distant speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[30] George Saon,et al. Speaker adaptation of neural network acoustic models using i-vectors , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[31] Hervé Bourlard,et al. Microphone array post-filter based on noise field coherence , 2003, IEEE Trans. Speech Audio Process..
[32] Ephraim. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .
[33] Thomas Hain,et al. Extending Limabeam with discrimination and coarse gradients , 2014, INTERSPEECH.
[34] Xavier Anguera Miró,et al. Acoustic Beamforming for Speaker Diarization of Meetings , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[35] Steve Renals,et al. Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[36] Mark J. F. Gales,et al. Investigation of unsupervised adaptation of DNN acoustic models with filter bank input , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[37] Martin Wolf,et al. Channel selection measures for multi-microphone speech recognition , 2014, Speech Commun..
[38] Petr Motlícek,et al. Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[39] David Pearce,et al. A robust front-end algorithm for distributed speech recognition , 2001, INTERSPEECH.
[40] Weifeng Li,et al. Non-linear mapping for multi-channel speech separation and robust overlapping spech recognition , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[41] Petr Tichavsk,et al. Latent Variable Analysis and Signal Separation , 2012, Lecture Notes in Computer Science.
[42] Kazuya Takeda,et al. Speech recognition based on space diversity using distributed multi-microphone , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[43] Ning Ma,et al. The PASCAL CHiME speech separation and recognition challenge , 2013, Comput. Speech Lang..
[44] Yongqiang Wang,et al. An investigation of deep neural networks for noise robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[45] Tanja Schultz,et al. THE ISL RT-04S MEETING TRANSCRIPTION SYSTEM , 2004 .
[46] Masato Miyoshi,et al. Inverse filtering of room acoustics , 1988, IEEE Trans. Acoust. Speech Signal Process..
[47] Mary Harper. The Automatic Speech recogition In Reverberant Environments (ASpIRE) challenge , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[48] James R. Glass,et al. Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[49] Jon Barker,et al. The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[50] DeLiang Wang,et al. Binary and ratio time-frequency masks for robust speech recognition , 2006, Speech Commun..
[51] Andreas Stolcke,et al. The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System , 2007, CLEAR.
[52] S. Boll,et al. Suppression of acoustic noise in speech using spectral subtraction , 1979 .
[53] Jan Cernocký,et al. BUT BABEL system for spontaneous Cantonese , 2013, INTERSPEECH.
[54] Steve Renals,et al. Multilingual training of deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[55] Lukás Burget,et al. Transcribing Meetings With the AMIDA Systems , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[56] Te-Won Lee,et al. Blind Speech Separation , 2007, Blind Speech Separation.
[57] Richard M. Stern,et al. Likelihood-maximizing beamforming for robust hands-free speech recognition , 2004, IEEE Transactions on Speech and Audio Processing.
[58] Florian Metze,et al. Improvements to speaker adaptive training of deep neural networks , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[59] Dong Yu,et al. Single-channel mixed speech recognition using deep neural networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[60] Florian Metze,et al. Towards speaker adaptive training of deep neural network acoustic models , 2014, INTERSPEECH.