Acoustic modeling with deep neural networks using raw time signal for LVCSR
Hermann Ney | Ralf Schlüter | Zoltán Tüske | Pavel Golik
[1] A. B. Poritz, et al. Linear predictive hidden Markov models and the speech signal, 1982, ICASSP.
[2] George Cybenko, et al. Approximation by superpositions of a sigmoidal function, 1989, Math. Control. Signals Syst.
[3] Kurt Hornik, et al. Multilayer feedforward networks are universal approximators, 1989, Neural Networks.
[4] Brian R. Glasberg, et al. Derivation of auditory filter shapes from notched-noise data, 1990, Hearing Research.
[5] H. Hermansky, et al. Perceptual linear predictive (PLP) analysis of speech, 1990, The Journal of the Acoustical Society of America.
[6] Hervé Bourlard, et al. Connectionist Speech Recognition: A Hybrid Approach, 1993.
[7] William J. J. Roberts, et al. Revisiting autoregressive hidden Markov modeling of speech signals, 2005, IEEE Signal Processing Letters.
[8] Hermann Ney, et al. Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition, 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[9] Geoffrey E. Hinton, et al. Rectified Linear Units Improve Restricted Boltzmann Machines, 2010, ICML.
[10] Peter Sollich, et al. Subband acoustic waveform front-end for robust speech recognition using support vector machines, 2010, 2010 IEEE Spoken Language Technology Workshop.
[11] Hermann Ney, et al. RASR - The RWTH Aachen University Open Source Speech Recognition Toolkit, 2011.
[12] Dong Yu, et al. Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription, 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[13] Hermann Ney, et al. Improved Acoustic Feature Combination for LVCSR by Neural Networks, 2011, INTERSPEECH.
[14] Gerald Penn, et al. Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition, 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Dimitri Palaz, et al. Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks, 2013, INTERSPEECH.
[16] Jinyu Li, et al. Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks, 2013, ICLR 2013.
[17] Tara N. Sainath, et al. Learning filter banks within a deep neural network framework, 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[18] Dong Yu, et al. Feature Learning in Deep Neural Networks - A Study on Speech Recognition Tasks, 2013, ICLR.
[19] Hermann Ney, et al. Mean-normalized stochastic gradient for large-scale deep learning, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).