Generalized Hough Transform for Speech Pattern Classification
[1] Hynek Hermansky,et al. TRAPS - classifiers of temporal patterns, 1998, ICSLP.
[2] Jonathan Z. Simon,et al. Robust Spectrotemporal Reverse Correlation for the Auditory System: Optimizing Stimulus Design , 2000, Journal of Computational Neuroscience.
[3] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[4] Steve Young,et al. The HTK book version 3.4, 2006.
[5] Richard O. Duda,et al. Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.
[6] Hervé Bourlard,et al. Enhanced Phone Posteriors for Improving Speech Recognition Systems , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[7] Dong Yu,et al. Deep Convex Net: A Scalable Architecture for Speech Pattern Classification , 2011, INTERSPEECH.
[8] Haizhou Li,et al. Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions , 2011, IEEE Signal Processing Letters.
[9] Chng Eng Siong,et al. Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[10] Subhransu Maji,et al. Object detection using a max-margin Hough transform , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[11] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition, 2012.
[12] Russell G. Death,et al. An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data, 2004.
[13] Navdeep Jaitly,et al. Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[14] Daniel P. W. Ellis,et al. Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[15] Po-Sen Huang,et al. Random features for Kernel Deep Convex Network , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[16] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[17] Dana H. Ballard,et al. Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..
[18] Dennis H. Klatt,et al. Speech perception: a model of acoustic–phonetic analysis and lexical access, 1979.
[19] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.
[20] Hanqing Lu,et al. Fusing multi-modal features for gesture recognition , 2013, ICMI '13.
[21] Dong Yu,et al. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.
[22] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[23] Richard F. Lyon,et al. Machine Hearing: An Emerging Field [Exploratory DSP] , 2010, IEEE Signal Processing Magazine.
[24] Li Deng,et al. Are Sparse Representations Rich Enough for Acoustic Modeling? , 2012, INTERSPEECH.
[25] Chng Eng Siong,et al. Overlapping sound event recognition using local spectrogram features and the generalised Hough transform , 2013, Pattern Recognit. Lett..
[26] Bernt Schiele,et al. Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.
[27] Sergio Escalera,et al. ChaLearn multi-modal gesture recognition 2013: grand challenge and workshop summary , 2013, ICMI '13.
[28] Dong Yu,et al. Scalable stacking and learning for building deep architectures , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[29] Richard F. Lyon,et al. Machine Hearing: An Emerging Field, 2010.
[30] László Tóth. Convolutional deep rectifier neural nets for phone recognition , 2013, INTERSPEECH.
[31] Michael J. Fischer,et al. The String-to-String Correction Problem , 1974, JACM.
[32] Wei-Yun Yau,et al. A multi-modal gesture recognition system using audio, video, and skeletal joint data , 2013, ICMI '13.
[33] Sergio Escalera,et al. Multi-modal gesture recognition challenge 2013: dataset and results , 2013, ICMI '13.