Metadata-Aware End-to-End Keyword Spotting
暂无分享,去创建一个
Noah D. Stein | Brian Kulis | Yuriy Mishchenko | Anish Shah | Shiv Naga Prasad Vitaladevuni | Gengshen Fu | Hongyi Liu | Apurva Abhyankar | Thibaud Sénéchal | S. Vitaladevuni | B. Kulis | Noah D. Stein | Thibaud Sénéchal | Gengshen Fu | Yuriy Mishchenko | Hongyi Liu | Anish Shah | Apurva Abhyankar
[1] Navdeep Jaitly,et al. Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[2] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.
[3] Jong Kyoung Kim,et al. Speech recognition , 1983, 1983 IEEE International Solid-State Circuits Conference. Digest of Technical Papers.
[4] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[5] W. Russell,et al. Continuous hidden Markov modeling for speaker-independent word spotting , 1989, International Conference on Acoustics, Speech, and Signal Processing,.
[6] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[7] Louis-Philippe Morency,et al. Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[8] Hugo Larochelle,et al. Modulating early visual processing by language , 2017, NIPS.
[9] Thomas Brox,et al. CAM-Convs: Camera-Aware Multi-Scale Convolutions for Single-View Depth , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Aaron C. Courville,et al. Learning Visual Reasoning Without Strong Priors , 2017, ICML 2017.
[11] Gerald Penn,et al. Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[12] Patrick Kenny,et al. Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition , 1991, IEEE Trans. Signal Process..
[13] Tara N. Sainath,et al. Convolutional neural networks for small-footprint keyword spotting , 2015, INTERSPEECH.
[14] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .
[15] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .
[16] Tara N. Sainath,et al. Deep convolutional neural networks for LVCSR , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[17] Lawrence R. Rabiner,et al. A tutorial on Hidden Markov Models , 1986 .
[18] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.