CNN Based Query by Example Spoken Term Detection
暂无分享,去创建一个
Hervé Bourlard | Dhananjay Ram | Lesly Miculicich Werlen | H. Bourlard | Lesly Miculicich | Dhananjay Ram
[1] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[2] Florian Metze,et al. Query by Example Search on Speech at Mediaeval 2015 , 2014, MediaEval.
[3] Meinard Müller,et al. Information retrieval for music and motion , 2007 .
[4] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .
[5] James R. Glass,et al. Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[6] Hervé Bourlard,et al. Sparse modeling of posterior exemplars for keyword detection , 2015, INTERSPEECH.
[7] Mikel Penagarikano. MediaEval 2013 Spoken Web Search Task: System Performance Measures , 2013 .
[8] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[9] James R. Glass,et al. A Nonparametric Bayesian Approach to Acoustic Model Discovery , 2012, ACL.
[10] Afsaneh Asaei,et al. Sparse Subspace Modeling for Query by Example Spoken Term Detection , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[11] Bin Ma,et al. Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection , 2016, INTERSPEECH.
[12] Mireia Díez,et al. GTTS-EHU Systems for QUESST at MediaEval 2014 , 2014, MediaEval.
[13] Hung-An Chang,et al. Resource configurable spoken query detection using Deep Boltzmann Machines , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] James R. Glass,et al. Spoken Content Retrieval—Beyond Cascading Speech Recognition with Text Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[15] Khalid Choukri,et al. SpeechDat(E) - Eastern European Telephone Speech Databases , 2000 .
[16] Lin-Shan Lee,et al. Model-Based Unsupervised Spoken Term Detection with Spoken Queries , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[17] Mireia Díez,et al. High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Timothy J. Hazen,et al. Query-by-example spoken term detection using phonetic posteriorgram templates , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[19] James R. Glass,et al. Unsupervised Pattern Discovery in Speech , 2008, IEEE Transactions on Audio, Speech, and Language Processing.
[20] Hervé Bourlard,et al. Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection , 2016, INTERSPEECH.
[21] Bin Ma,et al. Acoustic Segment Modeling with Spectral Clustering Methods , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[22] Florian Metze,et al. The Spoken Web Search Task , 2012, MediaEval.
[23] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[24] Hervé Bourlard,et al. Subspace Regularized Dynamic Time Warping for Spoken Query Detection , 2017 .
[25] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.