暂无分享,去创建一个
[1] Bin Ma,et al. Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search , 2018, INTERSPEECH.
[2] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Aren Jansen,et al. Indexing Raw Acoustic Features for Scalable Zero Resource Search , 2012, INTERSPEECH.
[4] James R. Glass,et al. Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[5] Aren Jansen,et al. Rapid Evaluation of Speech Representations for Spoken Term Discovery , 2011, INTERSPEECH.
[6] Bin Ma,et al. Approximate search of audio queries by using DTW with phone time boundary and data augmentation , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Brian Kingsbury,et al. End-to-end ASR-free keyword search from speech , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Jorge Proença,et al. Segmented Dynamic Time Warping for Spoken Query-by-Example Search , 2016, INTERSPEECH.
[9] Carmen García-Mateo,et al. GTM-UVigo Systems for the Query-by-Example Search on Speech Task at MediaEval 2015 , 2015, MediaEval.
[10] Aren Jansen,et al. Segmental acoustic indexing for zero resource keyword search , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .
[12] Quoc V. Le,et al. SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition , 2019, INTERSPEECH.
[13] Timothy J. Hazen,et al. Query-by-example spoken term detection using phonetic posteriorgram templates , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[14] Mikel Penagarikano. MediaEval 2013 Spoken Web Search Task: System Performance Measures , 2013 .
[15] Bin Ma,et al. Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis , 2016, INTERSPEECH.
[16] Dhananjay Ram,et al. Neural Network Based End-to-End Query by Example Spoken Term Detection , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[17] Karen Livescu,et al. Multi-view Recurrent Neural Acoustic Word Embeddings , 2016, ICLR.
[18] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Chng Eng Siong,et al. The NNI Query-by-Example System for MediaEval 2015 , 2014, MediaEval.
[20] Bhuvana Ramabhadran,et al. Query-by-example Spoken Term Detection For OOV terms , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[21] Georg Heigold,et al. Word embeddings for speech recognition , 2014, INTERSPEECH.
[22] Bin Ma,et al. Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information , 2016, INTERSPEECH.
[23] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[24] Xavier Anguera Miró,et al. Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[25] Karen Livescu,et al. Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings , 2017, INTERSPEECH.
[26] Karen Livescu,et al. Deep convolutional acoustic word embeddings using word-pair side information , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[27] Michael Picheny,et al. Acoustically Grounded Word Embeddings for Improved Acoustics-to-word Speech Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[28] Herman Kamper,et al. Truly Unsupervised Acoustic Word Embeddings Using Weak Top-down Constraints in Encoder-decoder Models , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[29] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[30] Bin Ma,et al. Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection , 2016, INTERSPEECH.
[31] Karen Livescu,et al. Multilingual Jointly Trained Acoustic and Written Word Embeddings , 2020, INTERSPEECH.
[32] Karen Livescu,et al. Discriminative acoustic word embeddings: Tecurrent neural network-based approaches , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[33] Aren Jansen,et al. Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[34] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[35] Emmanuel Dupoux,et al. Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments , 2018, INTERSPEECH.
[36] Emilio Sanchis Arnal,et al. ELiRF at MediaEval 2015: Query by Example Search on Speech Task (QUESST) , 2014, MediaEval.
[37] T. K. Vintsyuk. Speech discrimination by dynamic programming , 1968 .
[38] Sharon Goldwater,et al. Multilingual Acoustic Word Embedding Models for Processing Zero-resource Languages , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[39] James R. Glass,et al. A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-Based Dynamic Time Warping , 2011, INTERSPEECH.
[40] Lin-Shan Lee,et al. Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder , 2016, INTERSPEECH.
[41] Florian Metze,et al. Query by Example Search on Speech at Mediaeval 2015 , 2014, MediaEval.