Assisted keyword indexing for lecture videos using unsupervised keyword spotting
Anurag Agarwal | Stephanie Ludi | Richard Zanibbi | Zachary Miller | Manish Kanadje | Roger Gaborski
[1] Aren Jansen, et al. Segmental acoustic indexing for zero resource keyword search, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] S. Chiba, et al. Dynamic programming algorithm optimization for spoken word recognition, 1978.
[3] Jan Cernocký, et al. Speech@FIT lecture browser, 2010, 2010 IEEE Spoken Language Technology Workshop.
[4] Paul Lamere, et al. Sphinx-4: a flexible open source framework for speech recognition, 2004.
[5] Yu Huang, et al. Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning, 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[6] James R. Glass, et al. Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams, 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[7] Atsunori Ogawa, et al. Zero-resource spoken term detection using hierarchical graph-based similarity search, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Hervé Bourlard, et al. Posterior-Based Features and Distances in Template Matching for Speech Recognition, 2007, MLMI.
[9] S. Levinson, et al. Considerations in dynamic time warping algorithms for discrete word recognition, 1978.
[10] James R. Glass, et al. Towards unsupervised pattern discovery in speech, 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.
[11] Stan Davis, et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se, 1980.
[12] Xavier Anguera Miró. Information retrieval-based dynamic time warping, 2013, INTERSPEECH.
[13] Meinard Müller, et al. Information retrieval for music and motion, 2007.
[14] James R. Glass, et al. Recent progress in the MIT spoken lecture processing project, 2007, INTERSPEECH.
[15] Douglas D. O'Shaughnessy, et al. Comparative Evaluation of Feature Normalization Techniques for Speaker Verification, 2011, NOLISP.
[16] Gerhard Doblinger, et al. Computationally efficient speech enhancement by spectral minima tracking in subbands, 1995, EUROSPEECH.
[17] Lin-Shan Lee, et al. Unsupervised spoken term detection with spoken queries by multi-level acoustic patterns with varying model granularity, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] James R. Glass, et al. Analysis and Processing of Lecture Audio Data: Preliminary Investigations, 2004, Proceedings of the Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval at HLT-NAACL 2004 - SpeechIR '04.
[19] John R. Kender, et al. VAST MM: multimedia browser for presentation video, 2007, CIVR '07.