Multilingual bottle-neck feature learning from untranscribed speech
暂无分享,去创建一个
Bin Ma | Haizhou Li | Hongjie Chen | Lei Xie | Cheung-Chi Leung | Haizhou Li | B. Ma | Lei Xie | C. Leung | Hongjie Chen
[1] Lin-Shan Lee,et al. Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder , 2016, INTERSPEECH.
[2] Hung-An Chang,et al. Resource configurable spoken query detection using Deep Boltzmann Machines , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[3] Aren Jansen,et al. Weak top-down constraints for unsupervised acoustic model training , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[4] Mark J. F. Gales,et al. Using VTLN for broadcast news transcription , 2004, INTERSPEECH.
[5] Aren Jansen,et al. Evaluating speech features with the minimal-pair ABX task: analysis of the classical MFC/PLP pipeline , 2013, INTERSPEECH.
[6] Aren Jansen,et al. Unsupervised neural network based feature extraction using weak top-down constraints , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Bin Ma,et al. Parallel inference of dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study , 2015, INTERSPEECH.
[8] Haizhou Li,et al. Multitask Feature Learning for Low-Resource Query-by-Example Spoken Term Detection , 2017, IEEE Journal of Selected Topics in Signal Processing.
[9] Rich Caruana,et al. Multitask Learning , 1997, Machine-mediated learning.
[10] Hynek Hermansky,et al. Evaluating speech features with the minimal-pair ABX task (II): resistance to noise , 2014, INTERSPEECH.
[11] Emmanuel Dupoux,et al. Phonetics embedding learning with side information , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[12] Lukás Burget,et al. An empirical evaluation of zero resource acoustic unit discovery , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] James R. Glass,et al. Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[14] John W. Fisher,et al. Parallel Sampling of DP Mixture Models using Sub-Cluster Splits , 2013, NIPS.
[15] Carl E. Rasmussen,et al. The Infinite Gaussian Mixture Model , 1999, NIPS.
[16] Aren Jansen,et al. Segmental acoustic indexing for zero resource keyword search , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Karen Livescu,et al. Deep convolutional acoustic word embeddings using word-pair side information , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Simon King,et al. Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[19] Martin Karafiát,et al. The language-independent bottleneck features , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[20] Bin Ma,et al. An acoustic segment modeling approach to query-by-example spoken term detection , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Bin Ma,et al. Extracting bottleneck features and word-like pairs from untranscribed speech for feature representation , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[22] Giorgio Metta,et al. An auto-encoder based approach to unsupervised learning of subword units , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] Satoshi Nakamura,et al. Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario , 2016, SLTU.
[24] Joseph Picone,et al. A Nonparametric Bayesian Approach for Spoken Term Detection by Example Query , 2016, INTERSPEECH.
[25] James R. Glass,et al. A Nonparametric Bayesian Approach to Acoustic Model Discovery , 2012, ACL.
[26] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[27] James R. Glass,et al. Towards unsupervised pattern discovery in speech , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..
[28] Bin Ma,et al. Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection , 2016, INTERSPEECH.
[29] Satoshi Nakamura,et al. Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering , 2016, INTERSPEECH.
[30] Aren Jansen,et al. Unsupervised Word Segmentation and Lexicon Discovery Using Acoustic Word Embeddings , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[31] Aren Jansen,et al. Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[32] James R. Glass,et al. Unsupervised Lexicon Discovery from Acoustic Input , 2015, TACL.