Phonetic-and-Semantic Embedding of Spoken words with Applications in Spoken Content Retrieval
暂无分享,去创建一个
Lin-Shan Lee | Hung-yi Lee | Yi-Chen Chen | Chia-Hao Shen | Sung-Feng Huang | Hung-yi Lee | Lin-Shan Lee | Yi-Chen Chen | Sung-Feng Huang | Chia-Hao Shen
[1] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[2] Lin-Shan Lee,et al. Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[3] Karen Livescu,et al. Deep convolutional acoustic word embeddings using word-pair side information , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Hao Tang,et al. End-to-End Neural Segmental Models for Speech Recognition , 2017, IEEE Journal of Selected Topics in Signal Processing.
[5] Hung-yi Lee,et al. Towards Unsupervised Automatic Speech Recognition Trained by Unaligned Speech and Text only , 2018, ArXiv.
[6] Georg Heigold,et al. Word embeddings for speech recognition , 2014, INTERSPEECH.
[7] Liang Lu,et al. Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition , 2017, INTERSPEECH.
[8] Karen Livescu,et al. Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings , 2017, INTERSPEECH.
[9] George Trigeorgis,et al. Domain Separation Networks , 2016, NIPS.
[10] Yejin Choi,et al. Neural AMR: Sequence-to-Sequence Models for Parsing and Generation , 2017, ACL.
[11] Lior Wolf,et al. Non-Adversarial Unsupervised Word Translation , 2018, EMNLP.
[12] Ngoc Thang Vu,et al. Character Composition Model with Convolutional Neural Networks for Dependency Parsing on Morphologically Rich Languages , 2017, ACL.
[13] Lin-Shan Lee,et al. Interactive spoken content retrieval by extended query model and continuous state space Markov Decision Process , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[14] Karen Livescu,et al. Discriminative acoustic word embeddings: Tecurrent neural network-based approaches , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[15] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[16] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[17] Aren Jansen,et al. A segmental framework for fully-unsupervised large-vocabulary speech recognition , 2016, Comput. Speech Lang..
[18] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Lin-Shan Lee,et al. Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[20] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[21] James R. Glass,et al. Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech , 2018, INTERSPEECH.
[22] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.
[23] Aren Jansen,et al. Towards Learning Semantic Audio Representations from Unlabeled Data , 2017 .
[24] Yu Zhang,et al. Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data , 2017, NIPS.
[25] Kuan-Yu Chen,et al. Spoken Document Retrieval With Unsupervised Query Modeling Techniques , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[26] Karen Livescu,et al. An embedded segmental K-means model for unsupervised segmentation and clustering of speech , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[27] Lin-Shan Lee,et al. Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder , 2016, INTERSPEECH.
[28] Yifan Gong,et al. Unsupervised adaptation with domain separation networks for robust speech recognition , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[29] Mari Ostendorf,et al. Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information , 2017, NAACL.
[30] Karen Livescu,et al. Multi-view Recurrent Neural Acoustic Word Embeddings , 2016, ICLR.
[31] Kuan-Yu Chen,et al. Leveraging Relevance Cues for Improved Spoken Document Retrieval , 2011, INTERSPEECH.
[32] Lifu Tu,et al. Learning to Embed Words in Context for Syntactic Tasks , 2017, Rep4NLP@ACL.
[33] Guillaume Lample,et al. Neural Architectures for Named Entity Recognition , 2016, NAACL.
[34] Guillaume Lample,et al. Word Translation Without Parallel Data , 2017, ICLR.
[35] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.
[36] Aren Jansen,et al. Unsupervised Learning of Semantic Audio Representations , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[37] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[38] Noah A. Smith,et al. Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs , 2015, EMNLP.
[39] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.
[40] Aren Jansen,et al. Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[41] James R. Glass,et al. Spoken Content Retrieval—Beyond Cascading Speech Recognition with Text Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[42] Barbara Plank,et al. Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss , 2016, ACL.
[43] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[44] Alexander M. Rush,et al. Character-Aware Neural Language Models , 2015, AAAI.
[45] Trevor Darrell,et al. Sequence to Sequence -- Video to Text , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[46] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.
[47] Joshua B. Tenenbaum,et al. Deep Convolutional Inverse Graphics Network , 2015, NIPS.
[48] Lior Wolf,et al. An Iterative Closest Point Method for Unsupervised Word Translation , 2018, ArXiv.