End-to-end speech-to-dialog-act recognition
暂无分享,去创建一个
Tatsuya Kawahara | Tianyu Zhao | Viet-Trung Dang | Sei Ueno | Hirofumi Inaguma | Tatsuya Kawahara | Tianyu Zhao | H. Inaguma | Sei Ueno | Viet-Trung Dang
[1] Srinivas Bangalore,et al. Spoken Language Understanding without Speech Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Gökhan Tür,et al. Beyond ASR 1-best: Using word confusion networks in spoken language understanding , 2006, Comput. Speech Lang..
[3] Andreas Stolcke,et al. Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.
[4] Yoshua Bengio,et al. Speech Model Pre-training for End-to-End Spoken Language Understanding , 2019, INTERSPEECH.
[5] Matthias Zimmermann,et al. Joint segmentation and classification of dialog acts using conditional random fields , 2009, INTERSPEECH.
[6] Andreas Stolcke,et al. Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.
[7] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.
[8] Elizabeth Shriberg,et al. Automatic dialog act segmentation and classification in multiparty meetings , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[9] Yoshua Bengio,et al. End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results , 2014, ArXiv.
[10] Raphael Schumann,et al. Incorporating ASR Errors with Attention-Based, Jointly Trained RNN for Intent Detection and Slot Filling , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Bhuvana Ramabhadran,et al. Direct Acoustics-to-Word Models for English Conversational Speech Recognition , 2017, INTERSPEECH.
[12] Shafiq R. Joty,et al. Speech Act Modeling of Written Asynchronous Conversations with Task-Specific Embeddings and Conditional Structured Models , 2016, ACL.
[13] Hung-yi Lee,et al. Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection , 2016, INTERSPEECH.
[14] Giuseppe Riccardi,et al. Simultaneous dialog act segmentation and classification from human-human spoken conversations , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Harish Arsikere,et al. Novel acoustic features for automatic dialog-act tagging , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Arun Narayanan,et al. From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[18] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Tatsuya Kawahara,et al. Joint Learning of Dialog Act Segmentation and Recognition in Spoken Dialog Using Neural Networks , 2017, IJCNLP.
[20] Tatsuya Kawahara,et al. Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Yongqiang Wang,et al. Towards End-to-end Spoken Language Understanding , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[22] Sunil Kumar Kopparapu,et al. End-to-End Spoken Language Understanding: Bootstrapping in Low Resource Scenarios , 2019, INTERSPEECH.
[23] Yannick Estève,et al. Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability , 2019, INTERSPEECH.