Attention Based On-Device Streaming Speech Recognition with Large Speech Corpus
Daehyun Kim | Young-Yoon Lee | Dhananjaya N. Gowda | Sichen Jin | Chanwoo Kim | Junmo Park | Kwangyoun Kim | Sungsoo Kim | Kyungmin Lee | Seokyeong Jung | Jungin Lee | Jinsu Yeo | Myoungji Han
[1] Andrew W. Senior, et al. Fast and accurate recurrent neural network acoustic models for speech recognition, 2015, INTERSPEECH.
[2] Dongsoo Lee, et al. DeepTwist: Learning Model Compression via Occasional Weight Distortion, 2018, ArXiv.
[3] Yoshua Bengio, et al. Neural Machine Translation by Jointly Learning to Align and Translate, 2014, ICLR.
[4] Dhananjaya N. Gowda, et al. End-to-End Training of a Large Vocabulary End-to-End Speech Recognition System, 2019, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[5] Dhananjaya N. Gowda, et al. Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System, 2019, INTERSPEECH.
[6] Alexander Sergeev, et al. Horovod: fast and easy distributed deep learning in TensorFlow, 2018, ArXiv.
[7] Tara N. Sainath, et al. An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model, 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Quoc V. Le, et al. Listen, Attend and Spell, 2015, ArXiv.
[9] Hairong Liu, et al. Exploring neural transducers for end-to-end speech recognition, 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[10] Tara N. Sainath, et al. State-of-the-Art Speech Recognition with Sequence-to-Sequence Models, 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Tara N. Sainath, et al. Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models, 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Quoc V. Le, et al. SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition, 2019, INTERSPEECH.
[13] Colin Raffel, et al. Online and Linear-Time Attention by Enforcing Monotonic Alignments, 2017, ICML.
[14] Hermann Ney, et al. Improved training of end-to-end attention models for speech recognition, 2018, INTERSPEECH.
[15] Yoshua Bengio, et al. Attention-Based Models for Speech Recognition, 2015, NIPS.
[16] Shinji Watanabe, et al. Joint CTC-attention based end-to-end speech recognition using multi-task learning, 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Alex Graves, et al. Sequence Transduction with Recurrent Neural Networks, 2012, ArXiv.
[18] Martín Abadi, et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, 2016, ArXiv.
[19] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.
[20] Yifan Gong, et al. Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Colin Raffel, et al. Monotonic Chunkwise Attention, 2017, ICLR.
[22] Jürgen Schmidhuber, et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, 2006, ICML.
[23] Hermann Ney, et al. Returnn: The RWTH extensible training framework for universal recurrent neural networks, 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[24] Rico Sennrich, et al. Neural Machine Translation of Rare Words with Subword Units, 2015, ACL.
[25] Tara N. Sainath, et al. A Comparison of Sequence-to-Sequence Models for Speech Recognition, 2017, INTERSPEECH.
[26] Li Shuangfeng, et al. TensorFlow Lite: On-Device Machine Learning Framework, 2020.
[27] Rohit Prabhavalkar, et al. Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer, 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[28] Hermann Ney, et al. A comprehensive analysis on attention models, 2019, NeurIPS.
[29] Daniel Povey, et al. The Kaldi Speech Recognition Toolkit, 2011.
[30] Quoc V. Le, et al. A Neural Transducer, 2015, arXiv:1511.04868.