[1] Quoc V. Le,et al. SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition , 2019, INTERSPEECH.
[2] Brian Roark,et al. Bringing contextual information to Google speech recognition , 2015, INTERSPEECH.
[3] Tara N. Sainath,et al. Contextual Speech Recognition with Difficult Negative Training Examples , 2018, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Dong Yu,et al. Component Fusion: Learning Replaceable Language Model Component for End-to-end Speech Recognition System , 2019, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Alexei Baevski,et al. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations , 2020, NeurIPS.
[6] Yingbo Zhou,et al. Fast and Robust Unsupervised Contextual Biasing for Speech Recognition , 2020, ArXiv.
[7] Mohammad Norouzi,et al. SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network , 2021, ArXiv.
[8] Yoshua Bengio,et al. On Using Monolingual Corpora in Neural Machine Translation , 2015, ArXiv.
[9] Tara N. Sainath,et al. Deep Context: End-to-end Contextual Speech Recognition , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[10] Gil Keren,et al. Deep Shallow Fusion for RNN-T Personalization , 2020, 2021 IEEE Spoken Language Technology Workshop (SLT).
[11] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[12] Yu Zhang,et al. Conformer: Convolution-augmented Transformer for Speech Recognition , 2020, INTERSPEECH.
[13] Gil Keren,et al. Contextual RNN-T For Open Domain ASR , 2020, INTERSPEECH.
[14] Veselin Stoyanov,et al. Simple Fusion: Return of the Language Model , 2018, WMT.
[15] Petar S. Aleksic,et al. Unsupervised context learning for speech recognition , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[16] Richard Socher,et al. Regularizing and Optimizing LSTM Language Models , 2017, ICLR.
[17] Shinji Watanabe,et al. End-to-end Speech Recognition With Word-Based RNN Language Models , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[18] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Yonghui Wu,et al. ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context , 2020, INTERSPEECH.
[20] Gil Keren,et al. Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion , 2021, INTERSPEECH.
[21] Quoc V. Le,et al. Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition , 2020, ArXiv.
[22] Kishore Papineni,et al. Why Inverse Document Frequency? , 2001, NAACL.
[23] Tara N. Sainath,et al. Shallow-Fusion End-to-End Contextual Biasing , 2019, INTERSPEECH.
[24] Alex Graves,et al. Sequence Transduction with Recurrent Neural Networks , 2012, ArXiv.