Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition
暂无分享,去创建一个
[1] Ian McGraw,et al. Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction , 2021, Interspeech.
[2] P. Woodland,et al. Residual Energy-Based Models for End-to-End Speech Recognition , 2021, Interspeech.
[3] Tara N. Sainath,et al. Learning Word-Level Confidence for Subword End-To-End ASR , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Tara N. Sainath,et al. Less is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Liangliang Cao,et al. Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition , 2020, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Ankur Kumar,et al. Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios , 2020, INTERSPEECH.
[7] Carl Rubino. The Effect of Linguistic Parameters in CLIR Performance , 2020, CLSSTS@LREC.
[8] Anton Ragni,et al. Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Percy Liang,et al. Verified Uncertainty Calibration , 2019, NeurIPS.
[10] Adrian Lancucki,et al. Lattice Generation in Attention-Based Speech Recognition Models , 2019, INTERSPEECH.
[11] Peter A. Flach,et al. Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration , 2019, NeurIPS.
[12] Kai Fan,et al. Lattice Transformer for Speech Translation , 2019, ACL.
[13] Alex Waibel,et al. Self-Attentional Models for Lattice Inputs , 2019, ACL.
[14] Mark J. F. Gales,et al. Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Maurizio Filippone,et al. Calibrating Deep Convolutional Gaussian Processes , 2018, AISTATS.
[16] Mark J. F. Gales,et al. Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks , 2018, 2018 IEEE Spoken Language Technology Workshop (SLT).
[17] Sunita Sarawagi,et al. Trainable Calibration Measures For Neural Networks From Kernel Mean Embeddings , 2018, ICML.
[18] Adrià Giménez,et al. Speaker-Adapted Confidence Measures for ASR Using Deep Bidirectional Recurrent Neural Networks , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[19] Yue Zhang,et al. Chinese NER Using Lattice LSTM , 2018, ACL.
[20] Graham Neubig,et al. Neural Lattice Language Models , 2018, TACL.
[21] Hongyi Zhang,et al. mixup: Beyond Empirical Risk Minimization , 2017, ICLR.
[22] Kilian Q. Weinberger,et al. On Calibration of Modern Neural Networks , 2017, ICML.
[23] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[24] Atsunori Ogawa,et al. Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks , 2017, Speech Commun..
[25] Matthias Sperber,et al. Neural Lattice-to-Sequence Models for Uncertain Inputs , 2017, EMNLP.
[26] Mark J. F. Gales,et al. Stimulated training for automatic speech recognition and keyword search in limited resource conditions , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[27] Charles Blundell,et al. Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.
[28] Rongrong Ji,et al. Lattice-Based Recurrent Neural Network Encoders for Neural Machine Translation , 2016, AAAI.
[29] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.
[30] Ariya Rastrow,et al. LatticeRnn: Recurrent Neural Networks Over Lattices , 2016, INTERSPEECH.
[31] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[32] Richard S. Zemel,et al. Gated Graph Sequence Neural Networks , 2015, ICLR.
[33] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[34] Kaisheng Yao,et al. Estimating confidence scores on ASR results using recurrent neural networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[35] Atsunori Ogawa,et al. ASR error detection and recognition rate estimation using deep bidirectional recurrent neural networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[36] Mark J. F. Gales,et al. Unicode-based graphemic systems for limited resource languages , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[37] Christopher D. Manning,et al. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.
[38] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[39] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[40] Mark J. F. Gales,et al. Speech recognition and keyword spotting for low-resource languages: Babel project research at CUED , 2014, SLTU.
[41] Philip C. Woodland,et al. Detecting deletions in ASR output , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[42] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[43] Alex Graves,et al. Sequence Transduction with Recurrent Neural Networks , 2012, ArXiv.
[44] Andrew McCallum,et al. An Introduction to Conditional Random Fields , 2010, Found. Trends Mach. Learn..
[45] Philip C. Woodland,et al. Combining Information Sources for Confidence Estimation with CRF Models , 2011, INTERSPEECH.
[46] Mark J. F. Gales,et al. Derivative kernels for noise robust ASR , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[47] L. Deng,et al. Calibration of Confidence Measures in Speech Recognition , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[48] Geoffrey Zweig,et al. A segmental CRF approach to large vocabulary continuous speech recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[49] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .
[50] Heiga Zen,et al. Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences , 2007, Comput. Speech Lang..
[51] Jonathan Le Roux,et al. Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[52] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[53] Alex Acero,et al. Hidden conditional random fields for phone classification , 2005, INTERSPEECH.
[54] Peder A. Olsen,et al. Theory and practice of acoustic confusability , 2002, Comput. Speech Lang..
[55] Hermann Ney,et al. Confidence measures for large vocabulary continuous speech recognition , 2001, IEEE Trans. Speech Audio Process..
[56] Hermann Ney,et al. Improved MLLR speaker adaptation using confidence measures for conversational speech recognition , 2000, INTERSPEECH.
[57] Andreas Stolcke,et al. Finding consensus in speech recognition: word error minimization and other applications of confusion networks , 2000, Comput. Speech Lang..
[58] Gunnar Evermann,et al. Large vocabulary decoding and confidence estimation using word posterior probabilities , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[59] Gunnar Evermann,et al. Posterior probability decoding, confidence estimation and system combination , 2000 .
[60] Alexander Fischer,et al. Domain adaptation for robust automatic speech recognition in car environments , 1999, EUROSPEECH.
[61] John Platt,et al. Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .
[62] David Haussler,et al. Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.
[63] George Zavaliagkos,et al. Using untranscribed training data to improve performance , 1998, ICSLP.
[64] Jonathan G. Fiscus,et al. A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER) , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.
[65] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[66] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[67] Thomas Schaaf,et al. Estimating confidence using word lattices , 1997, EUROSPEECH.
[68] Larry Gillick,et al. A probabilistic approach to confidence estimation and evaluation , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[69] Chalapathy Neti,et al. Word-based confidence measures as a guide for stack search in speech recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[70] Mitch Weintraub,et al. Neural-network based measures of confidence for word recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[71] Stephen J. Cox,et al. Confidence measures for the SWITCHBOARD database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[72] Herbert Gish,et al. Understanding and improving speech recognition performance through the use of diagnostic tools , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[73] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.