Towards Relevance and Sequence Modeling in Language Recognition
暂无分享,去创建一个
[1] Sriram Ganapathy,et al. End-to-end Language Recognition Using Attention Based Hierarchical Gated Recurrent Unit Models , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.
[3] Timothy J. Hazen,et al. Retrieval and browsing of spoken content , 2008, IEEE Signal Processing Magazine.
[4] Andreas Stolcke,et al. Within-class covariance normalization for SVM-based speaker recognition , 2006, INTERSPEECH.
[5] Satish Kumar,et al. The LEAP Language Recognition System for LRE 2017 Challenge - Improvements and Error Analysis , 2018, Odyssey.
[6] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[7] Wonyong Sung,et al. A statistical model-based voice activity detection , 1999, IEEE Signal Processing Letters.
[8] John R. Hershey,et al. Hybrid CTC/Attention Architecture for End-to-End Speech Recognition , 2017, IEEE Journal of Selected Topics in Signal Processing.
[9] A. Waibel,et al. Multilinguality in speech and spoken language systems , 2000, Proceedings of the IEEE.
[10] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[11] Tanja Schultz,et al. Language-independent and language-adaptive acoustic modeling for speech recognition , 2001, Speech Commun..
[12] Douglas A. Reynolds,et al. Deep Neural Network Approaches to Speaker and Language Recognition , 2015, IEEE Signal Processing Letters.
[13] S. C. Kremer,et al. Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .
[14] Sanjeev Khudanpur,et al. Spoken Language Recognition using X-vectors , 2018, Odyssey.
[15] Jürgen Schmidhuber,et al. Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.
[16] Mitchell McLaren,et al. Analyzing the Effect of Channel Mismatch on the SRI Language Recognition Evaluation 2015 System , 2016, Odyssey.
[17] Marc A. Zissman,et al. Comparison of : Four Approaches to Automatic Language Identification of Telephone Speech , 2004 .
[18] Kevin Walker,et al. The RATS radio traffic collection system , 2012, Odyssey.
[19] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[20] Yoshua Bengio,et al. Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .
[21] Jirí Navrátil,et al. Spoken language recognition-a step toward multilinguality in speech processing , 2001, IEEE Trans. Speech Audio Process..
[22] Joaquín González-Rodríguez,et al. Automatic language identification using deep neural networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] Mohamed Kamal Omar,et al. Unsupervised channel adaptation for language identification using co-training , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[24] Hervé Bourlard,et al. New entropy based combination rules in HMM/ANN multi-stream ASR , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[25] Yun Lei,et al. A novel scheme for speaker recognition using a phonetically-aware deep neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Daniel Garcia-Romero,et al. Time delay deep neural network-based universal background models for speaker recognition , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[27] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.
[28] Hari Krishna Vydana,et al. Curriculum learning based approach for noise robust language identification using DNN with attention , 2018, Expert Syst. Appl..
[29] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[30] Suryakanth V. Gangashetty,et al. An Investigation of Deep Neural Network Architectures for Language Recognition in Indian Languages , 2016, INTERSPEECH.
[31] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.
[32] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.
[33] Mohamed Kamal Omar,et al. Robust language identification using convolutional neural network features , 2014, INTERSPEECH.
[34] Daniel Garcia-Romero,et al. Analysis of i-vector Length Normalization in Speaker Recognition Systems , 2011, INTERSPEECH.
[35] Shubham Bansal,et al. Speaker and Language Aware Training for End-to-End ASR , 2019, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[36] Sriram Ganapathy,et al. Attention Based Hybrid i-Vector BLSTM Model for Language Recognition , 2019, INTERSPEECH.
[37] Joaquín González-Rodríguez,et al. Evaluation of an LSTM-RNN System in Different NIST Language Recognition Frameworks , 2016, Odyssey.
[38] Mohamed Kamal Omar,et al. TRAP language identification system for RATS phase II evaluation , 2013, INTERSPEECH.
[39] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[40] Jean-Luc Gauvain,et al. Phonotactic Language Recognition Using MLP Features , 2012, INTERSPEECH.
[41] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[42] Daniel Povey,et al. MUSAN: A Music, Speech, and Noise Corpus , 2015, ArXiv.
[43] Yun Lei,et al. Softsad: Integrated frame-based speech confidence for speaker recognition , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[44] Bin Ma,et al. Spoken Language Recognition: From Fundamentals to Practice , 2013, Proceedings of the IEEE.
[45] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[46] Fabio Valente. A Novel Criterion for Classifiers Combination in Multistream Speech Recognition , 2009, IEEE Signal Processing Letters.
[47] Douglas A. Reynolds,et al. Language Recognition via i-vectors and Dimensionality Reduction , 2011, INTERSPEECH.
[48] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[49] Alvin F. Martin,et al. The 2011 NIST Language Recognition Evaluation , 2010, INTERSPEECH.
[50] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.