Acoustic Modeling Using a Shallow CNN-HTSVM Architecture
暂无分享,去创建一个
Rodrigo Fernandes de Mello | Sandra M. Aluísio | Martha Dais Ferreira | Christopher Dane Shulby | S. Aluísio | R. Mello | C. Shulby | M. D. Ferreira
[1] Nitesh V. Chawla,et al. SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..
[2] Chih-Jen Lin,et al. Training and Testing Low-degree Polynomial Data Mappings via Linear SVM , 2010, J. Mach. Learn. Res..
[3] Peter Ladefoged,et al. Vowels and Consonants , 2000, Manchu Grammar.
[4] Honglak Lee,et al. Unsupervised feature learning for audio classification using convolutional deep belief networks , 2009, NIPS.
[5] Gerald Penn,et al. Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Geoffrey E. Hinton,et al. Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..
[7] Eduardo Lleida,et al. Articulatory Feature Extraction from Voice and Their Impact on Hybrid Acoustic Models , 2014, IberSPEECH.
[8] Dong Yu,et al. Pipelined BackPropagation for Context-Dependent Deep Neural Networks , 2012 .
[9] Navdeep Jaitly,et al. Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.
[10] Gerald Penn,et al. Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[11] Sarah Hoffmann,et al. Automatic Phone Segmentation , 2012 .
[12] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[13] Yoav Goldberg,et al. splitSVM: Fast, Space-Efficient, non-Heuristic, Polynomial Kernel Computation for NLP Applications , 2008, ACL.
[14] László Tóth. Phone recognition with hierarchical convolutional deep maxout networks , 2015, EURASIP J. Audio Speech Music. Process..
[15] Kun Li,et al. Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[16] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[17] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .
[18] Hsiao-Wuen Hon,et al. Speaker-independent phone recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..
[19] Daniel Jurafsky,et al. First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs , 2014, ArXiv.
[20] Yoram Singer,et al. An Online Algorithm for Hierarchical Phoneme Classification , 2004, MLMI.
[21] E. Chandra,et al. A Hierarchical Approach in Tamil Phoneme Classification using Support Vector Machine , 2015 .
[22] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[23] Sarel van Vuuren,et al. Relevance of time-frequency features for phonetic and speaker-channel classification , 2000, Speech Commun..
[24] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[25] David Miller,et al. The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text , 2004, LREC.
[26] Ke Chen,et al. Exploring hierarchical speech representations with a deep convolutional neural network , 2011 .
[27] Vytautas Rudžionis,et al. IMPLEMENTATION OF HIERARCHICAL PHONEME CLASSIFICATION APPROACH ON LTDIGITS CORPORA , 2015 .
[28] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[29] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[30] Bernhard Schölkopf,et al. Statistical Learning Theory: Models, Concepts, and Results , 2008, Inductive Logic.
[31] Steve Young,et al. The HTK book , 1995 .
[32] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Carla Lopes,et al. Phonetic Recognition Improvements through Input Feature Set Combination and Acoustic Context Window Widening , 2009 .
[34] Tara N. Sainath,et al. Deep convolutional neural networks for LVCSR , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[35] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.
[36] Noureddine Ellouze,et al. Study of Phonemes Confusions in Hierarchical Automatic Phoneme Recognition System , 2015, ArXiv.
[37] Vladimir I. Levenshtein,et al. Binary codes capable of correcting deletions, insertions, and reversals , 1965 .