Transfer learning for speech and language processing
暂无分享,去创建一个
[1] Yoshua Bengio,et al. Deep Learning of Representations for Unsupervised and Transfer Learning , 2011, ICML Unsupervised and Transfer Learning.
[2] Dong Yu,et al. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Ivor W. Tsang,et al. Hybrid Heterogeneous Transfer Learning through Deep Learning , 2014, AAAI.
[4] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..
[5] Ruslan Salakhutdinov,et al. Multimodal Neural Language Models , 2014, ICML.
[6] Ciro Martins,et al. Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system , 1995, EUROSPEECH.
[7] Yifan Gong,et al. Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.
[9] Yoshua Bengio,et al. BilBOWA: Fast Bilingual Distributed Representations without Word Alignments , 2014, ICML.
[10] Ngoc Thang Vu,et al. Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Keiichi Tokuda,et al. Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[12] Khe Chai Sim,et al. Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems , 2010, INTERSPEECH.
[13] Quoc V. Le,et al. Grounded Compositional Semantics for Finding and Describing Images with Sentences , 2014, TACL.
[14] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[15] Steve Renals,et al. Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[16] Jonathan Baxter,et al. A Model of Inductive Bias Learning , 2000, J. Artif. Intell. Res..
[17] Oscar Täckström. Nudging the Envelope of Direct Transfer Methods for Multilingual Named Entity Recognition , 2012, HLT-NAACL 2012.
[18] M Gibson,et al. Unsupervised Intralingual and Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis Using Two-Pass Decision Tree Construction , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[19] John Blitzer,et al. Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.
[20] Ivor W. Tsang,et al. Domain Adaptation via Transfer Component Analysis , 2009, IEEE Transactions on Neural Networks.
[21] Kaisheng Yao,et al. Adaptation of context-dependent deep neural networks for automatic speech recognition , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[22] Dong Wang,et al. Recognize foreign low-frequency words with similar pairs , 2015, INTERSPEECH.
[23] Trevor Darrell,et al. What you saw is not what you get: Domain adaptation using asymmetric kernel transforms , 2011, CVPR 2011.
[24] C A Nelson,et al. Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.
[25] Qiang Yang,et al. Heterogeneous Transfer Learning for Image Classification , 2011, AAAI.
[26] Yun Tang,et al. Deep neural network trained with speaker representation for speaker normalization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[27] Dong Wang,et al. Normalized Word Embedding and Orthogonal Transform for Bilingual Word Translation , 2015, NAACL.
[28] Qiang Yang,et al. Multiple-Goal Recognition from Low-Level Signals , 2005, AAAI.
[29] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[30] Yoshua Bengio,et al. Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.
[31] Georg Heigold,et al. Multilingual acoustic models using distributed deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[32] Themos Stafylakis,et al. I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Steve Renals,et al. Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[34] Dedre Gentner,et al. Structure-Mapping: A Theoretical Framework for Analogy , 1983, Cogn. Sci..
[35] Yoshihiko Nankaku,et al. The effect of neural networks in statistical parametric speech synthesis , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[36] Mark J. F. Gales,et al. Language independent and unsupervised acoustic models for speech recognition and keyword spotting , 2014, INTERSPEECH.
[37] Heiga Zen,et al. Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[38] Xiaojin Zhu,et al. Semi-Supervised Learning Literature Survey , 2005 .
[39] Sayan Mukherjee,et al. Estimating variable structure and dependence in multitask learning via gradients , 2011, Machine Learning.
[40] Yoshua Bengio,et al. Neural Probabilistic Language Models , 2006 .
[41] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.
[42] Yoshua Bengio,et al. Zero-data Learning of New Tasks , 2008, AAAI.
[43] Steve Renals,et al. Multi-level adaptive networks in tandem and hybrid ASR systems , 2013, ICASSP.
[44] Yongqiang Wang,et al. Adaptation of deep neural network acoustic models using factorised i-vectors , 2014, INTERSPEECH.
[45] Gemma Boleda,et al. Distributional Semantics in Technicolor , 2012, ACL.
[46] George Saon,et al. Speaker adaptation of neural network acoustic models using i-vectors , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[47] Zhizheng Wu,et al. A study of speaker adaptation for DNN-based speech synthesis , 2015, INTERSPEECH.
[48] Jianfeng Gao,et al. Deep Learning for Natural Language Processing and Related Applications (Tutorial at ICASSP) , 2014 .
[49] Tara N. Sainath,et al. FUNDAMENTAL TECHNOLOGIES IN MODERN SPEECH RECOGNITION Digital Object Identifier 10.1109/MSP.2012.2205597 , 2012 .
[50] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.
[51] YamagishiJunichi,et al. Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training , 2007 .
[52] Yifan Gong,et al. Learning small-size DNN with output-distribution-based criteria , 2014, INTERSPEECH.
[53] Benoît Favre,et al. Speaker adaptation of DNN-based ASR with i-vectors: does it actually adapt models to speakers? , 2014, INTERSPEECH.
[54] Yifan Gong,et al. Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[55] Hui Jiang,et al. Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[56] Florian Metze,et al. On speaker adaptation of long short-term memory recurrent neural networks , 2015, INTERSPEECH.
[57] Fei-FeiLi,et al. One-Shot Learning of Object Categories , 2006 .
[58] Manaal Faruqui,et al. Improving Vector Space Word Representations Using Multilingual Correlation , 2014, EACL.
[59] Takao Kobayashi,et al. Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[60] Erik McDermott,et al. Deep neural networks for small footprint text-dependent speaker verification , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[61] Kenneth Ward Church,et al. Deep neural network features and semi-supervised training for low resource speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[62] Dominic M. Dousa. THE UNIVERSITY OF TEXAS AT EL PASO , 2004 .
[63] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.
[64] D. Gentner,et al. Reasoning and learning by analogy. , 1997, The American psychologist.
[65] Ivan Laptev,et al. Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[66] Geoffrey E. Hinton,et al. Distilling the Knowledge in a Neural Network , 2015, ArXiv.
[67] Dong Wang,et al. Knowledge Transfer Pre-training , 2015, ArXiv.
[68] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[69] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[70] Hank Liao,et al. Speaker adaptation of context dependent deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[71] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.
[72] Hui Liang,et al. A comparison of supervised and unsupervised cross-lingual speaker adaptation approaches for HMM-based speech synthesis , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[73] Florian Metze,et al. DNN acoustic modeling with modular multi-lingual feature extraction networks , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[74] Alex Acero,et al. Separating Speaker and Environmental Variability Using Factored Transforms , 2011, INTERSPEECH.
[75] Dong Yu,et al. Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..
[76] Ivan Titov,et al. Inducing Crosslingual Distributed Representations of Words , 2012, COLING.
[77] Justus H. Piater,et al. Online Learning of Gaussian Mixture Models - a Two-Level Approach , 2008, VISAPP.
[78] Ralf Schlüter,et al. Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[79] Yoshua Bengio,et al. On the Expressive Power of Deep Architectures , 2011, ALT.
[80] Pascal Vincent,et al. Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..
[81] Michael I. Jordan,et al. Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.
[82] Yoshua Bengio,et al. FitNets: Hints for Thin Deep Nets , 2014, ICLR.
[83] Yao Lu. Unsupervised Learning on Neural Network Outputs: With Application in Zero-Shot Learning , 2016, IJCAI.
[84] J. Carbonell,et al. Learning by Analogy: Formulating and Generalizing Plans from Past Experience , 1983 .
[85] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[86] Chang Wang,et al. Heterogeneous Domain Adaptation Using Manifold Alignment , 2011, IJCAI.
[87] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.
[88] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..
[89] Pietro Laface,et al. Linear hidden transformations for adaptation of hybrid ANN/HMM models , 2007, Speech Commun..
[90] Dong Wang,et al. Learning from LDA Using Deep Neural Networks , 2015, NLPCC/ICCPOL.
[91] Steve Renals,et al. Multilingual training of deep neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[92] Zhi Jin,et al. Distilling Word Embeddings: An Encoding Approach , 2015, CIKM.
[93] Kaisheng Yao,et al. KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[94] Jacob Benesty,et al. Springer handbook of speech processing , 2007, Springer Handbooks.
[95] Yoshihiko Nankaku,et al. State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis , 2009, INTERSPEECH.
[96] Massimiliano Pontil,et al. Exploiting Unrelated Tasks in Multi-Task Learning , 2012, AISTATS.
[97] Marie-Francine Moens,et al. Knowledge Transfer across Multilingual Corpora via Latent Topics , 2011, PAKDD.
[98] Christopher Joseph Pal,et al. Heterogeneous Transfer Learning with RBMs , 2011, AAAI.
[99] Pavel Matejka,et al. Multilingual bottleneck features for language recognition , 2015, INTERSPEECH.
[100] Rich Caruana,et al. Do Deep Nets Really Need to be Deep? , 2013, NIPS.
[101] Andrés Montoyo,et al. Advances on natural language processing , 2007, Data Knowl. Eng..
[102] Pietro Perona,et al. One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[103] Patrick Kenny,et al. Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[104] Xiao Li,et al. Machine Learning Paradigms for Speech Recognition: An Overview , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[105] Zhiyuan Tang,et al. Recurrent neural network training with dark knowledge transfer , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[106] James H. Martin,et al. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.
[107] Geoffrey E. Hinton,et al. A Scalable Hierarchical Distributed Language Model , 2008, NIPS.
[108] William Chan,et al. Transferring knowledge from a RNN to a DNN , 2015, INTERSPEECH.
[109] Miles Osborne,et al. Statistical Machine Translation , 2010, Encyclopedia of Machine Learning and Data Mining.
[110] D. Gentner,et al. Reasoning and learning by analogy. , 1997, The American psychologist.
[111] Florian Metze,et al. Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[112] Rada Mihalcea,et al. Going Beyond Text: A Hybrid Image-Text Approach for Measuring Word Relatedness , 2011, IJCNLP.
[113] Yao Lu. Unsupervised Learning on Neural Network Outputs , 2015, ArXiv.
[114] Li-Rong Dai,et al. Speaker Adaptation of Hybrid NN/HMM Model for Speech Recognition Based on Singular Value Decomposition , 2014, Journal of Signal Processing Systems.
[115] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..
[116] Ivor W. Tsang,et al. Learning with Augmented Features for Heterogeneous Domain Adaptation , 2012, ICML.
[117] Yoshua Bengio,et al. Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.
[118] Cheung-Chi Leung,et al. Joint acoustic modeling of triphones and trigraphemes by multi-task learning deep neural networks for low-resource speech recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[119] Jieping Ye,et al. Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis , 2015, IEEE Transactions on Big Data.
[120] Yoshua Bengio,et al. Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.
[121] Dong Wang,et al. Music removal by convolutional denoising autoencoder in speech recognition , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).
[122] Rajat Raina,et al. Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.
[123] Nitish Srivastava,et al. Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..
[124] Andrew Y. Ng,et al. Zero-Shot Learning Through Cross-Modal Transfer , 2013, NIPS.
[125] Philip S. Yu,et al. Transfer Learning on Heterogenous Feature Spaces via Spectral Transformation , 2010, 2010 IEEE International Conference on Data Mining.
[126] Marc'Aurelio Ranzato,et al. DeViSE: A Deep Visual-Semantic Embedding Model , 2013, NIPS.
[127] Jianmin Wang,et al. Transfer Learning with Graph Co-Regularization , 2012, IEEE Transactions on Knowledge and Data Engineering.
[128] Fei-Fei Li,et al. Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[129] Jun Du,et al. Cross-language transfer learning for deep neural network based speech enhancement , 2014, The 9th International Symposium on Chinese Spoken Language Processing.
[130] Hui Jiang,et al. Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition , 2013, INTERSPEECH.
[131] Koichi Shinoda. Speaker adaptation techniques for automatic speech recognition , 2011 .
[132] Qiang Yang,et al. Transfer Learning by Structural Analogy , 2011, AAAI.
[133] Qiang Yang,et al. Translated Learning: Transfer Learning across Different Feature Spaces , 2008, NIPS.
[134] Florian Metze,et al. Towards speaker adaptive training of deep neural network acoustic models , 2014, INTERSPEECH.
[135] Tanja Schultz,et al. Language-independent and language-adaptive acoustic modeling for speech recognition , 2001, Speech Commun..
[136] Phil Blunsom,et al. Multilingual Models for Compositional Distributed Semantics , 2014, ACL.
[137] Roberto Cipolla,et al. Incremental Learning of Temporally-Coherent Gaussian Mixture Models , 2005, BMVC.
[138] Paul E. Utgoff,et al. Incremental Induction of Decision Trees , 1989, Machine Learning.
[139] Geoffrey Zweig,et al. Recent advances in deep learning for speech research at Microsoft , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[140] Andrew W. Senior,et al. Improving DNN speaker independence with I-vector inputs , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[141] Blaise Potard,et al. Preliminary Work on Speaker Adaptation for DNN-Based Speech Synthesis , 2015 .
[142] Takao Kobayashi,et al. Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training , 2007, IEICE Trans. Inf. Syst..
[143] Peng Hao,et al. Transfer learning using computational intelligence: A survey , 2015, Knowl. Based Syst..
[144] Li-Rong Dai,et al. Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[145] Lei Shi,et al. Cross Language Text Classification by Model Translation and Semi-Supervised Learning , 2010, EMNLP.
[146] Kaisheng Yao,et al. A basis representation of constrained MLLR transforms for robust adaptation , 2012, Comput. Speech Lang..
[147] S. M. Siniscalchi,et al. Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[148] Martin Karafiát,et al. The language-independent bottleneck features , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[149] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.
[150] Dan Klein,et al. Syntactic Transfer Using a Bilingual Lexicon , 2012, EMNLP-CoNLL.
[151] Jakob Uszkoreit,et al. Cross-lingual Word Clusters for Direct Transfer of Linguistic Structure , 2012, NAACL.
[152] Benno Stein,et al. Cross-Lingual Adaptation Using Structural Correspondence Learning , 2010, TIST.
[153] Quoc V. Le,et al. Exploiting Similarities among Languages for Machine Translation , 2013, ArXiv.
[154] VincentPascal,et al. Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010 .
[155] Kai Yu,et al. Multi-task learning for text-dependent speaker verification , 2015, INTERSPEECH.