A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition
暂无分享,去创建一个
Kenneth Ward Church | Florian Metze | Aren Jansen | Ian McGraw | Hynek Hermansky | Naomi Feldman | Sanjeev Khudanpur | Ewan Dunbar | Mark Johnson | Abdellah Fourtassi | Emmanuel Dupoux | Samuel Thomas | Kenneth Church | Sharon Goldwater | Benjamin Börschinger | Richard C. Rose | David Harwath | Justin Chiu | Thomas Schatz | Vijayaditya Peddinti | Pascal Clark | Balakrishnan Varadarajan | Keith Levin | Atta Norouzian | Chia-ying Lee | Naomi H. Feldman | Mike Seltzer | Erin Bennett | Rachael Richardson | S. Khudanpur | M. Seltzer | H. Hermansky | A. Jansen | Mark Johnson | Florian Metze | S. Goldwater | Emmanuel Dupoux | Balakrishnan Varadarajan | Ian McGraw | David F. Harwath | R. Rose | Samuel Thomas | Benjamin Börschinger | Vijayaditya Peddinti | Thomas Schatz | P. Clark | Erin D. Bennett | J. Chiu | Ewan Dunbar | Abdellah Fourtassi | Chia-ying Lee | Keith Levin | A. Norouzian | Rachael Richardson | Naomi H Feldman | Keith D. Levin
[1] Hynek Hermansky,et al. The effective second formant F2' and the vocal tract front-cavity , 1989, International Conference on Acoustics, Speech, and Signal Processing,.
[2] Carl de Marcken,et al. The Unsupervised Acquisition of a Lexicon from Continuous Speech , 1995, ArXiv.
[3] Steven Greenberg,et al. INSIGHTS INTO SPOKEN LANGUAGE GLEANED FROM PHONETIC TRANSCRIPTION OF THE SWITCHBOARD CORPUS , 1996 .
[4] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[5] George Zavaliagkos,et al. Using untranscribed training data to improve performance , 1998, ICSLP.
[6] Morten H. Christiansen,et al. Learning to Segment Speech Using Multiple Cues: A Connectionist Model , 1998 .
[7] J. Werker,et al. Influences on infant speech processing: toward a new synthesis. , 1999, Annual review of psychology.
[8] Anand Venkataraman,et al. A Statistical Model for Word Discovery in Transcribed Speech , 2001, CL.
[9] Michael R. Brent,et al. An Efficient, Probabilistically Sound Algorithm for Segmentation and Word Discovery , 1999, Machine Learning.
[10] Richard M. Schwartz,et al. Unsupervised Training on Large Amounts of Broadcast News Data , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[11] Thomas L. Griffiths,et al. Adaptor Grammars: A Framework for Specifying Compositional Nonparametric Bayesian Models , 2006, NIPS.
[12] Mark Johnson,et al. Using Adaptor Grammars to Identify Synergies in the Unsupervised Acquisition of Linguistic Structure , 2008, ACL.
[13] Margaret M. Fleck. Lexicalized Phonotactic Word Segmentation , 2008, ACL.
[14] Sanjeev Khudanpur,et al. Unsupervised Learning of Acoustic Sub-word Units , 2008, ACL.
[15] James R. Glass,et al. Unsupervised Pattern Discovery in Speech , 2008, IEEE Transactions on Audio, Speech, and Language Processing.
[16] James R. Glass,et al. Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[17] Mark Johnson,et al. Improving nonparameteric Bayesian inference: experiments on unsupervised word segmentation with adaptor grammars , 2009, NAACL.
[18] T. Griffiths,et al. A Bayesian framework for word segmentation: Exploring the effects of context , 2009, Cognition.
[19] Thomas L. Griffiths,et al. Learning phonetic categories by learning a lexicon , 2009 .
[20] Hynek Hermansky,et al. Phoneme recognition using spectral envelope and modulation frequency features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[21] Kenneth Ward Church,et al. Towards spoken term discovery at scale with zero resources , 2010, INTERSPEECH.
[22] Aren Jansen,et al. Efficient spoken term discovery using randomized algorithms , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[23] Herbert Gish,et al. Topic modeling for spoken documents using only phonetic information , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[24] Dong Yu,et al. Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[25] Aren Jansen,et al. Towards Unsupervised Training of Speaker Independent Acoustic Models , 2011, INTERSPEECH.
[26] Aren Jansen,et al. Rapid Evaluation of Speech Representations for Spoken Term Discovery , 2011, INTERSPEECH.
[27] Herbert Gish,et al. Unsupervised Audio Patterns Discovery Using HMM-Based Self-Organized Units , 2011, INTERSPEECH.
[28] Micha Elsner,et al. Bootstrapping a Unified Model of Lexical and Phonetic Acquisition , 2012, ACL.
[29] Tatsuya Kawahara,et al. Bayesian Learning of a Language Model from Continuous Speech , 2012, IEICE Trans. Inf. Syst..
[30] James R. Glass,et al. A Nonparametric Bayesian Approach to Acoustic Model Discovery , 2012, ACL.
[31] Richard C. Rose,et al. Facilitating open vocabulary spoken term detection using a multiple pass hybrid search algorithm , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[32] X. Anguera. Speaker independent discriminant feature extraction for acoustic pattern-matching , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Aren Jansen,et al. Intrinsic Spectral Analysis for Zero and High Resource Speech Recognition , 2012, INTERSPEECH.
[34] Kenneth Ward Church,et al. Deep neural network features and semi-supervised training for low resource speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[35] Aren Jansen,et al. Weak top-down constraints for unsupervised acoustic model training , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[36] Sharon Peperkamp,et al. Learning Phonemes With a Proto-Lexicon , 2013, Cogn. Sci..