Computational language acquisition by statistical bottom-up processing

Statistical learning of patterns from perceptual input is an increasingly central topic in the study of cognitive processing, including human language acquisition. We present an unsupervised computational method for statistical word learning that analyzes the transitional probabilities of successive phone pairs. The results indicate that word differentiation is possible with this type of approach, and they are in line with previous behavioral findings.

Index Terms: computational language acquisition, speech segmentation, speech clustering, statistical learning
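To make the idea of transitional probabilities concrete, the following is a minimal sketch and not the method evaluated in the paper. It assumes the input is already a sequence of discrete phone labels (here each character stands in for one phone), estimates P(next phone | current phone) from pair counts, and places a word boundary wherever that probability falls below a threshold. The toy words, the function names, and the threshold value of 0.9 are all hypothetical choices made for illustration.

```python
from collections import Counter


def transitional_probabilities(phones):
    """Estimate P(b | a) for every adjacent phone pair (a, b) in the sequence."""
    pair_counts = Counter(zip(phones, phones[1:]))
    # Count each phone only where it has a successor, so the estimates sum to 1.
    first_counts = Counter(phones[:-1])
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


def segment(phones, tp, threshold):
    """Split the sequence wherever the transitional probability is below threshold."""
    words, current = [], [phones[0]]
    for a, b in zip(phones, phones[1:]):
        if tp[(a, b)] < threshold:
            # Low predictability between a and b: treat it as a word boundary.
            words.append("".join(current))
            current = []
        current.append(b)
    words.append("".join(current))
    return words


# Hypothetical toy lexicon; no phone is shared between words.
lexicon = ["bidu", "kopa", "teng"]
order = [0, 1, 2, 0, 2, 1, 0, 1, 2, 1, 0, 2]
stream = "".join(lexicon[i] for i in order)  # continuous input, no boundaries

tp = transitional_probabilities(stream)
words_found = segment(stream, tp, threshold=0.9)  # threshold is an assumption
print(words_found)
```

In this toy stream, every within-word transition is fully predictable while transitions across word boundaries are not, so thresholding recovers the original words. Real phone sequences are noisier, and a fixed threshold is only one of several possible boundary criteria.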
