论文信息 - Preserving subsegmental variation in modeling word segmentation (or, the raising of baby Mondegreen)

Preserving subsegmental variation in modeling word segmentation (or, the raising of baby Mondegreen)

Many computational models have been developed to show how infants break apart utterances into words prior to building a vocabulary—the “word segmentation task.” Most models assume that infants, upon hearing an utterance, represent this input as a string of segments. One type of model uses statistical cues calculated from the distribution of segments within the child-directed speech to locate those points most likely to contain word boundaries. However, these models have been tested in relatively few languages, with little attention paid to how different phonological structures may affect the relative effectiveness of particular statistical heuristics. This dissertation addresses this issue by comparing the performance of two classes of distribution-based statistical cues on a corpus of Modern Greek, a language with a phonotactic structure significantly different from that of English, and shows how these differences change the relative effectiveness of these cues. Another fundamental issue critically examined in this dissertation is the practice of representing input as a string of segments. Such a representation implicitly assumes complete certainty as to the phonemic identity of each segment. This runs counter both to standard practice in automatic speech recognition (where “hard decisions” are eschewed) and, more crucially, overestimates the ability of infants to parse and identify those segments from the spoken input. If even adult native speakers (with the benefit of higher-level linguistic knowledge, such as a

C. Anton Rytting | Christopher Anton Rytting

[1] F. D. Saussure,et al. Cours de linguistique générale@@@Cours de linguistique generale , 1972 .

[2] Zellig S. Harris,et al. From Phoneme to Morpheme , 1955 .

[3] Bhuvana Ramabhadran,et al. Improvements in English ASR for the MALACH project using syllable-centric models , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[4] Jeff Mielke,et al. The Emergence of Distinctive Features , 2008 .

[5] A S House,et al. Phonological oppositions in children: a perceptual study. , 1971, The Journal of the Acoustical Society of America.

[6] P. Eimas. Segmental and syllabic representations in the perception of speech by young infants. , 1999, The Journal of the Acoustical Society of America.

[7] P. Jusczyk,et al. The cocktail party effect in infants. , 1995, Perception & psychophysics.

[8] Xiaofei Lu,et al. Hybrid models for Chinese unknown word resolution , 2006 .

[9] C. Anton Rytting. Segment Predictability as a Cue in Word Segmentation: Application to Modern Greek , 2004, SIGMORPHON@ACL.

[10] Katherine S. White,et al. A Statistical Basis for Speech Sound Discrimination , 2003, Language and speech.

[11] P. Jusczyk,et al. Do infants segment words or recurring contiguous patterns? , 2001, Journal of experimental psychology. Human perception and performance.