Methods for minimizing the confounding effects of word length in the analysis of phonotactic probability and neighborhood density.

Recent research suggests that phonotactic probability (the likelihood of occurrence of a sound sequence) and neighborhood density (the number of words phonologically similar to a given word) influence spoken language processing and acquisition across the lifespan in both normal and clinical populations. The majority of research in this area has tended to focus on controlled laboratory studies rather than naturalistic data such as spontaneous speech samples or elicited probes. One difficulty in applying current measures of phonotactic probability and neighborhood density to more naturalistic samples is the significant correlation between these variables and word length. This study examines several alternative transformations of phonotactic probability and neighborhood density as a means of reducing or eliminating this correlation with word length. Computational analyses of the words in a large database and reanalysis of archival data supported the use of z scores for the analysis of phonotactic probability as a continuous variable and the use of median transformation scores for the analysis of phonotactic probability as a dichotomous variable. Neighborhood density results were less clear with the conclusion that analysis of neighborhood density as a continuous variable warrants further investigation to differentiate the utility of z scores in comparison to median transformation scores. Furthermore, balanced dichotomous coding of neighborhood density was difficult to achieve, suggesting that analysis of neighborhood density as a dichotomous variable should be approached with caution. Recommendations for future application and analyses are discussed.

[1]  M. Vitevitch Naturalistic and Experimental Analyses of Word Frequency and Neighborhood Density Effects in Slips of the Ear , 2002, Language and speech.

[2]  B. Munson,et al.  Phonological pattern frequency and speech production in adults and children. , 2001, Journal of speech, language, and hearing research : JSLHR.

[3]  Holly L. Storkel,et al.  Do children acquire dense neighborhoods? An investigation of similarity neighborhoods in lexical acquisition , 2004, Applied Psycholinguistics.

[4]  Erik D. Thiessen,et al.  When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. , 2003, Developmental psychology.

[5]  P. Luce,et al.  Phonotactics, density, and entropy in spoken word recognition , 2001 .

[6]  Holly L. Storkel,et al.  The effect of probabilistic phonotactics on lexical acquisition , 2000 .

[7]  D. Kleinbaum,et al.  Applied regression analysis and other multivariable methods, 3rd ed. , 1998 .

[8]  Todd M. Bailey,et al.  Determinants of wordlikeness: Phonotactics or lexical neighborhoods? , 2001 .

[9]  M. Vitevitch,et al.  Sublexical and lexical representations in speech production: effects of phonotactic probability and onset density. , 2004, Journal of experimental psychology. Learning, memory, and cognition.

[10]  M. Vitevitch,et al.  The neighborhood characteristics of malapropisms. , 1996, Language and speech.

[11]  P. Jusczyk,et al.  Infants' sensitivity to phonotactic patterns in the native language. , 1994 .

[12]  J. Gierut,et al.  Lexical constraints in phonological acquisition , 1999, Journal of Child Language.

[13]  J. Gordon Phonological neighborhood effects in aphasic speech errors: spontaneous and structured contexts , 2002, Brain and Language.

[14]  Holly L Storkel,et al.  Learning new words II: Phonotactic probability in verb learning. , 2003, Journal of speech, language, and hearing research : JSLHR.

[15]  H. Storkel,et al.  Markedness and the grammar in lexical diffusion of fricatives , 2002, Clinical linguistics & phonetics.

[16]  Judith A. Gierut,et al.  Lexical Influences on Interword Variation , 2002 .

[17]  M. Sommers,et al.  Inhibitory processes and spoken word recognition in young and older adults: the interaction of lexical competition and semantic context. , 1999, Psychology and aging.

[18]  Holly L Storkel,et al.  The emerging lexicon of children with phonological delays: phonotactic constraints and probability in acquisition. , 2004, Journal of speech, language, and hearing research : JSLHR.

[19]  P. Jusczyk,et al.  Phonotactic and Prosodic Effects on Word Segmentation in Infants , 1999, Cognitive Psychology.

[20]  P. Jusczyk,et al.  Phonotactic cues for segmentation of fluent speech by infants , 2001, Cognition.

[21]  George Hollich,et al.  Lexical Neighborhood Effects in 17-Month-Old Word Learning , 2002 .

[22]  Peter W. Jusczyk,et al.  How infants begin to extract words from speech , 1999, Trends in Cognitive Sciences.

[23]  D. Dirks,et al.  Effects of lexical factors on word recognition among normal-hearing and hearing-impaired listeners. , 2001, Journal of the American Academy of Audiology.

[24]  D. Kemmerer,et al.  Phonotactics and Syllable Stress: Implications for the Processing of Spoken Nonsense Words , 1997, Language and speech.

[25]  Mary E. Hughes,et al.  Boston University Conference on Language Development , 1979 .

[26]  H. Nusbaum Sizing up the Hoosier Mental Lexicon: Measuring the Familiarity of 20,000 Words, Research on Speech Perception , 1984 .

[27]  G. Zipf The Psycho-Biology Of Language: AN INTRODUCTION TO DYNAMIC PHILOLOGY , 1999 .

[28]  Rochelle S Newman,et al.  Effects of Lexical Factors on Lexical Access among Typical Language-Learning Children and Children with Word-Finding Difficulties , 2002, Language and speech.

[29]  G. Altmann,et al.  Cognitive Models of Speech Processing: Psycholinguistic and Computational Perspectives - Workshop Overview , 1989, AI Mag..

[30]  Jerome L. Myers,et al.  Research Design and Statistical Analysis , 1991 .

[31]  D. Pisoni,et al.  Recognizing Spoken Words: The Neighborhood Activation Model , 1998, Ear and hearing.

[32]  H. Kucera,et al.  Computational analysis of present-day American English , 1967 .

[33]  Jamie L. Metsala,et al.  An examination of word frequency and neighborhood density in the development of spoken-word recognition , 1997, Memory & cognition.

[34]  Michael S Vitevitch,et al.  The influence of phonological similarity neighborhoods on speech production. , 2002, Journal of experimental psychology. Learning, memory, and cognition.

[35]  David B. Pisoni,et al.  Lexical Effects on Spoken Word Recognition by Pediatric Cochlear Implant Users , 1995, Ear and hearing.

[36]  P. Luce,et al.  When Words Compete: Levels of Processing in Perception of Spoken Words , 1998 .

[37]  Judith A Gierut,et al.  Lexical organization and phonological change in treatment. , 2002, Journal of speech, language, and hearing research : JSLHR.

[38]  D. Kleinbaum,et al.  Applied Regression Analysis and Other Multivariate Methods , 1978 .

[39]  David B Pisoni,et al.  Perception of Wordlikeness: Effects of Segment Probability and Length on the Processing of Nonwords. , 2000, Journal of memory and language.

[40]  J. Metsala,et al.  Age-of-Acquisition, Word Frequency, and Neighborhood Density Effects on Spoken Word Recognition by Children and Adults , 2001 .

[41]  M. Beckman,et al.  The interaction between vocabulary size and phonotactic probability effects on children's production accuracy and fluency in nonword repetition. , 2004, Journal of speech, language, and hearing research : JSLHR.

[42]  David B. Pisoni,et al.  Similarity neighborhoods of spoken words , 1991 .

[43]  Rochelle S Newman,et al.  The impact of lexical factors on children's word-finding errors. , 2004, Journal of speech, language, and hearing research : JSLHR.

[44]  P. Luce,et al.  Probabilistic Phonotactics and Neighborhood Activation in Spoken Word Recognition , 1999 .

[45]  D. Pisoni,et al.  Phonotactics, Neighborhood Activation, and Lexical Access for Spoken Words , 1999, Brain and Language.

[46]  R. Shillcock,et al.  Competitor effects during lexical access: chasing Zipf's tail , 1993 .

[47]  R N Aslin,et al.  Statistical Learning by 8-Month-Old Infants , 1996, Science.

[48]  Benjamin Munson,et al.  Relationships between nonword repetition accuracy and other measures of linguistic development in children with phonological disorders. , 2005, Journal of speech, language, and hearing research : JSLHR.

[49]  H. Storkel,et al.  Learning new words: phonotactic probability in language development. , 2001, Journal of speech, language, and hearing research : JSLHR.

[50]  D. Pisoni,et al.  Effects of stimulus variability on speech perception in listeners with hearing impairment. , 1997, Journal of speech, language, and hearing research : JSLHR.

[51]  R. Miyamoto,et al.  Speech Perception in Children with Cochlear Implants: Effects of Lexical Difficulty, Talker Variability, and Word Length , 2000, The Annals of otology, rhinology & laryngology. Supplement.

[52]  J. Metsala,et al.  Spoken vocabulary growth and the segmental restructuring of lexical representations: precursors to p , 1998 .

[53]  P. Luce,et al.  Similarity neighborhoods of spoken two-syllable words: retroactive effects on multiple activation. , 1990, Journal of Experimental Psychology: Human Perception and Performance.

[54]  Judith A. Gierut,et al.  Lexical properties in implementation of sound change , 1998 .

[55]  John Coleman,et al.  Stochastic phonological grammars and acceptability , 1997, SIGMORPHON@EACL.

[56]  C. Frankish,et al.  Phonotactic influences on short-term memory. , 1999, Journal of experimental psychology. Learning, memory, and cognition.

[57]  E. Newport,et al.  PSYCHOLOGICAL SCIENCE Research Article INCIDENTAL LANGUAGE LEARNING: Ustening (and Learning) out of the Comer of Your Ear , 2022 .

[58]  M. L. Morrisette Lexical characteristics of sound change. , 1999, Clinical linguistics & phonetics.

[59]  Judith A. Gierut,et al.  Variables governing diffusion in phonological acquisition , 2001 .

[60]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[61]  David B. Pisoni,et al.  Speech perception, word recognition and the structure of the lexicon , 1985, Speech Commun..

[62]  H. Storkel Restructuring of similarity neighbourhoods in the developing mental lexicon , 2002, Journal of Child Language.

[63]  S. Goldinger,et al.  Priming Lexical Neighbors of Spoken Words: Effects of Competition and Inhibition. , 1989, Journal of memory and language.

[64]  Michael S Vitevitch,et al.  The facilitative influence of phonological similarity and neighborhood frequency in speech production in younger and older adults , 2003, Memory & cognition.