Restrictiveness and Phonological Grammar and Lexicon Learning

1 Introduction A central problem of language learnability is the learning of restrictive grammars, grammars that generate all the observed forms but as few others as possible. Given only positive evidence, there are many grammars consistent with the observed data, and the learner must select the most restrictive grammar among these. If the learner mistakenly adopts a broader grammar, no positive evidence will contradict this decision since the broader grammar is consistent with all the positive evidence plus additional data. This is known as the subset problem a well-known solution to the restrictiveness problem is a ranking bias, a preference for a particular relative ranking between certain constraint types. Ranking biases have been successfully implemented in algorithms that focus on the learning of a language-particular ranking given the correct underlying forms point out, however, when the full problem of learning rankings and underlying forms is considered, ranking biases are not sufficient to identify restrictive combinations of grammars and lexicons due to the interdependence of grammar and lexicon learning. This paper presents an alternative solution to the restrictiveness problem: Maximum Likelihood Learning of Lexicons and Grammars (MLG; Jarosz 2006). MLG subsumes the effects of ranking biases and naturally extends to the full phonological learning problem, identifying restrictive grammar and lexicon combinations. Rather than using ranking biases to define the relative restrictiveness of multiple analyses of the same data, MLG relies on the likelihood, or probability, that each grammar and lexicon combination assigns to the data. The likelihood provides an explicit measure of how well the grammar and lexicon explain the data, an objective function that may be maximized using a well-known statistical learning algorithm. The paper is organized as follows. Section 2 briefly overviews the use of ranking biases in OT learning. Section 3 introduces MLG, discusses its ability to identity restrictive grammar and lexicon combinations, and explains how the principles of MLG are implemented in the simulations presented in this paper. Section 4 discusses a series of simulations illustrating MLG's capacity to learn restrictive grammar and lexicon combinations. In Section 4.1, simulations illustrate MLG's capacity to derive the effects of the three ranking biases. Sections 4.2 and 4.3 discuss a system with grammar-lexicon interaction for which

[1]  P. Smolensky The Initial State and 'Richness of the Base' in Optimality Theory , 1996 .

[2]  Dana Angluin,et al.  Inductive Inference of Formal Languages from Positive Data , 1980, Inf. Control..

[3]  J. McCarthy Morpheme structure constraints and paradigm occultation , 1998 .

[4]  Gaja Jarosz,et al.  Rich Lexicons and Restrictive Grammars: Maximum Likelihood Learning in Optimality Theory , 2006 .

[5]  Anne-Michelle Tessier,et al.  Biases and Stages in Phonological Acquisition , 2008 .

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  René Kager,et al.  Markedness and faithfulness constraints in child phonology , 2004 .

[8]  P. Jusczyk,et al.  Infants′ Sensitivity to the Sound Patterns of Native Language Words , 1993 .

[9]  J. Werker,et al.  Infants listen for more phonetic detail in speech perception than in word-learning tasks , 1997, Nature.

[10]  Robert C. Berwick,et al.  The acquisition of syntactic knowledge , 1985 .

[11]  Jennifer L. Smith,et al.  Positional Faithfulness and Learnability in Optimality Theory , 2002 .

[12]  P. Boersma,et al.  Empirical Tests of the Gradual Learning Algorithm , 2001, Linguistic Inquiry.

[13]  B. Tesar Learning covert phonological interaction : an analysis of the problem posed by the interaction of stress and epenthesis , 2002 .

[14]  Bruce Tesar,et al.  Computational optimality theory , 1996 .

[15]  J. Werker,et al.  The Perceptual Acquisition of Phonological Contrasts , 2004 .

[16]  J. McCarthy Taking a Free Ride in Morphophonemic Learning , 2005 .

[17]  P. Smolensky,et al.  Optimality Theory: Constraint Interaction in Generative Grammar , 2004 .

[18]  A. Friederici,et al.  Phonotactic knowledge of word boundaries and its use in infant speech perception , 1993, Perception & psychophysics.

[19]  Wim Zonneveld Phonological Acquisition in Optimality Theory: The Early Stages , 1999 .

[20]  J. Berko The Child's Learning of English Morphology , 1958 .

[21]  A. Gnanadesikan Constraints in Phonological Acquisition: Markedness and faithfulness constraints in child phonology , 2004 .