Discovering underlying forms: Contrast pairs and ranking

Phonological learners must acquire a lexicon of underlying forms and a constraint ranking. These must be acquired simultaneously, as the ranking and the underlying forms are interdependent. Exhaustive search of all possible lexica is intractable; the space of lexica is simply too large. Searching the underlying forms for each overt form in isolation poses other problems. A single overt form is often highly ambiguous among both underlying forms and rankings. In this dissertation I propose a learning algorithm that attends to pairs of overt forms that differ in exactly one morpheme. These pairs can exhibit less ambiguity than the isolated overt forms, while still providing a reduced search space. The algorithm first assigns underlying values to occurrences of features whose surface realization never alternates; the other underlying features are left initially unset (Tesar et al., 2003). Pairs of overt forms that differ in one morpheme are then constructed. The algorithm then considers the possible values of unset features for each pair, processing pairs with the fewest unset features first. It uses inconsistency detection (Tesar, 1997) to test sets of values of unset features for viability. A set of values for the unset features is viable if it produces the correct overt forms under some ranking. Those feature values which are common across all viable solutions are then set. In the process of testing for inconsistency for each set of values of unset features a set of winner-loser pairs is generated. The learner determines the ranking restrictions jointly entailed by these sets of winner-loser pairs. These ranking restrictions are then maintained while processing all further contrast pairs. After all pairs have been processed, any still unset feature values are assigned default values. The general success of the algorithm depends upon these features being fully predictable in the output. A ranking is then obtained from this lexicon using Biased Constraint Demotion (Prince and Tesar, 2004). Fixing all non-alternating features reduces the effective lexical search space. The algorithm further reduces the lexical search space by breaking up the search into tractable local pair searches. Extracting shared ranking information from winner-loser pairs generated from inconsistency detection restricts which featural combinations for future contrast pairs will be viable providing information that is otherwise unavailable to the learner.%%%%

[1]  J. McCarthy Taking a Free Ride in Morphophonemic Learning , 2005 .

[2]  B. Tesar,et al.  Learning underlying forms by searching restricted lexical subspaces , 2005 .

[3]  Diana Apoussidou,et al.  The learnability of metrical phonology , 2007 .

[4]  Gaja Jarosz,et al.  Rich Lexicons and Restrictive Grammars: Maximum Likelihood Learning in Optimality Theory , 2006 .

[5]  Bruce Tesar,et al.  Surgery in Language Learning , 2003 .

[6]  P. Smolensky,et al.  Optimality Theory: Constraint Interaction in Generative Grammar , 2004 .

[7]  Kenneth Wexler,et al.  Formal Principles of Language Acquisition , 1980 .

[8]  Bruce Tesar,et al.  Learning phonotactic distributions , 2004 .

[9]  P. Boersma,et al.  Empirical Tests of the Gradual Learning Algorithm , 2001, Linguistic Inquiry.

[10]  Alan S. Prince,et al.  Faithfulness and reduplicative identity , 1995 .

[11]  B. Hayes,et al.  Phonological Acquisition in Optimality Theory: The Early Stages 1 , 1999 .

[12]  Dana Angluin,et al.  Inductive Inference of Formal Languages from Positive Data , 1980, Inf. Control..

[13]  Bruce Tesar,et al.  Contrast Analysis Aids the Learning of Phonological Underlying Forms , 2005 .

[14]  Alan S. Prince,et al.  Generalized alignment , 1993 .

[15]  Mark S. Hewitt,et al.  Quantitative Consequences of Rhythmic Organization , 2002 .

[16]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[17]  Wim Zonneveld,et al.  Learning phonotactic distributions , 2004 .

[18]  Bruce Tesar,et al.  Learning from Paradigmatic Information , 2006 .

[19]  Bruce Tesar,et al.  Multi-Recursive Constraint Demotion , 2000 .

[20]  Bruce Tesar,et al.  Computational optimality theory , 1996 .

[21]  Bruce Tesar,et al.  Contrast Analysis in Phonological Learning 1 Contrast Analysis in Phonological Learning , 2004 .

[22]  Paul Smolensky,et al.  Optimality Theory: Constraint Interaction in Generative Grammar ; CU-CS-696-93 , 1993 .

[23]  Joe Pater,et al.  Morpheme-Specific Phonology: Constraint Indexation and Inconsistency Resolution , 2009 .

[24]  B. Hayes,et al.  Phonological Acquisition in Optimality Theory: the Early Stages 1 Submitted for a Forthcoming Volume on Phonological Acquisition and Typology, Edited Phonological Acquisition in Optimality Theory: the Early Stages , 1999 .

[25]  Michael Hammond,et al.  Gradience, Phonotactics and the Lexicon in English Phonology , 2004 .

[26]  A. E. Albright,et al.  The identification of bases in morphological paradigms , 2002 .

[27]  Ricardo Bermúdez-Otero,et al.  The acquisition of phonological opacity , 2003 .

[28]  Diana Apoussidou,et al.  On-line learning of underlying forms , 2006 .