Computational models for first language acquisition

This work investigates a computational model of first language acquisition; the Categorial Grammar Learner or CGL. The model builds on the work of Villavicenio, who created a parametric Categorial Grammar learner that organises its parameters into an inheritance hierarchy, and also on the work of Buszkowski and Kanazawa, who demonstrated the learnability of a k-valued Classic Categorial Grammar (which uses only the rules of function application) from strings. The CGL is able to learn a k-valued General Categorial Grammar (which uses the rules of function application, function composition and Generalised Weak Permutation). The novel concept of Sentence Objects (simple strings, augmented strings, unlabelled structures and functor-argument structures) are presented as potential points from which learning may commence. Augmented strings (which are strings augmented with some basic syntactic information) are suggested as a sensible input to the CGL as they are cognitively plausible objects and have greater information content than strings alone. Building on the work of Siskind, a method for constructing augmented strings from unordered logic forms is detailed and it is suggested that augmented strings are simply a representation of the constraints placed on the space of possible parses due to a string’s associated semantic content. The CGL makes crucial use of a statistical Memory Module (constructed from a Type Memory and Word Order Memory) that is used to both constrain hypotheses and handle data which is noisy or parametrically ambiguous. A consequence of the Memory Module is that the CGL learns in an incremental fashion. This echoes real child learning as documented in Brown’s Stages of Language Development and also as alluded to by an included corpus study of child speech. Furthermore, the CGL learns faster when initially presented with simpler linguistic data; a further corpus study of child-directed speech suggests that this echos the input provided to children. The CGL is demonstrated to learn from real data. It is evaluated against previous parametric learners (the Triggering Learning Algorithm of Gibson and Wexler and the Structural Triggers Learner of Fodor and Sakas) and is found to be more efficient.

[1]  R. Higginson Fixing : assimilation in language acquisition , 1985 .

[2]  J. Piaget The construction of reality in the child , 1954 .

[3]  Suresh Manandhar,et al.  Unsupervised Lexical Learning with Categorial Grammars , 1999 .

[4]  Letitia R. Naigles,et al.  Children use syntax to learn verb meanings , 1990, Journal of Child Language.

[5]  A. Kroch Reflexes of grammar in patterns of language change , 1989, Language Variation and Change.

[6]  Janet Dean Fodor,et al.  Language Acquisition and Learnability: The Structural Triggers Learner , 2001 .

[7]  Ralph Grishman,et al.  Standardization of the Complement/Adjunct Distinction , 1996 .

[8]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[9]  S. Curtiss Genie: A Psycholinguistic Study of a Modern-Day "Wild Child" , 1977 .

[10]  J. Berko The Child's Learning of English Morphology , 1958 .

[11]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[12]  Eve V. Clark,et al.  The principle of contrast: A constraint on language acquisition. , 1987 .

[13]  Ted Briscoe,et al.  Robust Accurate Statistical Annotation of General Text , 2002, LREC.

[14]  R. Clark The Selection of Syntactic Knowledge , 1992 .

[15]  Hagit Borer,et al.  Parametric Syntax: Case Studies in Semitic and Romance Languages , 1984 .

[16]  L. Gleitman,et al.  Language and Experience: Evidence from the Blind Child , 1988 .

[17]  Ivan A. Sag,et al.  Information-based syntax and semantics , 1987 .

[18]  Franz Baader,et al.  Unification theory , 1986, Decis. Support Syst..

[19]  E. Markman,et al.  When it is better to receive than to give: Syntactic and conceptual constraints on vocabulary growth , 1994 .

[20]  P. Niyogi,et al.  Learning from triggers , 1996 .

[21]  Catherine E. Snow,et al.  Language acquisition: Conversations with children , 1986 .

[22]  Makoto Kanazawa Learnable Classes of Categorial Grammars , 1998 .

[23]  Robert C. Berwick,et al.  Learning Word Meanings From Examples , 1983, IJCAI.

[24]  B. Dresher,et al.  A computational learning model for metrical phonology , 1990, Cognition.

[25]  Michael R. Brent,et al.  Speech segmentation and word discovery: a computational , 1999 .

[26]  Mark Steedman Type-Raising and Directionality in Combinatory Grammar , 1991, ACL.

[27]  M.M. Nice,et al.  Length of sentences as a criterion of a child’s progress in speech , 1925 .

[28]  J. Elman Distributed representations, simple recurrent networks, and grammatical structure , 1991, Machine Learning.

[29]  David Lightfoot,et al.  Principles of diachronic syntax , 1979 .

[30]  Mark Steedman,et al.  The syntactic process , 2004, Language, speech, and communication.

[31]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[32]  D. Ingram First Language Acquisition: Method, Description and Explanation , 1989 .

[33]  Yuval Krymolowski,et al.  On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems , 2002, CoNLL.

[34]  D. Bickerton The language bioprogram hypothesis , 1984, Behavioral and Brain Sciences.

[35]  H. Somers,et al.  On the validity of the complement-adjunct distinction in valency grammar , 1984 .

[36]  J. Pine Input and interaction in language acquisition: The language of primary caregivers , 1994 .

[37]  M. R. Manzini Learnability and Cognition , 1991 .

[38]  M. B. Emeneau Foundations of Language. Louis H. Gray , 1940 .

[39]  Ralph Grishman,et al.  Comlex Syntax: Building a Computational Lexicon , 1994, COLING.

[40]  Geoffrey Leech,et al.  100 Million Words of English:The British National Corpus (BNC) , 1992 .

[41]  Gerald Penn,et al.  Categorial grammars determined from linguistic data by unification , 1990, Stud Logica.

[42]  Ted Briscoe,et al.  Automatic Extraction of Subcategorization from Corpora , 1997, ANLP.

[43]  Ted Briscoe,et al.  The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English , 1987, ACL.

[44]  Barbara B. Levin,et al.  English verb classes and alternations , 1993 .

[45]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[46]  R. R. Bush,et al.  A Mathematical Model for Simple Learning , 1951 .

[47]  R. Brown,et al.  A First Language , 1973 .

[48]  Daniel Jurafsky,et al.  How Verb Subcategorization Frequencies Are Affected By Corpus Choice , 1998, COLING.

[49]  Virginia Valian,et al.  Logical and Psychological Constraints on the Acquisition of Syntax , 1990 .

[50]  Raymond J. Mooney,et al.  Using Multiple Clause Constructors in Inductive Logic Programming for Semantic Parsing , 2001, ECML.

[51]  James R. Curran,et al.  Log-Linear Models for Wide-Coverage CCG Parsing , 2003, EMNLP.

[52]  D. Bishop,et al.  Language development in exceptional circumstances , 1988 .

[53]  E. Lenneberg Biological Foundations of Language , 1967 .

[54]  P. Suppes The Semantics of Children's Language , 1974 .

[55]  Raymond J. Mooney,et al.  Learning to Parse Database Queries Using Inductive Logic Programming , 1996, AAAI/IAAI, Vol. 2.

[56]  김두식,et al.  English Verb Classes and Alternations , 2006 .

[57]  C. Spearman The proof and measurement of association between two things. By C. Spearman, 1904. , 1987, The American journal of psychology.

[58]  Polska Akademia Nauk,et al.  Bulletin of the Polish Academy of Sciences. Mathematics. , 1983 .

[59]  A. B. Alcott Conversations with Children on the Gospels , 1830 .

[60]  Ted Briscoe Linkk Oping Electronic Articles in the Acquisition of Grammar in an Evolving Population of Language Agents Linkk Oping Electronic Articles in Computer and Information Science , 1999 .

[61]  L. Gleitman The Structural Sources of Verb Meanings , 2020, Sentence First, Arguments Afterward.

[62]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[63]  J. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings , 1996, Cognition.

[64]  Noam Chomsky,et al.  Rules and Representations , 1982 .

[65]  Richard S. Kayne Parameters and universals , 2000 .

[66]  Steven Pinker,et al.  On language and connectionism , 1988 .

[67]  J. Hawkins Explaining Language Universals , 1988 .

[68]  S. Pinker The language instinct : how the mind creates language , 1995 .

[69]  Eve V. Clark,et al.  First Language Acquisition , 2002, The Study of Language.

[70]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[71]  Ted Briscoe,et al.  Learning Stochastic Categorial Grammars , 1997, CoNLL.

[72]  Ray Jackendoff,et al.  X Syntax: A Study of Phrase Structure , 1980 .

[73]  Janet Dean Fodor,et al.  Parsing to Learn , 1998 .

[74]  K. Wexler Very early parameter setting and the unique checking constraint: a new explanation of the optional infinitive stage: a new explanation of the optional infinitive stage , 1998 .

[75]  Amye Warren Sex differences in speech to children , 1982 .

[76]  Charles D. Yang,et al.  Knowledge and learning in natural language , 2000 .

[77]  Ted Briscoe,et al.  The Signicance of Errors to Parametric Models of Language Acquisition , 2004 .

[78]  S. Pinker,et al.  The Language Instinct: How the Mind Creates Language , 1994 .

[79]  Lillian Lee,et al.  Measures of Distributional Similarity , 1999, ACL.

[80]  William Gregory Sakas,et al.  Ambiguity and the computational feasibility of syntax acquisition , 2000 .

[81]  Noam Chomsky,et al.  Lectures on Government and Binding , 1981 .

[82]  James L. McClelland,et al.  Mechanisms of Sentence Processing: Assigning Roles to Constituents of Sentences , 1986 .

[83]  Luke S. Zettlemoyer,et al.  Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.

[84]  J. Elman Distributed Representations, Simple Recurrent Networks, And Grammatical Structure , 1991 .

[85]  S. Pinker How could a child use verb syntax to learn verb semantics , 1994 .

[86]  S. Pinker Words and Rules: The Ingredients of Language , 1999 .

[87]  Martha Jo-Ann. Demetras,et al.  WORKING PARENTS' CONVERSATIONAL RESPONSES TO THEIR TWO-YEAR-OLD SONS (LINGUISTIC INPUT, LANGUAGE ACQUISITION). , 1986 .

[88]  A. Korhonen,et al.  Large Scale Analysis of Verb Subcategorization differences between Child Directed Speech and Adult Speech , .

[89]  Bambi B. Schieffelin,et al.  The acquisition of Kaluli. , 1985 .

[90]  J. Macnamara Names for Things: A Study in Human Learning , 1984 .

[91]  Aline Villavicencio The acquisition of a unification-based generalised categorial grammar , 2002 .

[92]  Daniel Jurafsky,et al.  Verb Subcategorization Frequency Differences between Business- News and Balanced Corpora: The Role of Verb Sense , 2000, ACL 2000.

[93]  Raymond J. Mooney,et al.  Learning Semantic Grammars with Constructive Inductive Logic Programming , 1993, AAAI.

[94]  Raymond J. Mooney,et al.  Acquiring Word-Meaning Mappings for Natural Language Interfaces , 2011, J. Artif. Intell. Res..

[95]  Janet D. Fodor Unambiguous Triggers , 1998, Linguistic Inquiry.