Memory-based lexical acquisition and processing

Current approaches to computational lexicology in language technology are knowledge-based (competence-oriented) and try to abstract away from specific formalisms, domains, and applications. This results in severe complexity, acquisition, and reusability bottlenecks. As an alternative, we propose a performance-oriented approach to Natural Language Processing based on automatic memory-based learning of linguistic (lexical) tasks. We discuss the consequences of this approach for computational lexicology and describe its application to a number of lexical acquisition and disambiguation tasks in phonology, morphology, and syntax.
