论文信息 - Learning language using genetic algorithms

Learning language using genetic algorithms

Strict pattern-based methods of grammar induction are often frustrated by the apparently inexhaustible variety of novel word combinations in large corpora. Statistical methods offer a possible solution by allowing frequent well-formed expressions to overwhelm the infrequent ungrammatical ones. They also have the desirable property of being able to construct robust grammars from positive instances alone. Unfortunately, the “zero-frequency” problem entails assigning a small probability to all possible word patterns, thus ungrammatical n-grams become as probable as unseen grammatical ones. Further, such grammars are unable to take advantage of inherent lexical properties that should allow infrequent words to inherit the syntactic properties of the class to which they belong.

Ian H. Witten | Tony C. Smith | I. Witten | T. Smith

[1] Julian Kupiec,et al. Augmenting a Hidden Markov Model for Phrase-Dependent Word Tagging , 1989, HLT.

[2] Eugene Charniak,et al. Statistical language learning , 1997 .

[3] Christopher Gauker. Language and Reality: An Introduction to the Philosophy of Language , 1987 .

[4] Peter J. Wyard. Context Free Grammar Induction Using Genetic Algorithms , 1991, ICGA.

[5] Riva Wenig Bickel,et al. Tree Structured Rules in Genetic Algorithms , 1987, ICGA.

[6] E. Mark Gold,et al. Language Identification in the Limit , 1967, Inf. Control..

[7] Craig G. Nevill-Manning,et al. Compression by induction of hierarchical grammars , 1994, Proceedings of IEEE Data Compression Conference (DCC'94).

[8] Hermann Moisl,et al. Connectionist Finite State Natural Language Processing , 1992 .

[9] Ian H. Witten,et al. Probability-Driven Lexical Classification: A Corpus-Based Approach , 1995 .

[10] Lawrence Davis,et al. Genetic Algorithms and Simulated Annealing , 1987 .

[11] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[12] Michael L. Mauldin,et al. Maintaining Diversity in Genetic Search , 1984, AAAI.

[13] S. Stich. Grammar, Psychology, and Indeterminacy , 1972 .

[14] John E. Rager,et al. A Connectionist Model of Motion and Government in Chomsky's Government-binding Theory , 1990 .