Simple Recurrent Networks and Natural Language: How Important is Starting Small?

Prediction is believed to be an important component of cognition, particularly in natural language processing. It has long been accepted that recurrent neural networks are best able to learn prediction tasks when trained first on simple sentences and only then, incrementally, on more complex ones. Furthermore, the counter-intuitive suggestion has been made that networks, and by implication humans, may be aided in learning by limited cognitive resources (Elman, 1993, Cognition). The current work reports evidence that starting with simplified inputs is not necessary for training recurrent networks to learn pseudo-natural languages; in fact, the delayed introduction of complex examples is often an impediment. We suggest that the structure of natural language can be learned without special teaching methods or limited cognitive resources.
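
The contrast at issue can be made concrete with a small, purely illustrative sketch (not the simulations reported in the paper): an Elman-style simple recurrent network trained on next-word prediction under either regimen, "starting small" on simple sentences before complex ones are introduced, or receiving the full mix of sentence types from the outset. The toy grammar, vocabulary, network size, learning rate, training length, and halfway switch point below are assumptions made for illustration only.

import numpy as np

# Toy vocabulary; "simple" sentences have no embedding, "complex" ones contain
# a relative clause (hypothetical stand-ins for the pseudo-natural languages
# used in the actual experiments).
VOCAB = ["boy", "girl", "dog", "who", "chases", "sees", "runs", "."]
IDX = {w: i for i, w in enumerate(VOCAB)}
SIMPLE = [["boy", "runs", "."], ["girl", "sees", "dog", "."]]
COMPLEX = [["boy", "who", "chases", "dog", "runs", "."],
           ["girl", "who", "sees", "boy", "runs", "."]]
V, H, LR = len(VOCAB), 16, 0.1      # vocabulary size, hidden units, learning rate

def one_hot(w):
    v = np.zeros(V)
    v[IDX[w]] = 1.0
    return v

class SRN:
    """Elman-style simple recurrent network for next-word prediction."""
    def __init__(self, seed=0):
        rng = np.random.default_rng(seed)
        self.Wxh = rng.normal(0, 0.1, (H, V))   # input -> hidden
        self.Whh = rng.normal(0, 0.1, (H, H))   # context (previous hidden) -> hidden
        self.Why = rng.normal(0, 0.1, (V, H))   # hidden -> output

    def train_sentence(self, sent):
        """One pass over a sentence with one-step (truncated) backpropagation."""
        h, loss = np.zeros(H), 0.0
        for t in range(len(sent) - 1):
            x, target = one_hot(sent[t]), IDX[sent[t + 1]]
            h_prev = h
            h = np.tanh(self.Wxh @ x + self.Whh @ h_prev)     # recurrent hidden state
            logits = self.Why @ h
            p = np.exp(logits - logits.max()); p /= p.sum()   # softmax over next word
            loss -= np.log(p[target])
            dlogits = p.copy(); dlogits[target] -= 1.0        # cross-entropy gradient
            dh = (self.Why.T @ dlogits) * (1 - h ** 2)
            self.Why -= LR * np.outer(dlogits, h)
            self.Wxh -= LR * np.outer(dh, x)
            self.Whh -= LR * np.outer(dh, h_prev)
        return loss / (len(sent) - 1)

def run_regimen(starting_small, epochs=200):
    """Train either 'starting small' (simple sentences only for the first half,
    complex ones added afterward) or on the full mix from the outset."""
    net = SRN()
    for epoch in range(epochs):
        batch = SIMPLE if (starting_small and epoch < epochs // 2) else SIMPLE + COMPLEX
        losses = [net.train_sentence(s) for s in batch]
    return float(np.mean(losses))   # mean prediction loss on the final (full) epoch

print("starting small :", run_regimen(True))
print("full complexity:", run_regimen(False))

Both regimens end on the same mixed batch, so the final losses are comparable; the point of the sketch is only to show the two training schedules side by side, not to reproduce the paper's results.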