Minsky, Chomsky and Deep Nets

When Minsky and Chomsky were at Harvard in the 1950s, they began their careers by questioning a number of machine learning methods that have since regained popularity. Minsky's Perceptrons was a reaction to neural nets, and Chomsky's Syntactic Structures was a reaction to n-gram language models. Many of their objections have since been ignored or forgotten (perhaps for good reason, perhaps not). While their arguments may sound negative, I believe there is a more constructive way to think about their efforts: both were attempting to organize computational tasks into larger frameworks, such as what is now known as the Chomsky hierarchy, and algorithmic complexity. Section 5 will propose a similar organizing framework for deep nets.

Deep nets are probably not the solution to all the world's problems. They cannot do the impossible (e.g., solve the halting problem), and they are probably not well suited to tasks such as sorting large vectors or multiplying large matrices. In practice, deep nets have produced extremely exciting results in vision and speech, though other tasks may prove more challenging.
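The sorting example is easy to make concrete. A minimal sketch follows (assuming PyTorch; the network size, the fixed input length N=5, and the training range are arbitrary choices of mine, not anything from the literature above). A feed-forward net must commit to a fixed input dimension, and even on that fixed size it tends to degrade on values outside its training distribution, whereas a conventional sorting algorithm handles both cases trivially.

    import torch
    import torch.nn as nn

    torch.manual_seed(0)
    N = 5  # the net is hard-wired to length-5 inputs; sorted() is not

    # Small MLP that maps a length-N vector to a (hopefully sorted) length-N vector.
    net = nn.Sequential(nn.Linear(N, 64), nn.ReLU(), nn.Linear(64, N))
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)

    # Train on vectors drawn from [0, 1); the target is the sorted vector.
    for step in range(5000):
        x = torch.rand(256, N)
        y = torch.sort(x, dim=1).values
        loss = nn.functional.mse_loss(net(x), y)
        opt.zero_grad()
        loss.backward()
        opt.step()

    def mse_on(scale):
        # Evaluate on inputs scaled outside (or inside) the training range.
        x = torch.rand(1000, N) * scale
        y = torch.sort(x, dim=1).values
        with torch.no_grad():
            return nn.functional.mse_loss(net(x), y).item()

    print("in-range [0, 1):      MSE =", mse_on(1.0))    # small
    print("out-of-range [0, 100): MSE =", mse_on(100.0))  # typically much larger

The point of the sketch is not that a net cannot approximate sorting on a fixed, in-distribution slice of the problem, but that the algorithmic task (any length, any values, exact output) is a different kind of object than the function a fixed-size net computes.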