Computational Consequences of Agreement and Ambiguity in Natural Language

The computer science technique of computational complexity analysis can provide powerful insights into the algorithm-neutral analysis of information processing tasks. Here we show that a simple, theory-neutral linguistic model of syntactic agreement and ambiguity demonstrates that natural language parsing may be computationally intractable. Significantly, we show that it may be syntactic features rather than rules that can cause this difficulty. Informally, human languages and the computationally intractable Satisfiability (SAT) problem share two costly computional mechanisms: both enforce agreement among symbols across unbounded distances (Subject-Verb agreement) and both allow ambiguity (is a word a Noun or a Verb?).

[1]  Christos H. Papadimitriou,et al.  The complexity of recognizing polyhedral scenes , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[2]  G. A. Miller,et al.  Finitary models of language users , 1963 .

[3]  Noam Chomsky,et al.  Rules and representations , 1980, Behavioral and Brain Sciences.

[4]  Eugene Galanter,et al.  Handbook of mathematical psychology: I. , 1963 .

[5]  George A. Miller,et al.  Introduction to the Formal Analysis of Natural Languages , 1968 .

[6]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[7]  Robert C. Berwick,et al.  Computational complexity and natural language , 1987 .