论文信息 - Patterns hidden from simple algorithms

Patterns hidden from simple algorithms

Is the number 9021960864034418159813 random? Educated opinions might vary from “No! No single string can be random”, to the more contemptuous “Come on! Those are just the 714th to 733rd digits of π”. Yet, to my limited mind, the string did appear random. Is there a way to use some formal mathematics to justify my naivete? The modern theory of pseudorandomness [2, 5] indeed manages to explain such phenomena, where strings “appear” random to simple minds. The key, this theory argues, is that randomness is really in the “eyes of the beholder”, or rather in the computing power of the “tester” of randomness. More things appear random to simpler, or resource-limited, algorithms than to complex, powerful, algorithms. And why should we care? Because randomness is a key resource in computation. Monte Carlo simulations abound in the use of computation for prediction. On the theoretical side too, algorithms get simplified or speeded up incredibly if they use randomness. Fundamental tasks in distributed computing (such as “synchronization”) can be solved with, and only with, randomness. And there is no hope for maintaining privacy and secrecy in the absence of randomness. At the same time much of the randomness we deal with in reality is not “pure”. They don’t come as a collection of independent random bits, but are generated by other processes. Knowing how an algorithm, or a computational process, works in the presence of somewhat random strings becomes the essence of a recurring question. (For example, should you really trust nuclear power safety calculations made by a Monte-Carlo algorithm using randomness from the C++ rand program?) Questions such as the above get formalized in the theory of pseudo-randomness as follows: It considers some source of randomness X (formally given by some probability distribution over n-bit strings), and a class of “Boolean” algorithms A (algorithms that decide Boolean questions) and asks if some algorithm in A behaves very differently given access to a random string generated by X, than it does on pure random strings? If every algorithm in A behaves roughly the same on X as on pure randomness, we say X is pseudo-random to A. In the example above, X = Xπ may be the source that picks a random integer i between 1 and 10000 and outputs π[i + 1], . . . , π[i + 20] where π[j] denotes the jth digit of π. Given some class of algorithms A one could now ask, does Xπ look pseudo-random to A? Unfortunately answering such questions leads to a fundamental challenge in the theory of computing. Proving, say, that Xπ looks random to A involves showing that no algorithm in A can detect patterns that the digits of π show. And proving that some class of algorithms can’t do some task is a major challenge to theoretical computer science—with “Is P = NP?” question being the most notorious example.

Madhu Sudan | M. Sudan

[1] Andrew Chi-Chih Yao,et al. Theory and Applications of Trapdoor Functions (Extended Abstract) , 1982, FOCS.

[2] Noam Nisan,et al. Approximate Inclusion-Exclusion , 1990, Comb..

[3] Manuel Blum,et al. How to generate cryptographically strong sequences of pseudo random bits , 1982, 23rd Annual Symposium on Foundations of Computer Science (sfcs 1982).

[4] Louay Bazzi,et al. Polylogarithmic Independence Can Fool DNF Formulas , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[5] Noam Nisan,et al. Pseudorandom generators for space-bounded computations , 1990, STOC '90.

[6] Noam Nisan,et al. Pseudorandom generators for space-bounded computation , 1992, Comb..