Some statistical methods in machine intelligence research

Abstract About a dozen examples are given of the use of statistical methods in research on machine intelligence, most, though not all, previously known, but not previously brought together. The topics include the application of rationality to the research as a whole; the trading of immediate gain for information; adaptive control without the identification of a model, by using smoothing techniques; phoneme recognition using distinctive features and their derivatives; the compiling of dictionaries; “botryology” or concept formation by clump finding; information retrieval; medical diagnosis; game playing and its relationship to theorem proving; design of an alphabet or of a vocabulary; and artificial neural networks. Among the statistical themes that are emphasized are the estimation of probabilities; the use of amounts of information and of evidence as substitutes for utility when utility is difficult to estimate; decision trees; “evolving” probabilities; and maximum, minimum, and minimax entropy in diagnosis. In this survey of methods it has been necessary at several points to make do with references to the literature.

[1]  M. Bartlett Contingency Table Interactions , 1935 .

[2]  W. Vogel An Asymptotic Minimax Theorem for the Two Armed Bandit Problem , 1960 .

[3]  L. A. Goodman Interactions in Multidimensional Contingency Tables , 1964 .

[4]  D. Lindley On a Measure of the Information Provided by an Experiment , 1956 .

[5]  I. Good,et al.  Utility of a Distribution , 1968, Nature.

[6]  H. Robbins Some aspects of the sequential design of experiments , 1952 .

[7]  H Robbins,et al.  A SEQUENTIAL DECISION PROBLEM WITH A FINITE MEMORY. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[8]  F. Yates Design and Analysis of Factorial Experiments , 1958 .

[9]  Bror Zachrisson,et al.  Studies in the legibility of printed text , 1965 .

[10]  I. Good,et al.  THE NUMBER OF NEW SPECIES, AND THE INCREASE IN POPULATION COVERAGE, WHEN A SAMPLE IS INCREASED , 1956 .

[11]  I. J. Good,et al.  Speculations Concerning the First Ultraintelligent Machine , 1965, Adv. Comput..

[12]  H BRODY,et al.  Organization of the cerebral cortex. III. A study of aging in the human cerebral cortex , 1955, The Journal of comparative neurology.

[13]  W. R. Thompson ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[14]  J. J. Kelly A new interpretation of information rate , 1956 .

[15]  I. Good Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables , 1963 .

[16]  W. J. McGill Multivariate information transmission , 1954, Trans. IRE Prof. Group Inf. Theory.

[17]  W. A. Clark,et al.  Simulation of self-organizing systems by digital computer , 1954, Trans. IRE Prof. Group Inf. Theory.

[18]  W. T. WILLIAMS,et al.  Logic of Computer-Based Intrinsic Classifications , 1965, Nature.

[19]  Square-root Law for Solving Two-ended Problems , 1966, Nature.

[20]  W. R. Thompson On the Theory of Apportionment , 1935 .

[21]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[22]  P M MILNER,et al.  The cell assembly: Mark II. , 1957, Psychological review.

[23]  Allen Newell,et al.  Chess-Playing Programs and the Problem of Complexity , 1958, IBM J. Res. Dev..

[24]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[25]  W. Vogel A Sequential Design for the Two Armed Bandit , 1960 .

[26]  Raymond E. Bonner,et al.  On Some Clustering Techniques , 1964, IBM J. Res. Dev..

[27]  L. J. Savage,et al.  Probability and the weighing of evidence , 1951 .

[28]  I. Good How Rational Should a Manager be , 1962 .

[29]  M. J. Rose,et al.  Classification of a set of elements , 1964, Comput. J..