Bayesian Classification

This paper describes a Bayesian technique for unsupervised classification of data and its computer implementation, AutoClass. Given real valued or discrete data, AutoClass determines the most probable number of classes present in the data, the most probable descriptions of those classes, and each object's probability of membership in each class. The program performs as well as or better than other automatic classification systems when run on the same data and contains no ad hoc similarity measures or stopping criteria. AutoClass has been applied to several databases in which it has discovered classes representing previously unsuspected phenomena.

[1]  H. Robbins,et al.  Asymptotic Solutions of the Compound Decision Problem for Two Completely Specified Distributions , 1955 .

[2]  H. Rubin UNIFORM CONVERGENCE OF RANDOM FUNCTIONS WITH APPLICATIONS TO STATISTICS , 1956 .

[3]  J. B. Scarborough Numerical Mathematical Analysis , 1931 .

[4]  Irving John Good,et al.  The Estimation of Probabilities: An Essay on Modern Bayesian Methods , 1965 .

[5]  D. Boes On the Estimation of Mixing Distributions , 1966 .

[6]  R. R. Bahadur A Note on Quantiles in Large Samples , 1966 .

[7]  C. S. Wallace,et al.  An Information Measure for Classification , 1968, Comput. J..

[8]  J. Wolfe PATTERN CLUSTERING BY MULTIVARIATE MIXTURE ANALYSIS. , 1970, Multivariate behavioral research.

[9]  M. Chao The Asymptotic Behavior of Bayes' Estimators , 1970 .

[10]  Richard A. Johnson Asymptotic Expansions Associated with Posterior Distributions , 1970 .

[11]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  A. J. Collins,et al.  Introduction To Multivariate Analysis , 1981 .

[14]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[15]  B. Everitt,et al.  Finite Mixture Distributions , 1981 .

[16]  E. T. Jaynes,et al.  Papers on probability, statistics and statistical physics , 1983 .

[17]  Ryszard S. Michalski,et al.  Automated Construction of Classifications: Conceptual Clustering Versus Numerical Taxonomy , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  R. Michalski,et al.  Learning from Observation: Conceptual Clustering , 1983 .

[19]  C. S. Wallace,et al.  A General Selection Criterion for Inductive Inference , 1984, ECAI.

[20]  E. T. Jaynes,et al.  BAYESIAN METHODS: GENERAL BACKGROUND ? An Introductory Tutorial , 1986 .

[21]  C. S. Wallace,et al.  Estimation and Inference by Compact Coding , 1987 .

[22]  Douglas H. Fisher,et al.  Conceptual Clustering, Learning from Examples, and Inference , 1987 .

[23]  RICHARD C. DUBES,et al.  How many clusters are best? - An experiment , 1987, Pattern Recognit..

[24]  Pat Langley,et al.  Hill-Climbing Theories of Learning , 1987 .

[25]  James Kelly,et al.  AutoClass: A Bayesian Classification System , 1993, ML.

[26]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[27]  R. T. Cox Probability, frequency and reasonable expectation , 1990 .