Population mixture models and clustering algorithms

The problem of clustering individuals is considered within the context of a mixture of distributions. A modification of the usual approach to population mixtures is employed. As usual, a parametric family of distributions is considered, a set of parameter values being associated with each population. In addition, with each observation is associated an identification parameter, Indicating from which population the observation arose. Theresulting likelihood function is interpreted in terms of the conditional probability density of a sample from a mixture of populations, given the identification parameter of each observation. Clustering algorithms are obtained by applying a method of iterated maximum likelihood to this like-lihood function.

[1]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[2]  G. TEMPLE,et al.  Relaxation Methods in Theoretical Physics , 1946, Nature.

[3]  C. Eisenhart The assumptions underlying the analysis of variance. , 1947, Biometrics.

[4]  W. G. Bickley,et al.  Relaxation Methods in Theoretical Physics , 1947 .

[5]  J. Neyman,et al.  Consistent Estimates Based on Partially Consistent Observations , 1948 .

[6]  Herman Rubin,et al.  Statistical Inference in Factor Analysis , 1956 .

[7]  W. Gibson Three multivariate models: Factor analysis, latent structure analysis, and latent profile analysis , 1959 .

[8]  Kenneth E. Iverson,et al.  A programming language , 1899, AIEE-IRE '62 (Spring).

[9]  H. Robbins The Empirical Bayes Approach to Statistical Decision Problems , 1964 .

[10]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[11]  G H Ball,et al.  A clustering technique for summarizing multivariate data. , 1967, Behavioral science.

[12]  J Zubin,et al.  ON THE METHODS AND THEORY OF CLUSTERING. , 1969, Multivariate behavioral research.

[13]  N. E. Day Estimating the components of a mixture of normal distributions , 1969 .

[14]  J. Wolfe PATTERN CLUSTERING BY MULTIVARIATE MIXTURE ANALYSIS. , 1970, Multivariate behavioral research.

[15]  S. John,et al.  On Identifying the Population of Origin of Each Observation in a Mixture of Observations from Two Normal Populations , 1970 .

[16]  James M. Ortega,et al.  Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[17]  A. Scott,et al.  Clustering methods based on likelihood ratio criteria. , 1971 .

[18]  Herman Chernoff,et al.  Metric considerations in cluster analysis , 1972 .