Model-Based Clustering With Dissimilarities: A Bayesian Approach

A Bayesian model-based clustering method is proposed for clustering objects on the basis of dissimilarites. This combines two basic ideas. The first is that the objects have latent positions in a Euclidean space, and that the observed dissimilarities are measurements of the Euclidean distances with error. The second idea is that the latent positions are generated from a mixture of multivariate normal distributions, each one corresponding to a cluster. We estimate the resulting model in a Bayesian way using Markov chain Monte Carlo. The method carries out multidimensional scaling and model-based clustering simultaneously, and yields good object configurations and good clustering results with reasonable measures of clustering uncertainties. In the examples we study, the clustering results based on low-dimensional configurations were almost as good as those based on high-dimensional ones. Thus, the method can be used as a tool for dimension reduction when clustering high-dimensional objects, which may be useful especially for visual inspection of clusters. We also propose a Bayesian criterion for choosing the dimension of the object configuration and the number of clusters simultaneously. This is easy to compute and works reasonably well in simulations and real examples.

[1]  Man-Suk Oh Estimation of posterior density functions from a posterior sample , 1999 .

[2]  A. Raftery,et al.  Model-based Gaussian and non-Gaussian clustering , 1993 .

[3]  R. Sibson Studies in the Robustness of Multidimensional Scaling: Perturbational Analysis of Classical Scaling , 1979 .

[4]  Adrian E. Raftery,et al.  MCLUST: Software for Model-Based Cluster Analysis , 1999 .

[5]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[6]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[7]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[8]  Richard L. Priem,et al.  Executives’ Perceptions of Uncertainty Sources: A Numerical Taxonomy and Underlying Dimensions , 2002 .

[9]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[10]  Patrick J. F. Groenen,et al.  The majorization approach to multidimensional scaling : some problems and extensions , 1993 .

[11]  Adrian E. Raftery,et al.  Inference in model-based cluster analysis , 1997, Stat. Comput..

[12]  R. Tibshirani,et al.  The Global Pairwise Approach to Radiation Hybrid Mapping , 1999 .

[13]  P. Groenen,et al.  Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima , 1997 .

[14]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[15]  David B. MacKay,et al.  Probabilistic multidimensional scaling: An anisotropic model for distance judgments , 1989 .

[16]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[17]  Joseph L. Zinnes,et al.  A Probabilistic Model for the Multidimensional Scaling of Proximity and Preference Data , 1986 .

[18]  A. Shapiro Monte Carlo Sampling Methods , 2003 .

[19]  Shijin Ren,et al.  Use of multidimensional scaling in the selection of wastewater toxicity test battery components. , 2003, Water research.

[20]  L. N. Kanal,et al.  Theory of Multidimensional Scaling , 2005 .

[21]  Adrian E. Raftery,et al.  Enhanced Model-Based Clustering, Density Estimation, and Discriminant Analysis Software: MCLUST , 2003, J. Classif..

[22]  Pierre Courrieu,et al.  Two methods for encoding clusters , 2001, Neural Networks.

[23]  Barbara P. Buttenfield,et al.  Loglinear and multidimensional scaling models of digital library navigation , 2002, Int. J. Hum. Comput. Stud..

[24]  M. Ringnér,et al.  Gene expression in inherited breast cancer. , 2002, Advances in cancer research.

[25]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[26]  A. Raftery,et al.  Bayesian Multidimensional Scaling and Choice of Dimension , 2001 .

[27]  P. Groenen,et al.  The majorization approach to multidimensional scaling for Minkowski distances , 1995 .

[28]  M. Savage,et al.  Ascription into Achievement: Models of Career Systems at Lloyds Bank, 1890-1970 , 1996, American Journal of Sociology.

[29]  Hujun Yin,et al.  Data visualisation and manifold mapping using the ViSOM , 2002, Neural Networks.

[30]  W. I. King Statistical Yearbook of the League of Nations. , 1933 .

[31]  Hinrich Schütze,et al.  Projections for efficient document clustering , 1997, SIGIR '97.

[32]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[33]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[34]  Adrian E. Raftery,et al.  Model-based clustering and data transformations for gene expression data , 2001, Bioinform..

[35]  P. Sneath The application of computers to taxonomy. , 1957, Journal of general microbiology.

[36]  G. Getz,et al.  Coupled two-way clustering analysis of gene microarray data. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[37]  D. E Welchew,et al.  Multidimensional Scaling of Integrated Neurocognitive Function and Schizophrenia as a Disconnexion Disorder , 2002, NeuroImage.

[38]  Pierre Courrieu,et al.  Straight monotonic embedding of data sets in Euclidean spaces , 2002, Neural Networks.

[39]  Brita Elvevåg,et al.  Scaling and clustering in the study of semantic disruptions in patients with schizophrenia: a re-evaluation , 2003, Schizophrenia Research.

[40]  Jarkko Venna,et al.  Analysis and visualization of gene expression data using Self-Organizing Maps , 2002, Neural Networks.

[41]  A. Abbott,et al.  Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers , 1990, American Journal of Sociology.

[42]  M. Stephens Dealing with label switching in mixture models , 2000 .

[43]  P. Groenen,et al.  Modern multidimensional scaling , 1996 .

[44]  Joseph B. Kruskal,et al.  Time Warps, String Edits, and Macromolecules , 1999 .

[45]  Forrest W. Young Multidimensional Scaling: History, Theory, and Applications , 1987 .

[46]  C. Robert,et al.  Computational and Inferential Difficulties with Mixture Posterior Distributions , 2000 .

[47]  J. Ramsay Some Statistical Approaches to Multidimensional Scaling Data , 1982 .

[48]  S. Raghavan,et al.  A visualization model based on adjacency data , 2002, Decision Support Systems.

[49]  Yoshio Takane,et al.  THE METHOD OF TRIADIC COMBINATIONS: A NEW TREATMENT AND ITS APPLICATION , 1982 .

[50]  Willem J. Heiser,et al.  13 Theory of multidimensional scaling , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[51]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.