A hierarchical bayesian procedure for two-mode cluster analysis

This manuscript introduces a new Bayesian finite mixture methodology for the joint clustering of row and column stimuli/objects associated with two-mode asymmetric proximity, dominance, or profile data. That is, common clusters are derived which partition both the row and column stimuli/objects simultaneously into the same derived set of clusters. In this manner, interrelationships between both sets of entities (rows and columns) are easily ascertained. We describe the technical details of the proposed two-mode clustering methodology including its Bayesian mixture formulation and a Bayes factor heuristic for model selection. We present a modest Monte Carlo analysis to investigate the performance of the proposed Bayesian two-mode clustering procedure with respect to synthetically created data whose structure and parameters are known. Next, a consumer psychology application is provided examining physician pharmaceutical prescription behavior for various brands of prescription drugs in the neuroscience health market. We conclude by discussing several fertile areas for future research.

[1]  W. DeSarbo,et al.  On the Use of Hierarchical Clustering for the Analysis of Nonsymmetric Proximities , 1984 .

[2]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[3]  H. Jeffreys,et al.  Theory of probability , 1896 .

[4]  G. Celeux,et al.  A Classification EM algorithm for clustering and two stochastic versions , 1992 .

[5]  C. Robert,et al.  Estimation of Finite Mixture Distributions Through Bayesian Sampling , 1994 .

[6]  L. Hubert,et al.  Additive two-mode clustering: The error-variance approach revisited , 1995 .

[7]  Martin Abba Tanner,et al.  Tools for Statistical Inference: Observed Data and Data Augmentation Methods , 1993 .

[8]  Wayne S. DeSarbo,et al.  Tree Representations of Rectangular Proximity Matrices , 1984 .

[9]  P. Orlik,et al.  An error variance approach to two-mode hierarchical clustering , 1993 .

[10]  Peter Bryant,et al.  Asymptotic behaviour of classification maximum likelihood estimates , 1978 .

[11]  Thomas Eckes,et al.  An Agglomerative Method for Two-Mode Hierarchical Clustering , 1991 .

[12]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[13]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[14]  Paul J. Schweitzer,et al.  Problem Decomposition and Data Reorganization by a Clustering Technique , 1972, Oper. Res..

[15]  M. West,et al.  A Bayesian method for classification and discrimination , 1992 .

[16]  Vijay Mahajan,et al.  On the Creation of Acceptable Conjoint Analysis Experimental Designs , 1991 .

[17]  W. DeSarbo Gennclus: New models for general nonhierarchical clustering analysis , 1982 .

[18]  Wayne S. DeSarbo,et al.  Constrained classification: The use of a priori information in cluster analysis , 1984 .

[19]  Sylvia Richardson,et al.  Markov Chain Monte Carlo in Practice , 1997 .

[20]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[21]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[22]  Roger N. Shepard,et al.  Additive clustering: Representation of similarities as combinations of discrete overlapping properties. , 1979 .

[23]  A. Gelman,et al.  Estimating the Electoral Consequences of Legislative Redistricting , 1990 .

[24]  Peter G. Bryant,et al.  Large-sample results for optimization-based clustering methods , 1991 .