Computational Statistics and Data Analysis a Dual Latent Class Unfolding Model for Two-way Two-mode Preference Rating Data

In unfolding for two-way two-mode preference ratings data, the categorization of the set of individuals while the categories are represented in a low dimensional space may be an advisable procedure to facilitate their understanding. In addition to considering groups of individuals of a similar preference pattern, homogeneous groups of objects are also considered, such that within each group there are clustered objects perceived to have similar attributes. A dual latent class model is proposed for a matrix of preference ratings data, which will partition the individuals and the objects into classes, and simultaneously represent the cluster centers in a low dimensional space, while individuals and objects retain their preference relationship. Both the categories achieved and the unfolding configuration are estimated to be simultaneously optimal, by means of a conditional maximum likelihood estimation procedure, in a simulated annealing framework that enables us to take a statistical decision about the parameters of the model. The adjusted BIC statistic is employed to test the number of mixture components, and the dimensionality of the representation. Real and artificial data sets are analyzed to illustrate the model's performance.

[1]  Jonathan Ainsworth,et al.  Statistics and Data , 2011, Research Ethics for Human Geography: A Handbook for Students.

[2]  Michael P. Windham,et al.  Information and classification: Concepts, methods and applications , 1995 .

[3]  P. Groenen,et al.  Modern multidimensional scaling , 1996 .

[4]  W. Heiser A generalized majorization method for least souares multidimensional scaling of pseudodistances that may be negative , 1991 .

[5]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[6]  José A. Díaz-García,et al.  A global simulated annealing heuristic for the three-parameter lognormal maximum likelihood estimation , 2008, Computational Statistics & Data Analysis.

[7]  Willem J. Heiser,et al.  Clustering in Low-Dimensional Space , 1993 .

[8]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[9]  J. Vera,et al.  Non-stationary spatial covariance structure estimation in oversampled domains by cluster differences scaling with spatial constraints , 2008 .

[10]  The unfolding technique. , 1952 .

[11]  Patrick J. F. Groenen,et al.  Global Optimization strategies for two-mode clustering , 2005 .

[12]  P. Deb Finite Mixture Models , 2008 .

[13]  P. Groenen,et al.  Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima , 1997 .

[14]  Willem J. Heiser,et al.  Interpreting degenerate solutions in unfolding by use of the vector model and the compensatory distance model , 2005 .

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Willem J. Heiser,et al.  A Latent Class Multidimensional Scaling Model for Two-Way One-Mode Continuous Rating Dissimilarity Data , 2009 .

[17]  Willem J. Heiser,et al.  A Permutation-Translation Simulated Annealing Algorithm for L1 and L2 Unidimensional Scaling , 2005, J. Classif..

[18]  A. Hope A Simplified Monte Carlo Significance Test Procedure , 1968 .

[19]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[20]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[21]  Chih-Chien Yang,et al.  Separating Latent Classes by Information Criteria , 2007, J. Classif..

[22]  S. Dreyfus,et al.  Thermodynamical Approach to the Traveling Salesman Problem : An Efficient Simulation Algorithm , 2004 .

[23]  Amilcar Soares,et al.  Stochastic environmental research risk assessment , 2007 .

[24]  Willem J. Heiser,et al.  Global Optimization in Any Minkowski Metric: A Permutation-Translation Simulated Annealing Algorithm for Multidimensional Scaling , 2007, J. Classif..

[25]  Hans-Hermann Bock,et al.  Two-mode clustering methods: astructuredoverview , 2004, Statistical methods in medical research.

[26]  C. Coombs A theory of data. , 1965, Psychology Review.

[27]  Suzanne Winsberg,et al.  A latent class approach to fitting the weighted Euclidean model, clascal , 1993 .

[28]  W. Heiser,et al.  A latent class unfolding model for analyzing single stimulus preference ratings , 1993 .

[29]  P. Groenen,et al.  Avoiding degeneracy in multidimensional unfolding by penalizing on the coefficient of variation , 2005 .

[30]  Kathleen Marchal,et al.  Joint mapping of genes and conditions via multidimensional unfolding analysis , 2007, BMC Bioinformatics.

[31]  P. Legendre,et al.  Developments in Numerical Ecology , 1988 .

[32]  J. Rissanen Stochastic Complexity and Modeling , 1986 .