Cross validation issues in multiobjective clustering.

The implementation of multiobjective programming methods in combinatorial data analysis is an emergent area of study with a variety of pragmatic applications in the behavioural sciences. Most notably, multiobjective programming provides a tool for analysts to model trade offs among competing criteria in clustering, seriation, and unidimensional scaling tasks. Although multiobjective programming has considerable promise, the technique can produce numerically appealing results that lack empirical validity. With this issue in mind, the purpose of this paper is to briefly review viable areas of application for multiobjective programming and, more importantly, to outline the importance of cross-validation when using this method in cluster analysis.

[1]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[2]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[3]  Michael J Brusco,et al.  Bicriterion methods for partitioning dissimilarity matrices. , 2005, The British journal of mathematical and statistical psychology.

[4]  M. Brusco Identifying a reordering of rows and columns for multiple proximity matrices using multiobjective programming , 2002 .

[5]  Stephanie Stahl,et al.  Bicriterion seriation methods for skew-symmetric matrices. , 2005, The British journal of mathematical and statistical psychology.

[6]  M. Brusco A Repetitive Branch-and-Bound Procedure for Minimum Within-Cluster Sums of Squares Partitioning , 2006, Psychometrika.

[7]  Reginald G. Golledge,et al.  Matrix reorganization and dynamic programming: Applications to paired comparisons and unidimensional seriation , 1981 .

[8]  Xavier Gandibleux,et al.  A survey and annotated bibliography of multiobjective combinatorial optimization , 2000, OR Spectr..

[9]  Saskia de Craen,et al.  Effects of Group Size and Lack of Sphericity on the Recovery of Clusters in K-means Cluster Analysis , 2006, Multivariate behavioral research.

[10]  Richard A. Brown,et al.  Patterns of change in depressive symptoms during smoking cessation: who's at risk for relapse? , 2002, Journal of consulting and clinical psychology.

[11]  Sam Kwong,et al.  Multi-Objective Evolutionary Clustering using Variable-Length Real Jumping Genes Genetic Algorithm , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[12]  L. Hubert Min and max hierarchical clustering using asymmetric similarity measures , 1973 .

[13]  S. Dolnicar,et al.  An examination of indexes for determining the number of clusters in binary data sets , 2002, Psychometrika.

[14]  Pierre Hansen,et al.  Bicriterion Cluster Analysis , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[16]  Lawrence Hubert,et al.  The Structural Representation of Proximity Matrices with MATLAB , 2006 .

[17]  L. Hubert Some applications of graph theory to clustering , 1974 .

[18]  Phipps Arabie,et al.  Combinatorial Data Analysis: Optimization by Dynamic Programming , 1987 .

[19]  Marc Despontin,et al.  Multiple Criteria Optimization: Theory, Computation, and Application, Ralph E. Steuer (Ed.). Wiley, Palo Alto, CA (1986) , 1987 .

[20]  A. Ferligoj,et al.  Direct multicriteria clustering algorithms , 1992 .

[21]  Lawrence Hubert,et al.  Order-Constrained Solutions in K-Means Clustering: Even Better Than Being Globally Optimal , 2008 .

[22]  Michael J. Brusco,et al.  Graph coloring, minimum-diameter partitioning, and the analysis of confusion matrices , 2004 .

[23]  David J. Hand,et al.  Discrimination and Classification , 1982 .

[24]  W. DeSarbo,et al.  Combinatorial Optimization Approaches to Constrained Market Segmentation: An Application to Industrial Market Segmentation , 1998 .

[25]  Anil K. Jain,et al.  Multiobjective data clustering , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[26]  Douglas B. Kell,et al.  Multiobjective Optimization in Bioinformatics and Computational Biology , 2007, IEEE ACM Trans. Comput. Biol. Bioinform..

[27]  L. Hubert SPANNING TREES AND ASPECTS OF CLUSTERING , 1974 .

[28]  E. Lawler A Comment on Minimum Feedback Arc Sets , 1964 .

[29]  Michael J. Brusco,et al.  Multicriterion Clusterwise Regression for Joint Segmentation Settings: An Application to Customer Value , 2003 .

[30]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[31]  M. Brusco,et al.  Branch-and-Bound Applications in Combinatorial Data Analysis , 2005 .

[32]  Matthias Ehrgott,et al.  Multicriteria Optimization (2. ed.) , 2005 .

[33]  Michael J. Brusco,et al.  An interactive multiobjective programming approach to combinatorial data analysis , 2001 .

[34]  Joshua D. Knowles,et al.  An Evolutionary Approach to Multiobjective Clustering , 2007, IEEE Transactions on Evolutionary Computation.

[35]  Michael J. Brusco,et al.  Compact integer-programming models for extracting subsets of stimuli from confusion matrices , 2001 .

[36]  Paul E. Green,et al.  Modifying Cluster-Based Segments to Enhance Agreement with an Exogenous Response Variable , 1996 .

[37]  Gavin L. Fox,et al.  Cautionary Remarks on the Use of Clusterwise Regression , 2008, Multivariate behavioral research.

[38]  W. S. Robinson A Method for Chronologically Ordering Archaeological Deposits , 1951, American Antiquity.

[39]  Michael J. Brusco,et al.  A Simulated Annealing Heuristic for a Bicriterion Partitioning Problem in Market Segmentation , 2002 .

[40]  M. Brusco,et al.  Multiobjective Multidimensional (City‐Block) Scaling , 2010 .

[41]  Donald E. Grierson,et al.  Multicriteria decision making in n-D , 2007 .

[42]  Douglas Steinley,et al.  A New Variable Weighting and Selection Procedure for K-means Cluster Analysis , 2008, Multivariate behavioral research.

[43]  Joshua D. Knowles,et al.  On semi-supervised clustering via multiobjective optimization , 2006, GECCO.

[44]  W. Hartup,et al.  Heterogeneity among peer-rejected boys across middle childhood: developmental pathways of social behavior. , 2002, Developmental psychology.

[45]  L. Hubert,et al.  Combinatorial Data Analysis , 1992 .