Metaheuristics for clustering in KDD

Research into the use of metaheuristics in clustering is reviewed and assessed. Suggestions are made for future work in this area and conceptual clustering is highlighted as a priority.

[1]  Christoph F. Eick,et al.  Using Supervised Clustering to Enhance Classifiers , 2005, ISMIS.

[2]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[3]  Andrew McCallum,et al.  Semi-Supervised Clustering with User Feedback , 2003 .

[4]  Bidyut Baran Chaudhuri,et al.  A novel genetic algorithm for automatic clustering , 2004, Pattern Recognit. Lett..

[5]  Lawrence W. Lan,et al.  Genetic clustering algorithms , 2001, Eur. J. Oper. Res..

[6]  Donald R. Jones,et al.  Solving Partitioning Problems with Genetic Algorithms , 1991, International Conference on Genetic Algorithms.

[7]  Alex A. Freitas,et al.  A Genetic Algorithm for Generalized Rule Induction , 1999 .

[8]  Jianzhuang Liu,et al.  A genetics-based approach to fuzzy clustering , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[9]  Arantza Casillas,et al.  Sampling and Feature Selection in a Genetic Algorithm for Document Clustering , 2004, CICLing.

[10]  Khaled S. Al-Sultan,et al.  A Tabu search approach to the clustering problem , 1995, Pattern Recognit..

[11]  K. Rose Deterministic annealing for clustering, compression, classification, regression, and related optimization problems , 1998, Proc. IEEE.

[12]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[13]  V. Weerackody,et al.  Design of vector quantizers using simulated annealing , 1988 .

[14]  Shokri Z. Selim,et al.  A simulated annealing algorithm for the clustering problem , 1991, Pattern Recognit..

[15]  Lin-Yu Tseng,et al.  A genetic approach to the automatic clustering problem , 2001, Pattern Recognit..

[16]  J. Vaisey,et al.  Simulated annealing and codebook design , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[17]  Joachim M. Buhmann,et al.  Semi-supervised Image Segmentation by Parametric Distributional Clustering , 2003, EMMCVPR.

[18]  Christoph F. Eick,et al.  Supervised clustering - algorithms and benefits , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[19]  Pierre Hansen,et al.  An Interior Point Algorithm for Minimum Sum-of-Squares Clustering , 1997, SIAM J. Sci. Comput..

[20]  Sugato Basu and Mikhail Bilenko and Raymond J. Mooney Semisupervised Clustering for Intelligent User Management , 2004 .

[21]  Richard C. Dubes,et al.  Experiments in projection and clustering by simulated annealing , 1989, Pattern Recognit..

[22]  Vasudha Bhatnagar,et al.  K-means Clustering Algorithm for Categorical Attributes , 1999, DaWaK.

[23]  Andries Petrus Engelbrecht,et al.  Data clustering using particle swarm optimization , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[24]  Kenneth A. De Jong,et al.  Using genetic algorithms for concept learning , 1993, Machine Learning.

[25]  James C. Bezdek,et al.  Optimization of fuzzy clustering criteria using genetic algorithms , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[26]  Andrew Skabar A GA-based Neural Network Weight Optimization Technique for Semi-Supervised Classifier Learning , 2003, HIS.

[27]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[28]  Lawrence O. Hall,et al.  Fuzzy ant clustering by centroid positioning , 2004, 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No.04CH37542).

[29]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[30]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[31]  Jonathan M. Garibaldi,et al.  The Application of a Simulated Annealing Fuzzy Clustering Algorithm for Cancer Diagnosis , 2004 .

[32]  Richard J. Enbody,et al.  Further Research on Feature Selection and Classification Using Genetic Algorithms , 1993, ICGA.

[33]  Sankar K. Pal,et al.  Data mining in soft computing framework: a survey , 2002, IEEE Trans. Neural Networks.

[34]  C. A. Murthy,et al.  In search of optimal clusters using genetic algorithms , 1996, Pattern Recognit. Lett..

[35]  Victor J. Rayward-Smith,et al.  A New Metric for Categorical Data , 2003 .

[36]  El-Ghazali Talbi,et al.  Clustering Nominal and Numerical Data: A New Distance Concept for a Hybrid Genetic Algorithm , 2004, EvoCOP.

[37]  Ujjwal Maulik,et al.  Genetic algorithm-based clustering technique , 2000, Pattern Recognit..

[38]  Lothar Litz,et al.  Generating Linguistic Fuzzy Rules for Pattern Classification with Genetic Algorithms , 1999, PKDD.

[39]  Hong Liu,et al.  Evolutionary semi-supervised fuzzy clustering , 2003, Pattern Recognit. Lett..

[40]  David A. Bell,et al.  The use of simulated annealing for clustering data in databases , 1990, Inf. Syst..

[41]  Jeng-Shyang Pan,et al.  A Tabu Seach Based Maximum Descent Algorithm for VQ Codebook Design , 2001, J. Inf. Sci. Eng..

[42]  M. Narasimha Murty,et al.  A near-optimal initial seed value selection in K-means means algorithm using a genetic algorithm , 1993, Pattern Recognit. Lett..

[43]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[44]  Olli Nevalainen,et al.  Tabu search algorithm for codebook generation in vector quantization , 1998, Pattern Recognit..

[45]  Douglas H. Fisher,et al.  Data mining tasks and methods: Clustering: conceptual clustering , 2002 .

[46]  Joshua Zhexue Huang,et al.  Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values , 1998, Data Mining and Knowledge Discovery.

[47]  Ujjwal Maulik,et al.  An evolutionary technique based on K-Means algorithm for optimal clustering in RN , 2002, Inf. Sci..

[48]  Fernando E. B. Otero,et al.  Genetic Programming for Attribute Construction in Data Mining , 2002, EuroGP.

[49]  Joshua D. Knowles,et al.  Exploiting the Trade-off - The Benefits of Multiple Objectives in Data Clustering , 2005, EMO.

[50]  Raymond J. Mooney,et al.  Integrating constraints and metric learning in semi-supervised clustering , 2004, ICML.

[51]  Agostinho C. Rosa,et al.  Independent and simultaneous evolution of fuzzy sleep classifiers by genetic algorithms , 1999 .

[52]  Gp Babu,et al.  Simulated annealing for selecting optimal initial seeds in the K-means algorithm , 1994 .

[53]  Joaquín A. Pacheco,et al.  Design of hybrids for the minimum sum-of-squares clustering problem , 2003, Comput. Stat. Data Anal..

[54]  Jiawei Han,et al.  CLARANS: A Method for Clustering Objects for Spatial Data Mining , 2002, IEEE Trans. Knowl. Data Eng..

[55]  Anthony K. H. Tung,et al.  Spatial clustering methods in data mining : A survey , 2001 .

[56]  Allen Gersho,et al.  Globally optimal vector quantizer design by stochastic relaxation , 1992, IEEE Trans. Signal Process..

[57]  David B. Fogel,et al.  Evolving fuzzy clusters , 1993, IEEE International Conference on Neural Networks.

[58]  Jeng-Shyang Pan,et al.  Constrained Ant Colony Optimization for Data Clustering , 2004, PRICAI.

[59]  Alex Alves Freitas,et al.  Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..

[60]  M. Delgado,et al.  A tabu search approach to the fuzzy clustering problem , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[61]  James C. Bezdek,et al.  Clustering with a genetically optimized approach , 1999, IEEE Trans. Evol. Comput..

[62]  Joshua D. Knowles,et al.  Evolutionary Multiobjective Clustering , 2004, PPSN.

[63]  Emanuel Falkenauer,et al.  Genetic Algorithms and Grouping Problems , 1998 .

[64]  Krzysztof Krawiec,et al.  Genetic Programming-based Construction of Features for Machine Learning and Knowledge Discovery Tasks , 2002, Genetic Programming and Evolvable Machines.

[65]  Victor J. Rayward-Smith,et al.  The Use of a Supervised k-Means Algorithm on Real-Valued Data with Applications in Health , 2003, IEA/AIE.

[66]  Jean-Louis Deneubourg,et al.  The dynamics of collective sorting robot-like ants and ant-like robots , 1991 .

[67]  Marco Dorigo,et al.  On the Performance of Ant-based Clustering , 2003, HIS.

[68]  Vijay V. Raghavan,et al.  Genetic Algorithm for Clustering with an Ordered Representation , 1991, ICGA.

[69]  V. J. Rayward-Smith,et al.  Data mining rules using multi-objective evolutionary algorithms , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[70]  Baldo Faieta,et al.  Diversity and adaptation in populations of clustering ants , 1994 .

[71]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[72]  Manish Sarkar,et al.  Evolutionary Programming-Based Fuzzy Clustering , 1996, Evolutionary Programming.

[73]  Manish Sarkar,et al.  A clustering algorithm using an evolutionary programming-based approach , 1997, Pattern Recognit. Lett..

[74]  Fernando Moura-Pires,et al.  A Genetic Approach to Fuzzy Clustering with a Validity Measure Fitness Function , 1997, IDA.

[75]  Dana Ron,et al.  A New Conceptual Clustering Framework , 2004, Machine Learning.

[76]  Kien A. Hua,et al.  A decomposition-based simulated annealing technique for data clustering , 1994, PODS '94.