Statistical, Evolutionary, and Neurocomputing Clustering Techniques: Cluster-Based vs Object-Based Approaches

Modern day computers cannot provide optimal solution to the clustering problem. There are many clustering algorithms that attempt to provide an approximation of the optimal solution. These clustering techniques can be broadly classified into two categories. The techniques from first category directly assign objects to clusters and then analyze the resulting clusters. The methods from second category adjust representations of clusters and then determine the object assignments. In terms of disciplines, these techniques can be classified as statistical, genetic algorithms based, and neural network based. This paper reports the results of experiments comparing five different approaches: hierarchical grouping, object-based genetic algorithms, cluster-based genetic algorithms, Kohonen neural networks, and K-means method. The comparisons consist of the time requirements and within-group errors. The theoretical analyses were tested for clustering of highway sections and supermarket customers. All the techniques were applied to clustering of highway sections. The hierarchical grouping and genetic algorithms approaches were computationally infeasible for clustering a larger set of supermarket customers. Hence only Kohonen neural networks and K-means techniques were applied to the second set to confirm some of the results from previous experiments.

[1]  R. Linsker From basic network principles to neural architecture (series) , 1986 .

[2]  L A Hoel,et al.  TRAFFIC AND HIGHWAY ENGINEERING. SECOND EDITION , 1997 .

[3]  Satish Sharma,et al.  IMPROVED METHOD OF GROUPING PROVINCEWIDE PERMANENT TRAFFIC COUNTERS , 1981 .

[4]  Rui Yan,et al.  Comparison of Conventional and Rough K-Means Clustering , 2003, RSFDGrC.

[5]  Frederick E. Petry,et al.  Genetic Algorithms , 1992 .

[6]  Pawan Lingras,et al.  Interval set clustering of web users using modified Kohonen self-organizing maps based on the properties of rough sets , 2004, Web Intell. Agent Syst..

[7]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[8]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[9]  Nicholas J. Garber,et al.  Traffic and Highway Engineering , 1988 .

[10]  Udi Manber,et al.  Introduction to algorithms - a creative approach , 1989 .

[11]  Satish Sharma,et al.  STATEWIDE TRAFFIC VOLUME STUDIES AND PRECISION OF AADT ESTIMATES. , 1996 .

[12]  D Albright,et al.  AN IMPERATIVE FOR, AND CURRENT PROGRESS TOWARD, NATIONAL TRAFFIC MONITORING STANDARDS , 1991 .

[13]  Paul C. Box Manual of Traffic Engineering Studies , 1976 .

[14]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[15]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[16]  Rui Yan,et al.  Rough Set Based Clustering: Evolutionary, Neural, and Statistical Approaches , 2003, IICAI.

[17]  Rui Yan,et al.  Clustering of Web Users: K-Means vs. Fuzzy C-Means , 2003, Indian International Conference on Artificial Intelligence.

[18]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[19]  Pawan Lingras,et al.  Temporal Web usage mining , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[20]  Pawan Lingras,et al.  Clustering Supermarket Customers Using Rough Set Based Kohonen Networks , 2003, ISMIS.

[21]  Pawan Lingras,et al.  Unsupervised Rough Set Classification Using GAs , 2001, Journal of Intelligent Information Systems.

[22]  Allen Van Gelder,et al.  Computer Algorithms: Introduction to Design and Analysis , 1978 .

[23]  Rui Yan,et al.  Fuzzy C-Means Clustering of Web Users for Educational Sites , 2003, Canadian Conference on AI.

[24]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[25]  Satish Sharma,et al.  Duration and Frequency of Seasonal Traffic Counts , 1993 .

[26]  Pawan Lingras,et al.  Supermarket Customer Attrition Analysis Based on Cluster Membership Patterns , 2003, IICAI.

[27]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .