As data mining has attracted significant research attention, many clustering methods have been proposed in past decades. However, most of those techniques have difficulty recognizing patterns precisely. This paper presents a new clustering algorithm, termed G-TREACLE, which can fulfill numerous clustering requirements in data mining applications. As a hybrid approach that adopts the grid-based concept, the proposed algorithm first recognizes the solid framework of clusters and then identifies the arbitrarily shaped edges of clusters by means of a new density-based expansion process, named the "tree-alike pattern". Experimental results illustrate that the new algorithm precisely recognizes the whole cluster and efficiently reduces the problem of high computational time. They also indicate that the proposed algorithm performs better than several well-known existing approaches, namely the K-means, DBSCAN, CLIQUE and GDH algorithms, and produces much smaller errors than these approaches in most of the cases examined herein.
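The two-stage idea described above (a grid-based pass that finds the cluster framework, followed by a density-based pass that resolves the edges) can be illustrated with a minimal sketch. This is not the G-TREACLE algorithm and does not implement its "tree-alike pattern" expansion, whose details are not given here. It is a generic hybrid grid/density scheme for 2-D points, and the function name `grid_density_cluster` and the parameters `cell_size`, `min_pts` and `eps` are assumptions introduced purely for illustration.

```python
from collections import defaultdict, deque
from math import floor, hypot


def grid_density_cluster(points, cell_size, min_pts, eps):
    """Illustrative hybrid grid/density clustering (NOT the G-TREACLE algorithm).

    points    : list of (x, y) tuples
    cell_size : side length of a square grid cell (hypothetical parameter)
    min_pts   : minimum number of points for a cell to count as dense
    eps       : radius used when attaching edge points to a cluster
    Returns one label per point; -1 marks points left unassigned (noise).
    """
    # Step 1: bin every point into a square grid cell.
    cells = defaultdict(list)
    for i, (x, y) in enumerate(points):
        cells[(floor(x / cell_size), floor(y / cell_size))].append(i)

    # Step 2: dense cells form the "solid framework" of the clusters.
    dense = {c for c, idx in cells.items() if len(idx) >= min_pts}

    # Step 3: merge 8-connected dense cells with a breadth-first search.
    labels = [-1] * len(points)
    cell_label = {}
    next_label = 0
    for start in sorted(dense):
        if start in cell_label:
            continue
        cell_label[start] = next_label
        queue = deque([start])
        while queue:
            cx, cy = queue.popleft()
            for i in cells[(cx, cy)]:
                labels[i] = next_label
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    nb = (cx + dx, cy + dy)
                    if nb in dense and nb not in cell_label:
                        cell_label[nb] = next_label
                        queue.append(nb)
        next_label += 1

    # Step 4: edge pass. A point in a sparse cell joins the first neighboring
    # cluster that has a framework point within distance eps of it.
    for (cx, cy), idx in cells.items():
        if (cx, cy) in dense:
            continue
        neighbors = [(cx + dx, cy + dy)
                     for dx in (-1, 0, 1) for dy in (-1, 0, 1)
                     if (cx + dx, cy + dy) in dense]
        for i in idx:
            px, py = points[i]
            for nb in neighbors:
                if any(hypot(px - points[j][0], py - points[j][1]) <= eps
                       for j in cells[nb]):
                    labels[i] = cell_label[nb]
                    break
    return labels


if __name__ == "__main__":
    sample = [(0.1, 0.1), (0.2, 0.2), (0.3, 0.1), (0.2, 0.4),  # dense blob
              (1.05, 0.2),                                      # edge point
              (10.1, 10.1), (10.2, 10.3), (10.4, 10.2),         # second blob
              (5.5, 5.5)]                                       # isolated point
    print(grid_density_cluster(sample, cell_size=1.0, min_pts=3, eps=0.8))
```

In this sketch the grid pass touches each point once, and pairwise distance checks are confined to points in sparse cells that border a dense cell, which is the general reason hybrid schemes of this kind can be cheaper than a purely density-based search over all points.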