Hybrid data clustering analysis is an important issue in data mining. After analyzing the traditional clustering algorithms, the paper presents a new algorithm to cluster colorectal carcinoma auto-fluorescence spectrogram data based on lattice after analyzing the characteristics of biomedicine data. The method changes the objectpsilas attributes to lattice based on the conception of simple tuples and hyper tuples in lattice, uses the numbers of covers to measure the similarity between labels, and chooses the clustering mean-point according to the rule of high covers to high similarity. Experiments show that the new algorithm is more efficiently than the other classical ones. More importantly it is a method that works for ordinal, nominal or mixed data.
[1]
J. MacQueen.
Some methods for classification and analysis of multivariate observations
,
1967
.
[2]
Petra Perner,et al.
Data Mining - Concepts and Techniques
,
2002,
Künstliche Intell..
[3]
Joshua Zhexue Huang,et al.
Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values
,
1998,
Data Mining and Knowledge Discovery.
[4]
Michael Spann,et al.
A new approach to clustering
,
1990,
Pattern Recognit..
[5]
H. Ralambondrainy,et al.
A conceptual version of the K-means algorithm
,
1995,
Pattern Recognit. Lett..