Fuzzy Clustering in Biochemical Analysis of Cancer Cells

Currently there is considerable effort investigating whether infrared spectroscopy can be used as a diagnostic probe to identify early stages of cancer since these techniques are sensitive to biological changes within cells. Cluster analysis is often used to try and unravel complex vibrational images. In this paper, a Fuzzy C-Means (FCM) based model selection algorithm was used to automatically cluster sets of infrared spectral data taken from lymph node tissue sections. Initial results were often prone to the creation of excessive clusters in comparison with clinical diagnosis which is partly due to the complexities of biological tissue. A new method to merge clusters was developed with the ability to successfully find and combine the two most similar clusters in order to address this problem. This new method was applied to four sets of infrared spectral data, from two types of human cancers (lymph node and oral). The experimental results clearly show that this new method can successfully combine the most similar clusters together and potentially improve the accuracy of diagnosis.

[1]  Shengrui Wang,et al.  FCM-Based Model Selection Algorithms for Determining the Number of Clusters , 2004, Pattern Recognit..

[2]  Taylor Murray,et al.  Cancer statistics, 1999 , 1999, CA: a cancer journal for clinicians.

[3]  Max Diem,et al.  Imaging of colorectal adenocarcinoma using FT-IR microspectroscopy and cluster analysis. , 2004, Biochimica et biophysica acta.

[4]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[5]  Jonathan M. Garibaldi,et al.  The Application of a Simulated Annealing Fuzzy Clustering Algorithm for Cancer Diagnosis , 2004 .

[6]  Stephen G Bown,et al.  Elastic scattering spectroscopy for intraoperative determination of sentinel lymph node status in the breast. , 2004, Journal of biomedical optics.

[7]  Jonathan M. Garibaldi,et al.  Simulated Annealing Fuzzy Clustering in Cancer Diagnosis , 2005, Informatica.

[8]  Vijay V. Raghavan,et al.  3M algorithm: finding an optimal fuzzy cluster scheme for proximity data , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[9]  Michael B. Richman,et al.  On the Application of Cluster Analysis to Growing Season Precipitation Data in North America East of the Rockies , 1995 .

[10]  A E Giuliano,et al.  Histopathologic validation of the sentinel lymph node hypothesis for breast carcinoma. , 1997, Annals of surgery.

[11]  Jonathan M. Garibaldi,et al.  Application of the Fuzzy C-Means Clustering Method on the Analysis of non Pre- processed FTIR Data for Cancer Diagnosis , 2003 .

[12]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..