Development of a hierarchical double application of crisp cluster validity indices: a proof-of-concept study for automated FTIR spectral histology.

Fourier-transform infrared (FTIR) spectral imaging is currently used as a non-destructive and label-free method for analyzing biological specimens. However, to highlight the different tissue regions, unsupervised clustering methods are commonly used leading to a subjective choice of the number of clusters. Here, we develop a hierarchical double application of 9 selected crisp cluster validity indices (CCVIs) using K-Means clustering. This approach when tested first on an artificial dataset showed that the indices Pakhira-Bandyopadhyay-Maulik (PBM) and Sym-Index (SI) perfectly estimated the expected 9 sub-clusters. Then, the concept was applied to a real dataset consisting of FTIR spectral images of normal human colon tissue samples originating from 5 patients. PBM and SI were revealed to be the most efficient indices that correctly identified the different colon histological components including crypts, lamina propria, muscularis mucosae, submucosa, and lymphoid aggregates. In conclusion, these results strongly suggest that the hierarchical double CCVI application is a promising method for automated and informative spectral histology.

[1]  Weina Wang,et al.  On fuzzy cluster validity indices , 2007, Fuzzy Sets Syst..

[2]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[3]  Ujjwal Maulik,et al.  Validity index for crisp and fuzzy clusters , 2004, Pattern Recognit..

[4]  Mohammadreza Khanmohammadi,et al.  Diagnosis of colon cancer by attenuated total reflectance-fourier transform infrared microspectroscopy and soft independent modeling of class analogy , 2009, Medical oncology.

[5]  Francis L Martin,et al.  Extracting biological information with computational analysis of Fourier-transform infrared (FTIR) biospectroscopy datasets: current practices to future perspectives. , 2012, The Analyst.

[6]  H. Martens,et al.  Extended Multiplicative Signal Correction as a Tool for Separation and Characterization of Physical and Chemical Information in Fourier Transform Infrared Microscopy Images of Cryo-Sections of Beef Loin , 2005, Applied spectroscopy.

[7]  K. alik,et al.  Validity index for clusters of different sizes and densities , 2011 .

[8]  Michel Manfait,et al.  Automation of an algorithm based on fuzzy clustering for analyzing tumoral heterogeneity in human skin carcinoma tissue sections , 2011, Laboratory Investigation.

[9]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[10]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[12]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Jonathan M. Garibaldi,et al.  A novel fuzzy clustering algorithm for the analysis of axillary lymph node tissue sections , 2007, Applied Intelligence.

[14]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[15]  Michel Manfait,et al.  IR spectral imaging of secreted mucus: a promising new tool for the histopathological recognition of human colonic adenocarcinomas , 2010, Histopathology.

[16]  Michel Manfait,et al.  IR spectral imaging for histopathological characterization of xenografted human colon carcinomas. , 2008, Analytical chemistry.

[17]  Sanghamitra Bandyopadhyay,et al.  A Point Symmetry-Based Clustering Technique for Automatic Evolution of Clusters , 2008, IEEE Transactions on Knowledge and Data Engineering.

[18]  Iñaki Albisua,et al.  SEP/COP: An efficient method to find the best partition in hierarchical clustering based on a new cluster validity index , 2010, Pattern Recognit..

[19]  Nicholas Stone,et al.  Current trends in machine-learning methods applied to spectroscopic cancer diagnosis , 2014 .

[20]  H. D. Liggitt,et al.  Comparative anatomy and histology : a mouse and human atlas , 2011 .

[21]  Francis L Martin,et al.  Distinguishing cell types or populations based on the computational analysis of their infrared spectra , 2010, Nature Protocols.

[22]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .