Applications of the improved leader-follower cluster analysis (iLFCA) algorithm on large array (LA) and very large array (VLA) hyperspectral mid-infrared imaging datasets

With the potential and advantages of infrared (IR) spectroscopic applications in biological studies, and the introduction of multi-channel focal plane array (FPA) mid-IR detectors, efficient unsupervised clustering algorithms are required to identify and group similar useful spectra from background or outlier spectra within large hyperspectral datasets. Such classification algorithms are crucial for enabling further multivariate analysis. In this paper, a clustering method coined as the improved leader-follower cluster analysis (iLFCA) algorithm is expounded and demonstrated on two mid-IR imaging datasets of exfoliated oral mucosa cells: a Large Array (LA) 64 × 64 pixels image and a Very Large Array (VLA) simulated 128 × 128 pixels image created as a montage of the original LA data. By concatenating the normalized vector form of each spectrum and its integrated areas of characteristic spectral bands, such as Amide I and II, the specificity and efficacy of the clustering algorithm is enhanced. Human intervention for selecting appropriate user-specified parameters and thresholds is also minimized through the development of an automated bisection search algorithm. This resulted in better computational efficiency for iLFCA compared to its predecessor LFCA algorithm. A comparison of iLFCA and LFCA with a common unsupervised classification method based on Principal Component Analysis (PCA) shows iLFCA achieving better clustering results at shorter computational time. In particular, iLFCA has the capability to process larger datasets, namely VLA datasets, which caused both LFCA and PCA-based methods to fail because of computer memory space limitations. iLFCA can potentially be applied to analyze vibrational microspectroscopic data for diagnosis/screening of biological tissue and cells samples, cell culture growth monitoring, and examination of active pharmaceutical ingredients (APIs) distribution and real-time release of pharmaceutical tablets.

[1]  Conrad Bessant,et al.  Support vector machine ensembles for breast cancer type prediction from mid-FTIR micro-calcification spectra , 2011 .

[2]  Kejia Chen,et al.  Utilization of spectral vector properties in multivariate chemometrics analysis of hyperspectral infrared imaging data for cellular studies. , 2008, The Analyst.

[3]  Wee Chew,et al.  Hierarchical band-target entropy minimization curve resolution and Pearson VII curve-fitting analysis of cellular protein infrared imaging spectra. , 2009, Analytical biochemistry.

[4]  J. Lindon,et al.  The identification of novel biomarkers of renal toxicity using automatic data reduction techniques and PCA of proton NMR spectra of urine , 1998 .

[5]  S. Rehman,et al.  Fourier Transform Infrared (FTIR) Spectroscopy of Biological Tissues , 2008 .

[6]  C. Gendrin,et al.  Pharmaceutical applications of vibrational chemical imaging and chemometrics: a review. , 2008, Journal of pharmaceutical and biomedical analysis.

[7]  Y. Roggo,et al.  A review of near infrared spectroscopy and chemometrics in pharmaceutical technologies. , 2007, Journal of pharmaceutical and biomedical analysis.

[8]  I. W. Levin,et al.  Fourier transform infrared vibrational spectroscopic imaging: integrating microscopy and molecular recognition. , 2005, Annual review of physical chemistry.

[9]  M. Mulvey,et al.  Epidemiological typing of methicillin-resistant Staphylococcus aureus strains by Fourier transform infrared spectroscopy. , 2007, Journal of microbiological methods.

[10]  Christoph Krafft,et al.  Identification of primary tumors of brain metastases by SIMCA classification of IR spectroscopic images. , 2006, Biochimica et biophysica acta.

[11]  Michael Neumaier,et al.  Infrared spectroscopy: A new diagnostic tool in Alzheimer disease , 2007, Neuroscience Letters.

[12]  R. E. Fields,et al.  Solid-State Array Detectors for Analytical Spectrometry , 1997 .

[13]  Lynn Stothers,et al.  Classification of male lower urinary tract symptoms using mathematical modelling and a regression tree algorithm of noninvasive near-infrared spectroscopy parameters. , 2010, European urology.

[14]  Hirohiko Kuratsune,et al.  Spectroscopic diagnosis of chronic fatigue syndrome by visible and near-infrared spectroscopy in serum samples. , 2006, Biochemical and biophysical research communications.

[15]  P. Lasch,et al.  Diagnosing benign and malignant lesions in breast tissue sections by using IR-microspectroscopy. , 2006, Biochimica et biophysica acta.

[16]  L. Mariey,et al.  Discrimination, classification, identification of microorganisms using FTIR spectroscopy and chemometrics , 2001 .

[17]  Peter Bugert,et al.  Diagnosis of breast cancer with infrared spectroscopy from serum samples , 2010 .

[18]  Christoph Krafft,et al.  Disease recognition by infrared and Raman spectroscopy , 2009, Journal of biophotonics.