Clustering Based Analysis of Spirometric Data Using Principal Component Analysis and Self Organizing Map

Spirometry is a valuable tool used for respiratory diagnoses and assessment of disease progression. It measures air flow to help make a definitive diagnosis of pulmonary disorder and confirms presence of airway obstruction. In this work, clustering based classification of spirometric pulmonary function data has been attempted using Principal Component Analysis (PCA) and Self Organising Map (SOM). Pulmonary function data (N=100) are obtained from normal and obstructive subjects using gold standard Spirolab II spirometer. These data are subjected to PCA to extract significant parameters relevant to the cluster structure. The clustering analysis of the significant spirometric parameters is further enhanced using self organizing map and classification of spirometric data is achieved. It is observed from results that FEV1, PEF and FEF25 '—' 75% are found to be significant in differentiating normal and obstructive subjects. SOM based classification is able to achieve accuracy of 95%. This cluster based method of feature reduction and classification could be useful in assessing the pulmonary function disorders for spirometric pulmonary function test with large dataset.

[1]  Asloob Ahmad Mudassar,et al.  Application of Principal Component Analysis in Automatic Localization of Optic Disc and Fovea in Retinal Images , 2013, Journal of medical engineering.

[2]  Swaminathan Ramakrishnan,et al.  Evaluation of Flow–Volume Spirometric Test Using Neural Network Based Prediction and Principal Component Analysis , 2011, Journal of Medical Systems.

[3]  Antonio Soriano Payá,et al.  Embedded system for diagnosing dysfunctions in the lower urinary tract , 2007, SAC '07.

[4]  S. C. Manoharan,et al.  Prediction of forced expiratory volume in normal and restrictive respiratory functions using spirometry and self-organizing map , 2009, Journal of medical engineering & technology.

[5]  Hai Fang,et al.  Using Self-Organizing Maps to Visualize, Filter and Cluster Multidimensional Bio-Omics Data , 2012 .

[6]  A.S. Banthia,et al.  Data size reduction for clustering-based binning of ICs using principal component analysis (PCA) , 2005, Proceedings. 2005 IEEE International Workshop on Current and Defect Based Testing, 2005. DBT 2005..

[7]  A. S. Zadgaonkar,et al.  Finger Knuckle Identification using Principal Component Analysis and Nearest Mean Classifier , 2013 .

[8]  José Ferrer,et al.  Using SOM and PCA for analysing and interpreting data from a P-removal SBR , 2008, Eng. Appl. Artif. Intell..

[9]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[10]  S. Ramakrishnan,et al.  Evaluation of forced expiratory volume prediction in spirometric test using Principal Component Analysis , 2011 .

[11]  Manojit Chattopadhyay,et al.  PRINCIPAL COMPONENT ANALYSIS AND SELF-ORGANIZING MAP FOR VISUAL CLUSTERING OF MACHINE-PART CELL FORMATION IN CELLULAR MANUFACTURING SYSTEM , 2011, 1201.5524.

[12]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[13]  E. L. Koua,et al.  Using self - organizing maps for information visualization and knowledge discovery in complex gespatial datasets , 2003 .

[14]  Marc Teboulle,et al.  A Unified Continuous Optimization Framework for Center-Based Clustering Methods , 2007, J. Mach. Learn. Res..

[15]  Tania S. Douglas,et al.  Fuzzy clustering to detect tuberculous meningitis-associated hyperdensity in CT images , 2008, Comput. Biol. Medicine.

[16]  M. Partridge,et al.  Why is spirometry underused in the diagnosis of the breathless patient: a qualitative study , 2011, BMC pulmonary medicine.

[17]  T. Petty,et al.  Benefits of and barriers to the widespread use of spirometry , 2005, Current opinion in pulmonary medicine.

[18]  Melody Y. Kiang,et al.  A comparative analysis of an extended SOM network and K-means analysis , 2004, Int. J. Knowl. Based Intell. Eng. Syst..

[19]  I. Guyon,et al.  Detecting stable clusters using principal component analysis. , 2003, Methods in molecular biology.

[20]  Gaibulloev Khusrav,et al.  Common Drivers of Transnational Terrorism: Principal Component Analysis , 2013 .

[21]  D. Gordon Spirometry: Thinking Beyond the COPD “Gold Standard” , 2012 .

[22]  Swaminathan Ramakrishnan,et al.  Comparison of Two ANN Methods for Classification of Spirometer Data , 2008 .

[23]  Swaminathan Ramakrishnan,et al.  Prediction of Forced Expiratory Volume in Pulmonary Function Test using Radial Basis Neural Networks and k-means Clustering , 2009, Journal of Medical Systems.

[24]  Aa Enschede Using Self-Organizing Maps for Information Visualization and Knowledge Discovery in Complex Geospatial Datasets , 2004 .