Automatic pattern recognition of ECG signals using entropy-based adaptive dimensionality reduction and clustering

Display Omitted Developed a new entropy-based principal component analysis approach (EPCA) for the dimensionality reduction. It can automatically select the best number of principal components of the PCA.Designed a fuzzy-entropy based clustering measure (ECM) which combines the symmetric fuzzy cross entropy with the average fuzzy entropy of membership matrix.Proposed a novel fuzzy entropy c-means clustering algorithm(FECM) which can adaptively identify the optimal number of clusters based on the ECM.Experiments on ECG signals verifies that the EPCA performs better than PCA schemes based on cumulative percentage and screen graph. The clustering accuracy of FECM is superior to NJW, HCA and K-means. In order to automatically recognize the patterns of ECG signals for different subjects, a new entropy-based principal component analysis (EPCA) is developed in this paper for dimensionality reduction of ECG signals. With the EPCA, Pb, the best number of principal components for a specific subject, is automatically determined by the energy to entropy ratio of reconstructed ECG signals after dimensionality reduction. Then, a novel fuzzy-entropy based c-means clustering (FECM) is proposed for cluster partition of ECG feature data. The optimal number of clusters, i.e. kb for a data set with specific ECG feature is found through a fuzzy-entropy based clustering measure (ECM), in which the average symmetric fuzzy cross entropy of membership subset pairs is combined with the average fuzzy entropy of clusters. Afterwards, the ECG signals in the MIT-BIH Arrhythmia Database are used for performance evaluation of EPCA and FECM. Five indices of signal reconstruction are employed to evaluate the results of dimensionality reduction, which shows that the EPCA performs better than the schemes based on cumulative percentage and scree graph when searching for Pb. Moreover, by comparing ECM with the other eight fuzzy clustering indices, i.e. PC, PE, MPC, XB, FS, Kwon, FHV and PBMF, the ECM has demonstrates the superiority in searching for kb. By using ECM, the adaptability of FECM to various ECG data sets has been strengthened. Its clustering accuracy is superior to those of three commonly-used algorithms, i.e. spectral clustering (NJW), hierarchical agglomerative (HCA) and K-means clustering.

[1]  Miin-Shen Yang,et al.  A cluster validity index for fuzzy clustering , 2005, Pattern Recognit. Lett..

[2]  Xiu-Gang Shang,et al.  A note on fuzzy information measures , 1997, Pattern Recognit. Lett..

[3]  Hong Yue,et al.  A modified PCA based on the minimum error entropy , 2004, Proceedings of the 2004 American Control Conference.

[4]  Ujjwal Maulik,et al.  A study of some fuzzy cluster validity indices, genetic clustering and application to pixel classification , 2005, Fuzzy Sets Syst..

[5]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Hong He,et al.  Optimal Base Wavelet Selection for ECG Noise Reduction Using a Comprehensive Entropy Criterion , 2015, Entropy.

[8]  H. Kaiser The Application of Electronic Computers to Factor Analysis , 1960 .

[9]  Donald A. Jackson,et al.  How many principal components? stopping rules for determining the number of non-trivial axes revisited , 2005, Comput. Stat. Data Anal..

[10]  Ujjwal Maulik,et al.  Validity index for crisp and fuzzy clusters , 2004, Pattern Recognit..

[11]  Peter Clemmensen,et al.  Telecardiología: pasado, presente y futuro , 2013 .

[12]  Witold Pedrycz,et al.  Use of a fuzzy granulation-degranulation criterion for assessing cluster validity , 2011, Fuzzy Sets Syst..

[13]  Rui Xu,et al.  Clustering Algorithms in Biomedical Research: A Review , 2010, IEEE Reviews in Biomedical Engineering.

[14]  Witold Pedrycz,et al.  ECG Signal Processing, Classification and Interpretation: A Comprehensive Framework of Computational Intelligence , 2011 .

[15]  Maria Sejersten,et al.  Update: Innovation in cardiology (II). Telecardiology: past, present and future. , 2013, Revista espanola de cardiologia.

[16]  Taizo Hanai,et al.  Modified fuzzy gap statistic for estimating preferable number of clusters in fuzzy k-means clustering. , 2008, Journal of bioscience and bioengineering.

[17]  Settimo Termini,et al.  A Definition of a Nonprobabilistic Entropy in the Setting of Fuzzy Sets Theory , 1972, Inf. Control..

[18]  Soon-H. Kwon Cluster validity index for fuzzy clustering , 1998 .

[19]  Douglas N. Rutledge,et al.  GENETIC ALGORITHM APPLIED TO THE SELECTION OF PRINCIPAL COMPONENTS , 1998 .

[20]  Matteo Pardo,et al.  A stability based validity method for fuzzy clustering , 2010, Pattern Recognit..

[21]  R. Gray Entropy and Information Theory , 1990, Springer New York.

[22]  W. Hoffmann,et al.  Telemedicine and telecare for older patients--a systematic review. , 2012, Maturitas.

[23]  Asghar Ali,et al.  Principal Component Analysis Applied to Some Data from Fruit Nutrition Experiments , 1985 .

[24]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[25]  Hanqiang Liu,et al.  A multiobjective spatial fuzzy clustering algorithm for image segmentation , 2015, Appl. Soft Comput..

[26]  Y. Fukuyama,et al.  A new method of choosing the number of clusters for the fuzzy c-mean method , 1989 .

[27]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .

[28]  Michalis Vazirgiannis,et al.  Clustering validity checking methods: part II , 2002, SGMD.

[29]  G.B. Moody,et al.  The impact of the MIT-BIH Arrhythmia Database , 2001, IEEE Engineering in Medicine and Biology Magazine.

[30]  Hong He,et al.  A two-stage genetic algorithm for automatic clustering , 2012, Neurocomputing.

[31]  Ravi Kothari,et al.  On finding the number of clusters , 1999, Pattern Recognit. Lett..

[32]  K. alik Cluster validity index for estimation of fuzzy clusters of different sizes and densities , 2010 .

[33]  R. Cattell The Scree Test For The Number Of Factors. , 1966, Multivariate behavioral research.

[34]  Babak Rezaee,et al.  A cluster validity index for fuzzy clustering , 2010, Fuzzy Sets Syst..

[35]  V. J. Rayward-Smith,et al.  Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition , 1999 .

[36]  Giancarlo Diana,et al.  Cross-validation methods in principal component analysis: A comparison , 2002 .

[37]  S. Luo,et al.  A review of electrocardiogram filtering. , 2010, Journal of electrocardiology.

[38]  Weina Wang,et al.  On fuzzy cluster validity indices , 2007, Fuzzy Sets Syst..

[39]  I. Jolliffe Principal Component Analysis , 2002 .

[40]  Madhuchhanda Mitra,et al.  ECG Acquisition and Automated Remote Processing , 2013 .

[41]  Doheon Lee,et al.  On cluster validity index for estimation of the optimal number of fuzzy clusters , 2004, Pattern Recognit..

[42]  Doheon Lee,et al.  Fuzzy cluster validation index based on inter-cluster proximity , 2003, Pattern Recognit. Lett..

[43]  Enrico Zio,et al.  Signal reconstruction by a GA-optimized ensemble of PCA models , 2011 .

[44]  Rajesh N. Davé,et al.  Validating fuzzy partitions obtained through c-shells clustering , 1996, Pattern Recognit. Lett..

[45]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[46]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .