Permutation Entropy: Enhancing Discriminating Power by Using Relative Frequencies Vector of Ordinal Patterns Instead of Their Shannon Entropy

Many measures to quantify the nonlinear dynamics of a time series are based on estimating the probability of certain features from their relative frequencies. Once a normalised histogram of events is computed, a single result is usually derived. This process can be broadly viewed as a nonlinear IRn mapping into IR, where n is the number of bins in the histogram. However, this mapping might entail a loss of information that could be critical for time series classification purposes. In this respect, the present study assessed such impact using permutation entropy (PE) and a diverse set of time series. We first devised a method of generating synthetic sequences of ordinal patterns using hidden Markov models. This way, it was possible to control the histogram distribution and quantify its influence on classification results. Next, real body temperature records are also used to illustrate the same phenomenon. The experiments results confirmed the improved classification accuracy achieved using raw histogram data instead of the PE final values. Thus, this study can provide a very valuable guidance for the improvement of the discriminating capability not only of PE, but of many similar histogram-based measures.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Malay K. Pakhira Finding Number of Clusters before Finding Clusters , 2012 .

[3]  Jim Z. C. Lai,et al.  A Fuzzy K-means Clustering Algorithm Using Cluster Center Displacement , 2009, J. Inf. Sci. Eng..

[4]  Subhagata Chattopadhyay,et al.  Comparing Fuzzy-C Means and K-Means Clustering Techniques: A Comprehensive Study , 2012 .

[5]  Chun-Chieh Wang,et al.  Applications of fault diagnosis in rotating machinery by using time series analysis with neural network , 2010, Expert Syst. Appl..

[6]  Hamed Azami,et al.  Amplitude-aware permutation entropy: Illustration in spike detection and signal segmentation , 2016, Comput. Methods Programs Biomed..

[7]  Marimuthu Palaniswami,et al.  Stability, Consistency and Performance of Distribution Entropy in Analysing Short Length Heart Rate Variability (HRV) Signal , 2017, Front. Physiol..

[8]  S. Karthik,et al.  A novel method for selecting initial centroids in K-means clustering algorithm , 2016, Int. J. Intell. Syst. Technol. Appl..

[9]  Patricio A. Vela,et al.  A Comparative Study of Efficient Initialization Methods for the K-Means Clustering Algorithm , 2012, Expert Syst. Appl..

[10]  D. Cuesta-Frau Permutation entropy: Influence of amplitude information on time series classification performance. , 2019, Mathematical biosciences and engineering : MBE.

[11]  Joaquín Pérez Ortega,et al.  Improving the Efficiency and Efficacy of the K-means Clustering Algorithm Through a New Convergence Condition , 2007, ICCSA.

[12]  J. Gower,et al.  Metric and Euclidean properties of dissimilarity coefficients , 1986 .

[13]  Wei Sun,et al.  Regularized k-means clustering of high-dimensional data and its asymptotic consistency , 2012 .

[14]  Minvydas Ragulskis,et al.  Permutation Entropy Based on Non-Uniform Embedding , 2018, Entropy.

[15]  Nuno Constantino Castro,et al.  Time Series Data Mining , 2009, Encyclopedia of Database Systems.

[16]  Yubo Yuan,et al.  A Max-Min clustering method for $k$-means algorithm ofdata clustering , 2012 .

[17]  Badong Chen,et al.  Weighted-permutation entropy: a complexity measure for time series incorporating amplitude information. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Jason Lines,et al.  Classification of Household Devices by Electricity Usage Profiles , 2011, IDEAL.

[19]  David Cuesta-Frau,et al.  Unsupervised classification of ventricular extrasystoles using bounded clustering algorithms and morphology matching , 2007, Medical & Biological Engineering & Computing.

[20]  Luciano Zunino,et al.  Forbidden patterns, permutation entropy and stock market inefficiency , 2009 .

[21]  Sami Sieranoja,et al.  How much can k-means be improved by using better initialization and repeats? , 2019, Pattern Recognit..

[22]  David Cuesta-Frau,et al.  Classification of glucose records from patients at diabetes risk using a combined permutation entropy algorithm , 2018, Comput. Methods Programs Biomed..

[23]  Jun Wang,et al.  Multiscale permutation entropy analysis of electrocardiogram , 2017 .

[24]  Massimiliano Zanin,et al.  Forbidden patterns in financial time series. , 2007, Chaos.

[25]  Rimma Lapovok,et al.  Mechanical Strength and Biocompatibility of Ultrafine-Grained Commercial Purity Titanium , 2013, BioMed research international.

[26]  Ugur Halici,et al.  A novel deep learning approach for classification of EEG motor imagery signals , 2017, Journal of neural engineering.

[27]  Jos Amig Permutation Complexity in Dynamical Systems: Ordinal Patterns, Permutation Entropy and All That , 2010 .

[28]  Karsten Keller,et al.  Efficiently Measuring Complexity on the Basis of Real-World Data , 2013, Entropy.

[29]  David Cuesta-Frau,et al.  Model Selection for Body Temperature Signal Classification Using Both Amplitude and Ordinality-Based Entropy Measures , 2018, Entropy.

[30]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[31]  Keerthana Prasad,et al.  Classification of Infectious and Noninfectious Diseases Using Artificial Neural Networks from 24-Hour Continuous Tympanic Temperature Data of Patients with Undifferentiated Fever. , 2018, Critical reviews in biomedical engineering.

[32]  Rohan J. Dalpatadu,et al.  The Probability Distribution of the Sum of Several Dice: Slot Applications , 2011 .

[33]  Edilson Delgado-Trejos,et al.  Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications , 2019, Entropy.

[34]  Roberto Sassi,et al.  Bubble Entropy: An Entropy Almost Free of Parameters , 2017, IEEE Transactions on Biomedical Engineering.

[35]  Victor Chukwudi Osamor,et al.  Reducing the Time Requirement of k-Means Algorithm , 2012, PloS one.

[36]  G. Castellanos-Dominguez,et al.  An improved method for unsupervised analysis of ECG beats based on WT features and J-means clustering , 2007, 2007 Computers in Cardiology.

[37]  David Cuesta-Frau,et al.  Noisy EEG signals classification based on entropy metrics. Performance assessment using first and second generation statistics , 2017, Comput. Biol. Medicine.

[38]  Stefan Grünewald,et al.  Structured sparse K-means clustering via Laplacian smoothing , 2018, Pattern Recognit. Lett..

[39]  Luciano Zunino,et al.  Permutation entropy based time series analysis: Equalities in the input signal can lead to false conclusions , 2017 .

[40]  Germán Castellanos-Domínguez,et al.  Unsupervised feature relevance analysis applied to improve ECG heartbeat clustering , 2012, Comput. Methods Programs Biomed..

[41]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[42]  I. Chouvarda,et al.  Temperature multiscale entropy analysis: a promising marker for early prediction of mortality in septic patients , 2013, Physiological measurement.

[43]  Samantha Simons,et al.  Fuzzy Entropy Analysis of the Electroencephalogram in Patients with Alzheimer’s Disease: Is the Method Superior to Sample Entropy? , 2018, Entropy.

[44]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[45]  Karsten Keller,et al.  Ordinal Patterns, Entropy, and EEG , 2014, Entropy.

[46]  Jing Yu,et al.  Improved Permutation Entropy for Measuring Complexity of Time Series under Noisy Condition , 2019, Complex..

[47]  Aristidis Likas,et al.  The MinMax k-Means clustering algorithm , 2014, Pattern Recognit..

[48]  Sergio Cruces,et al.  Information Theory Applications in Signal Processing , 2019, Entropy.

[49]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[50]  Junjie Wu,et al.  Advances in K-means clustering: a data mining thinking , 2012 .

[51]  José Luis Rodríguez-Sotelo,et al.  Automatic Sleep Stages Classification Using EEG Entropy Features and Unsupervised Pattern Analysis Techniques , 2014, Entropy.

[52]  B. Pompe,et al.  Permutation entropy: a natural complexity measure for time series. , 2002, Physical review letters.

[53]  Jeyhun Karimov,et al.  Clustering Quality Improvement of k-means Using a Hybrid Evolutionary Model , 2015, Complex Adaptive Systems.

[54]  Shyr-Shen Yu,et al.  Two improved k-means algorithms , 2017, Appl. Soft Comput..

[55]  David Cuesta-Frau,et al.  Patterns with Equal Values in Permutation Entropy: Do They Really Matter for Biosignal Classification? , 2018, Complex..

[56]  Zhenhu Liang,et al.  Multiscale permutation entropy analysis of EEG recordings during sevoflurane anesthesia , 2010, Journal of neural engineering.

[57]  Niels Wessel,et al.  Classifying cardiac biosignals using ordinal pattern statistics and symbolic dynamics , 2012, Comput. Biol. Medicine.

[58]  Sariel Har-Peled,et al.  How Fast Is the k-Means Method? , 2005, SODA '05.

[59]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[60]  Marimuthu Palaniswami,et al.  Classification of 5-S Epileptic EEG Recordings Using Distribution Entropy and Sample Entropy , 2016, Front. Physiol..

[61]  C. Kulp,et al.  Using forbidden ordinal patterns to detect determinism in irregularly sampled time series. , 2016, Chaos.

[62]  Simon Fong,et al.  Classifying Human Voices by Using Hybrid SFX Time-Series Preprocessing and Ensemble Feature Selection , 2013, BioMed research international.

[63]  Ludmila I. Kuncheva,et al.  Evaluation of Stability of k-Means Cluster Ensembles with Respect to Random Initialization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  Yingjie Tian,et al.  A Comprehensive Survey of Clustering Algorithms , 2015, Annals of Data Science.

[65]  A. Vannucci,et al.  BICS Bath Institute for Complex Systems A note on time-dependent DiPerna-Majda measures , 2008 .

[66]  Gabriela Andreu García,et al.  Clustering of electrocardiograph signals in computer-aided Holter analysis , 2003, Comput. Methods Programs Biomed..

[67]  Sergei Vassilvitskii,et al.  Scalable K-Means++ , 2012, Proc. VLDB Endow..

[68]  Cesar H. Comin,et al.  Clustering algorithms: A comparative approach , 2016, PloS one.