Effect of data length and bin numbers on distribution entropy (DistEn) measurement in analyzing healthy aging

Complexity analysis of a given time series is executed using various measures of irregularity, the most commonly used being Approximate entropy (ApEn), Sample entropy (SampEn) and Fuzzy entropy (FuzzyEn). However, the dependence of these measures on the critical parameter of tolerance `r' leads to precarious results, owing to random selections of r. Attempts to eliminate the use of r in entropy calculations introduced a new measure of entropy namely distribution entropy (DistEn) based on the empirical probability distribution function (ePDF). DistEn completely avoids the use of a variance dependent parameter like r and replaces it by a parameter M, which corresponds to the number of bins used in the histogram to calculate it. When tested for synthetic data, M has been observed to produce a minimal effect on DistEn as compared to the effect of r on other entropy measures. Also, DistEn is said to be relatively stable with data length (N) variations, as far as synthetic data is concerned. However, these claims have not been analyzed for physiological data. Our study evaluates the effect of data length N and bin number M on the performance of DistEn using both synthetic and physiologic time series data. Synthetic logistic data of `Periodic' and `Chaotic' levels of complexity and 40 RR interval time series belonging to two groups of healthy aging population (young and elderly) have been used for the analysis. The stability and consistency of DistEn as a complexity measure as well as a classifier have been studied. Experiments prove that the parameters N and M are more influential in deciding the efficacy of DistEn performance in the case of physiologic data than synthetic data. Therefore, a generalized random selection of M for a given data length N may not always be an appropriate combination to yield good performance of DistEn for physiologic data.

[1]  S M Pincus,et al.  Approximate entropy as a measure of system complexity. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[2]  P. Castiglioni,et al.  How the threshold “r” influences approximate entropy analysis of heart-rate variability , 2008, 2008 Computers in Cardiology.

[3]  Weiting Chen,et al.  Measuring complexity using FuzzyEn, ApEn, and SampEn. , 2009, Medical engineering & physics.

[4]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[5]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[6]  Dingchang Zheng,et al.  Assessing the complexity of short-term heartbeat interval series by distribution entropy , 2014, Medical & Biological Engineering & Computing.

[7]  M. Palaniswami,et al.  Understanding ageing effects using complexity analysis of foot–ground clearance during walking , 2013, Computer methods in biomechanics and biomedical engineering.

[8]  Nathaniel H. Hunt,et al.  The Appropriate Use of Approximate Entropy and Sample Entropy with Short Data Sets , 2012, Annals of Biomedical Engineering.

[9]  C. Peng,et al.  Age-related alterations in the fractal scaling of cardiac interbeat interval dynamics. , 1996, The American journal of physiology.

[10]  Andreas Holzinger,et al.  Selection of entropy-measure parameters for knowledge discovery in heart rate variability data , 2014, BMC Bioinformatics.

[11]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[12]  Chengyu Liu,et al.  Comparison of different threshold values r for approximate entropy: application to investigate the heart rate variability between heart failure and healthy control groups , 2011, Physiological measurement.

[13]  Ki H. Chon,et al.  Automatic Selection of the Threshold Value $r$ for Approximate Entropy , 2008, IEEE Transactions on Biomedical Engineering.