A non-linear clustering method for fuzzy time series: Histogram damping partition under the optimized cluster paradox

The clustering problem is an emergent issue in fuzzy time series.The existing clustering methods deal with the number of clusters or their size.Optimized cluster paradox refers to that trade-off between size and number.The histogram damping algorithm (HDP) is proposed to deal with this problem by estimating a proper cluster form with number and size characteristics.The proposed model is tested against the conventional methods and the case of random data is also presented. Results indicated superiority of the proposed method. The aim of this paper is to investigate the problem of finding the efficient number of clusters in fuzzy time series. The clustering process has been discussed in the existing literature, and a number of methods have been suggested. These methods have several drawbacks, especially the lack of cluster shape and quantity optimization. There are two critical dimensions in a fuzzy time series clustering: the selection of a proper interval for fuzzy clusters and the optimization of the membership degrees among the fuzzy cluster set. The existing methods for the interval selection assume that the intended data has a short-tailed distribution, and the cluster intervals are established in identical lengths (e.g. Song and Chissom, 1994; Chen, 1996; Yolcu et al., 2009). However, the time series data (particularly in economic research) is rarely short-tailed and mostly converges to long-tail distribution because of the boom-bust market behavior. This paper proposes a novel clustering method named histogram damping partition (HDP) to define sub-clusters on the standard deviation intervals and truncate the histogram of the data by a constraint based on the coefficient of variation. The HDP approach can be used for many different kinds of fuzzy time series models at the clustering stage.

[1]  Leonard J. Tashman,et al.  Out-of-sample tests of forecasting accuracy: an analysis and review , 2000 .

[2]  R. Engle Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation , 1982 .

[3]  Roy Batchelor,et al.  Forecasting spot and forward prices in the international freight market , 2007 .

[4]  Kunhuang Huarng,et al.  Effective lengths of intervals to improve forecasting in fuzzy time series , 2001, Fuzzy Sets Syst..

[5]  Çagdas Hakan Aladag,et al.  A new approach for determining the length of intervals for fuzzy time series , 2009, Appl. Soft Comput..

[6]  D. Swanson,et al.  On MAPE-R as a measure of cross-sectional estimation and forecast accuracy 1 , 2007 .

[7]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[8]  Shyi-Ming Chen,et al.  Multivariate fuzzy forecasting based on fuzzy time series and automatic clustering techniques , 2011, Expert Syst. Appl..

[9]  B. Chissom,et al.  Forecasting enrollments with fuzzy time series—part II , 1993 .

[10]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[11]  Spyros Makridakis,et al.  Accuracy measures: theoretical and practical concerns☆ , 1993 .

[12]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[13]  Lotfi A. Zadeh,et al.  A Theory of Approximate Reasoning , 1979 .

[14]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[15]  P. Goodwin,et al.  On the asymmetry of the symmetric MAPE , 1999 .

[16]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[17]  T. Bollerslev,et al.  Generalized autoregressive conditional heteroskedasticity , 1986 .

[18]  Spyros Makridakis,et al.  The M3-Competition: results, conclusions and implications , 2000 .

[19]  Okan Duru,et al.  Modeling principles in fuzzy time series forecasting , 2012, 2012 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr).

[20]  Rajesh N. Davé,et al.  Generalized fuzzy c-shells clustering and detection of circular and elliptical boundaries , 1992, Pattern Recognit..

[21]  Edwin Diday,et al.  The dynamic clusters method in nonhierarchical clustering , 1973, International Journal of Computer & Information Sciences.

[22]  Anil K. Jain,et al.  A self-organizing network for hyperellipsoidal clustering (HEC) , 1996, IEEE Trans. Neural Networks.

[23]  Fred L. Collopy,et al.  Error Measures for Generalizing About Forecasting Methods: Empirical Comparisons , 1992 .

[24]  J. Scott Armstrong,et al.  Principles of forecasting : a handbook for researchers and practitioners , 2001 .

[25]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[26]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[27]  Shyi-Ming Chen,et al.  Fuzzy forecasting based on high-order fuzzy logical relationships and automatic clustering techniques , 2011, Expert Syst. Appl..

[28]  Emrah Bulut Modeling seasonality using the fuzzy integrated logical forecasting (FILF) approach , 2014, Expert Syst. Appl..

[29]  Okan Duru,et al.  Bivariate Long Term Fuzzy Time Series Forecasting of Dry Cargo Freight Rates , 2010 .

[30]  Chris Chatfield,et al.  A commentary on error measures , 1992 .

[31]  Okan Duru,et al.  A fuzzy integrated logical forecasting model for dry bulk shipping index forecasting: An improved fuzzy time series approach , 2010, Expert Syst. Appl..

[32]  W. Fuller,et al.  Distribution of the Estimators for Autoregressive Time Series with a Unit Root , 1979 .

[33]  Okan Duru,et al.  A fuzzy integrated logical forecasting (FILF) model of time charter rates in dry bulk shipping: A vector autoregressive design of fuzzy time series with fuzzy c-means clustering , 2012 .

[34]  Shyi-Ming Chen,et al.  Forecasting enrollments based on fuzzy time series , 1996, Fuzzy Sets Syst..