Linguistic comparison of time series using the End-Point Fit algorithm

In this paper, we present an approach to linguistically describe two time series based on their domain values and trends. The Iterative End-Point Fit algorithm has been selected for obtaining a suitable segmentation of the series to be used in the description process. The comparison between series is performed by calculating the difference between values of two time series, which are defined on the same time domain. The resulting time series is described by several kind of summaries according to the way the difference is calculated and the permissible error. We have illustrated this proposal with several time series that represent the CO2 Emission in different countries.

[1]  Daniel Sánchez,et al.  Time Series Comparison Using Linguistic Fuzzy Techniques , 2010, IPMU.

[2]  Frank Höppner,et al.  Time Series Abstraction Methods - A Survey , 2002, GI Jahrestagung.

[3]  David H. Douglas,et al.  ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[4]  Anne Laurent,et al.  A new approach for the generation of fuzzy summaries based on fuzzy multidimensional databases , 2003, Intell. Data Anal..

[5]  Daniel Sánchez,et al.  A multi-objective memetic algorithm for the linguistic summarization of time series , 2011, GECCO '11.

[6]  Anna Wilbik,et al.  Using Fuzzy Linguistic Summaries for the Comparison of Time Series: an application to the analysis of investment fund quotations , 2009, IFSA/EUSFLAT Conf..

[7]  Félix Díaz-Hermida,et al.  Linguistic summarization of data with probabilistic fuzzy quantifiers , 2010 .

[8]  Lotfi A. Zadeh,et al.  Computing with Words and Perceptions - A Paradigm Shift , 2009, PDPTA.

[9]  Anna Wilbik,et al.  An approach to the linguistic summarization of time series using a fuzzy quantifier driven aggregation , 2010, Int. J. Intell. Syst..

[10]  Daniel Sánchez,et al.  A proposal for the hierarchical segmentation of time series. Application to trend-based linguistic description , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[11]  Noureddine Mouaddib,et al.  SEQ: a fuzzy set-based approach to database summarization , 2002, Fuzzy Sets Syst..

[12]  Gracián Triviño,et al.  Automatically Generated Linguistic Summaries of Energy Consumption Data , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[13]  A. Bugar Linguistic summarization of data with probabilistic fuzzy quantifiers , 2010 .

[14]  Daniel Sánchez,et al.  Linguistic query answering on data cubes with time dimension , 2011, Int. J. Intell. Syst..

[15]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[16]  Sankar K. Pal,et al.  Data mining in soft computing framework: a survey , 2002, IEEE Trans. Neural Networks.

[17]  Ichiro Kobayashi,et al.  An approach to linguistic summarization based on comparison among multiple time-series data , 2012, The 6th International Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on Advanced Intelligence Systems.

[18]  Daniel Sánchez,et al.  Quality Assessment in Linguistic Summaries of Data , 2012, IPMU.

[19]  Slawomir Zadrozny,et al.  Data mining via protoform based linguistic summaries: Some possible relations to natural language generation , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[20]  Urs Ramer,et al.  An iterative procedure for the polygonal approximation of plane curves , 1972, Comput. Graph. Image Process..

[21]  Bernadette Bouchon-Meunier,et al.  Mathematical Morphology Tools to Evaluate Periodic Linguistic Summaries , 2013, FQAS.

[22]  Kazuhisa Seta,et al.  Improved method for linguistic expression of time series with global trend and local features , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[23]  Nicolás Marín,et al.  A Fuzzy Approach to the Linguistic Summarization of Time Series , 2011, J. Multiple Valued Log. Soft Comput..