Summarizing and Mining Streaming Data via a Functional Data Approach

In recent years, the analysis of data streams has become a challenging task since many applicative fields generate massive amount of data that are difficult to store and to analyze with traditional techniques. In this paper we propose a strategy to summarize pseudo periodic streaming data affected by noise and sampling problems, by means of functional profiles. It is a clustering strategy performed in a divide and conquer manner. In the on-line step, a set of summarization structures, collect statistical information on data. Starting from these, in the off-line step, the final clustering structure and the set of functional profiles are computed.

[1]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[2]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[3]  Hongyan Li,et al.  PGG: An Online Pattern Based Approach for Stream Variation Management , 2008, Journal of Computer Science and Technology.

[4]  Carl de Boor,et al.  A Practical Guide to Splines , 1978, Applied Mathematical Sciences.

[5]  C. Abraham,et al.  Unsupervised Curve Clustering using B‐Splines , 2003 .