Analysis of Uncertainty in Time Series Data: Issues and Challenges

This paper reviews issues and challenges of uncertainty in time series data. The aim of uncertainty analysis is to determine the ways of how to deal with uncertain data in order to gain knowledge, fit low dimensional model, and do prediction. So as to build an efficient predictive tool, uncertainty in data could not be ruled out because it may bring important knowledge. Uncertainty information arises from different resources such as process uncertainty, model uncertainty or data uncertainty. In this paper, issues and challenges of these uncertainties in time series data will be discovered and how these issues could be solved by data mining techniques will be discussed. Frequent pattern mining algorithm through FP-growth, Apriori algorithm and H-mine are methods that could be used to investigate the existing of uncertainty data. Meanwhile, Euclidean distance, particle swarm optimization, Monte Carlo simulation, and regression are methods that could be compared as prediction methods. These methods have been implemented in many data types since early 1900s. Also, this paper shows results of the uncertainty detection test on time series data sets. The test aims to prove the existing of uncertainty in the data. This work will benefit in many application domains.

[1]  T. H. Morgan,et al.  The relation between normal and abnormal development of the embryo of the frog , 1905, Archiv für Entwicklungsmechanik der Organismen.

[2]  S. Lykoudis,et al.  Spatially interpolated time series of δ18Ο in Eastern Mediterranean precipitation , 2010 .

[3]  M. Steiner,et al.  A MACHINE LEARNING APPROACH TO FINDING WEATHER REGIMES AND SKILLFUL PREDICTOR COMBINATIONS FOR SHORT-TERM STORM FORECASTING , 2008 .

[4]  J. Hargreaves Skill and uncertainty in climate models , 2010 .

[5]  Wei Wang,et al.  Similarity Matching over Uncertain Time Series , 2011, 2011 Seventh International Conference on Computational Intelligence and Security.

[6]  Sumit Sarkar,et al.  Generalized Normal Forms for Probabilistic Relational Data , 2002, IEEE Trans. Knowl. Data Eng..

[7]  Charu C. Aggarwal,et al.  Frequent Pattern Mining Algorithms with Uncertain Data , 2009 .

[8]  Ming Xue,et al.  Machine learning enhancement of Storm Scale Ensemble precipitation forecasts , 2012, CIDU.

[9]  William Frazer Hume,et al.  The Effects of Secular Oscillation in Egypt during the Cretaceous and Eocene Periods , 1911, Quarterly Journal of the Geological Society of London.

[10]  Li Haitao,et al.  Precipitation Time Series Predicting of the Chaotic Characters Using Support Vector Machines , 2009, 2009 International Conference on Information Management, Innovation Management and Industrial Engineering.

[11]  V. Jankovic,et al.  Science Migrations , 2004 .

[12]  Maurice van Keulen,et al.  Report on the First VLDB Workshop on Management of Uncertain Data (MUD) , 2007, SGMD.

[13]  E. Hawkins,et al.  The Potential to Narrow Uncertainty in Regional Climate Predictions , 2009 .

[14]  Alon Y. Halevy,et al.  Using Probabilistic Information in Data Integration , 1997, VLDB.

[15]  Stephen Hayne,et al.  Multi-user view integration system (MUVIS): an expert system for view integration , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[16]  Laszlo Fuchs,et al.  A low-dissipative, scale-selective discretization scheme for the Navier–Stokes equations , 2012 .

[17]  M. Webb,et al.  Quantification of modelling uncertainties in a large ensemble of climate change simulations , 2004, Nature.

[18]  Thomas R. Karl,et al.  Recent Heavy Precipitation in the Vicinity of the Great Salt Lake: Just How Unusual? , 1986 .

[19]  Katsiaryna Mirylenka,et al.  Uncertain Time-Series Similarity: Return to the Basics , 2012, Proc. VLDB Endow..

[20]  Azuraliza Abu Bakar,et al.  Uncertain Time Series in Weather Prediction , 2013 .

[21]  Joann J. Ordille,et al.  Data integration: the teenage years , 2006, VLDB.

[22]  Florian Pappenberger,et al.  Ensemble flood forecasting: a review. , 2009 .

[23]  Wang Wei,et al.  Similarity Matching for Uncertain Time Series , 2014 .

[24]  Osmar R. Zaïane,et al.  An Associative Classifier for Uncertain Datasets , 2012, PAKDD.

[25]  Thomas Reichler,et al.  On the Effective Number of Climate Models , 2011 .

[26]  Hans-Peter Kriegel,et al.  Probabilistic Similarity Search for Uncertain Time Series , 2009, SSDBM.