Predictive Queries Algorithm Based on Probability Model over Data Streams

Mining the evolving trends of an online data stream and forecasting the data values in the future can provide important support for the decision-making in many time-sensitive applications. This paper models an online data stream as a continuous state transitions process by mapping the possibly infinite stream data into finite states, and uses state transition disGraph (STG) to maintain the track of the state transactions. By studying the statistic information of the history state transitions, the future values can be predicted based on the theory of Markov chain. Extensive simulation experiments are conducted and show that the predictive performance of our method is preferable to that of the existing analogous algorithms.

[1]  Wang Wei-Ping,et al.  Processing Algorithms for Predictive Aggregate Queries over Data Streams , 2005 .

[2]  Philip S. Yu,et al.  Optimal multi-scale patterns in time series streams , 2006, SIGMOD Conference.

[3]  Aleksandar Lazarevic,et al.  Effective localized regression for damage detection in large complex mechanical structures , 2004, KDD.

[4]  Jimeng Sun,et al.  Streaming Pattern Discovery in Multiple Time-Series , 2005, VLDB.

[5]  Naohiro Ishii,et al.  Improving Accuracy of Multiple Regression Analysis for Effort Prediction Model , 2006, 5th IEEE/ACIS International Conference on Computer and Information Science and 1st IEEE/ACIS International Workshop on Component-Based Software Engineering,Software Architecture and Reuse (ICIS-COMSAR'06).

[6]  Jennifer M. Schopf,et al.  Using Regression Techniques to Predict Large Data Transfers , 2003, Int. J. High Perform. Comput. Appl..

[7]  Zoran Obradovic,et al.  Modeling Spatial-Temporal Data with a Short Observation History , 2003, Knowledge and Information Systems.

[8]  Sundeep Rangan,et al.  Estimation from lossy sensor data: jump linear modeling and Kalman filtering , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.