Optimizing Notifications of Subscription-Based Forecast Queries

Integrating sophisticated statistical methods into database management systems is gaining more and more attention in research and industry. One important statistical method is time series forecasting, which is crucial for decision management in many domains. In this context, previous work addressed the processing of ad-hoc and recurring forecast queries. In contrast, we focus on subscription-based forecast queries that arise when an application (subscriber) continuously requires forecast values for further processing. Forecast queries exhibit the unique characteristic that the underlying forecast model is updated with each new actual value and better forecast values might be available. However, (re-)sending new forecast values to the subscriber for every new value is infeasible because this can cause significant overhead at the subscriber side. The subscriber therefore wishes to be notified only when forecast values have changed relevant to the application. In this paper, we reduce the costs of the subscriber by optimizing the notifications sent to the subscriber, i.e., by balancing the number of notifications and the notification length. We introduce a generic cost model to capture arbitrary subscriber cost functions and discuss different optimization approaches that reduce the subscriber costs while ensuring constrained forecast values deviations. Our experimental evaluation on real datasets shows the validity of our approach with low computational costs.

[1]  Graham Cormode,et al.  Algorithms for distributed functional monitoring , 2008, SODA '08.

[2]  Jennifer Widom,et al.  Offering a Precision-Performance Tradeoff for Aggregation Queries over Replicated Data , 2000, VLDB.

[3]  James W. Taylor,et al.  Triple seasonal methods for short-term electricity demand forecasting , 2010, Eur. J. Oper. Res..

[4]  Qin Zhang,et al.  Multi-dimensional online tracking , 2009, SODA.

[5]  Badrish Chandramouli,et al.  Value-Based Notification Conditions in Large-Scale Publish/Subscribe Systems , 2007, VLDB.

[6]  Torben Bach Pedersen,et al.  Data management in the MIRABEL smart grid system , 2012, EDBT-ICDT '12.

[7]  Stanley B. Zdonik,et al.  A skip-list approach for efficiently processing forecasting queries , 2008, Proc. VLDB Endow..

[8]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[9]  Rob J Hyndman,et al.  Automatic Time Series Forecasting: The forecast Package for R , 2008 .

[10]  Wolfgang Lehner,et al.  F2DB: The Flash-Forward Database System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[11]  Eli Upfal,et al.  The Case for Predictive Database Systems: Opportunities and Challenges , 2011, CIDR.

[12]  Daan Six,et al.  ADDRESS: Scenarios and architecture for Active Demand development in the smart grids of the future , 2009 .

[13]  Shivnath Babu,et al.  Processing Forecasting Queries , 2007, VLDB.

[14]  Beng Chin Ooi,et al.  An adaptive updating protocol for reducing moving object database workload , 2010, Proc. VLDB Endow..

[15]  Hans-Arno Jacobsen,et al.  Modeling uncertainties in publish/subscribe systems , 2004, Proceedings. 20th International Conference on Data Engineering.

[16]  Jeffrey S. Chase,et al.  Active and accelerated learning of cost models for optimizing scientific applications , 2006, VLDB.