Predicting Program Behavior Based On Objective Function Minimization

Computer systems increasingly rely on dynamic management of their operations with the goal of optimizing an individual or joint metric involving performance, power, temperature, reliability and so on. Such an adaptive system requires an accurate, reliable, and practically viable metric predictors to invoke the dynamic management actions in a timely and efficient manner. Unlike ad-hoc predictors proposed in the past, we propose a unified prediction method in which the optimal metric prediction problem is considered as that of minimizing an objective function. Choice of the objective function and the model type determines the form of the solution whether it is a closed form or one that is numerically determined through optimization. We formulate two particular realizations of the unified prediction method by using the total squared error and accumulated squared error as the objective functions in conjunction with autoregressive models. Under this scenario, the unified prediction method becomes linear prediction and the predictive least square (PLS) prediction, respectively. For both of these predictors, there is a analytical closed form solution that determines model parameters. Experimental results with prediction of instruction per cycle (IPC) and L1 cache miss rate metrics demonstrate superior performance for the proposed predictors over the last value predictor on SPECCPU 2000 benchmarks where in some cases the mean absolute prediction error is reduced by as much as 10-fold.

[1]  Michael C. Huang,et al.  Dynamically Tuning Processor Resources with Adaptive Processing , 2003, Computer.

[2]  R. Plackett Studies in the History of Probability and Statistics. XXIX The discovery of the method of least squares , 1972 .

[3]  Frank Bellosa,et al.  Event-Driven Energy Accounting for Dynamic Thermal Management , 2002 .

[4]  Brad Calder,et al.  Phase tracking and prediction , 2003, ISCA '03.

[5]  M. Morf,et al.  Recursive least squares ladder estimation algorithms , 1981 .

[6]  Margaret Martonosi,et al.  Long-term workload phases: duration predictions and applications to DVFS , 2005, IEEE Micro.

[7]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[8]  H. Akaike A new look at the statistical model identification , 1974 .

[9]  R. Balasubramonian,et al.  Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures , 2000, Proceedings 33rd Annual IEEE/ACM International Symposium on Microarchitecture. MICRO-33 2000.

[10]  Michael C. Huang,et al.  Positional adaptation of processors: application to energy reduction , 2003, ISCA '03.

[11]  Balaram Sinharoy,et al.  POWER4 system microarchitecture , 2002, IBM J. Res. Dev..

[12]  Sandhya Dwarkadas,et al.  Characterizing and predicting program behavior and its variability , 2003, 2003 12th International Conference on Parallel Architectures and Compilation Techniques.

[13]  Margaret Martonosi,et al.  Identifying program power phase behavior using power vectors , 2003, 2003 IEEE International Conference on Communications (Cat. No.03CH37441).

[14]  Brad Calder,et al.  Automatically characterizing large scale program behavior , 2002, ASPLOS X.

[15]  Mati Wax Order selection for AR models by predictive least-squares , 1986 .

[16]  Tie-Jun Shan On predictive least squares filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  G. Yule On a Method of Investigating Periodicities in Disturbed Series, with Special Reference to Wolfer's Sunspot Numbers , 1927 .

[18]  Margaret Martonosi,et al.  Live, Runtime Phase Monitoring and Prediction on Real Systems with Application to Dynamic Power Management , 2006, 2006 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06).

[19]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[20]  Margaret Martonosi,et al.  Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data , 2003, MICRO.

[21]  Frank Bellosa,et al.  Process cruise control: event-driven clock scaling for dynamic power management , 2002, CASES '02.

[22]  Massoud Pedram,et al.  Dynamic voltage and frequency scaling based on workload decomposition , 2004, Proceedings of the 2004 International Symposium on Low Power Electronics and Design (IEEE Cat. No.04TH8758).

[23]  Jorma Rissanen,et al.  A Predictive Least-Squares Principle , 1986 .