An effective data aggregation based adaptive long term CPU load prediction mechanism on computational grid

With the development of Internet-based technologies and the rapid growth of scientific computing applications, Grid computing is becoming increasingly attractive. In general, the execution time of a CPU-intensive task on a given resource is closely tied to the CPU load on that resource. To estimate task execution time more accurately and thereby schedule tasks effectively, it is important to make effective long-term load predictions in dynamic Grid environments. However, because prediction errors accumulate gradually from step to step and the best values of the prediction parameters may vary widely, existing prediction algorithms usually fail to achieve good accuracy in long-term prediction. To address these problems, this paper proposes an effective Data Aggregation based Adaptive Long term resource load Point-Prediction mechanism (DA^2LP_Point), which introduces a data aggregation concept to reduce the number of prediction steps. Furthermore, an interval-based prediction mechanism with a probability distribution representation, called DA^2LP_Interval, is proposed to improve the adaptability of the prediction results. The experimental results show that the DA^2LP_Point algorithm outperforms previous prediction methods in terms of mean square error (MSE). In addition, the DA^2LP_Interval algorithm attains lower prediction error with stronger representation capability, and is therefore able to provide much more useful information for task scheduling in Grid environments.
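The link between data aggregation and the number of prediction steps can be illustrated with a minimal Python sketch. It is not the DA^2LP algorithm itself: the function names, the non-overlapping window averaging, and the moving-average predictor are all assumptions chosen for illustration. The sketch only shows that forecasting on a series aggregated by a factor k covers the same horizon in ceil(horizon / k) iterated steps instead of horizon steps, so fewer predictions are fed back as inputs.

```python
import math


def aggregate(series, k):
    """Average consecutive, non-overlapping windows of k samples.

    Any ragged tail shorter than k is dropped (an assumption of this sketch).
    """
    n = len(series) // k
    return [sum(series[i * k:(i + 1) * k]) / k for i in range(n)]


def steps_needed(horizon, k):
    """Number of iterated prediction steps needed to cover `horizon` samples."""
    return math.ceil(horizon / k)


def multi_step_mean_forecast(series, steps, window=3):
    """Iterated forecast with a placeholder moving-average predictor.

    Each step predicts the mean of the last `window` values and feeds the
    prediction back in as if it had been observed, which is how errors
    accumulate over a long horizon.
    """
    history = list(series)
    predictions = []
    for _ in range(steps):
        pred = sum(history[-window:]) / window
        predictions.append(pred)
        history.append(pred)
    return predictions


def mse(predicted, actual):
    """Mean square error between two equally long sequences."""
    if len(predicted) != len(actual):
        raise ValueError("sequences must have the same length")
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


if __name__ == "__main__":
    # Hypothetical load samples, not data from the paper.
    load = [1.0, 2.0, 3.0, 4.0, 2.0, 2.0, 4.0, 4.0]
    coarse = aggregate(load, 4)
    print(coarse)
    # Covering a 60-sample horizon: raw series vs. series aggregated by 12.
    print(steps_needed(60, 1), steps_needed(60, 12))
    print(multi_step_mean_forecast(coarse, steps_needed(8, 4), window=2))
```

A coarser aggregation shortens the chain of fed-back predictions but also lowers the time resolution of the forecast, so the aggregation factor is a trade-off rather than a free parameter.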
