A Comparative Study on CPU Load Predictions in a Computational Grid using Artificial Neural Network Algorithms

Background/Objectives: To evaluate the prediction accuracy of Neural Network algorithms for host CPU load prediction and evaluate their performance compared to actual values. Methods/Statistical Analysis: The speed of execution of job at the scheduled host is directly proportional to its CPU load. Therefore, target node load prediction plays an important role in job scheduling decisions. It is learnt that Neural Networks are capable of predicting the future values based on the training given on the past data. We designed a multilayer neural network and trained with learning algorithms for the input patterns collected from the load traces and predicted the future load statistics. The Mean and Standard Deviation of the predicted values are computed and analyzed against the Mean and Standard Deviation of actual values for all the ANN algorithms. Findings: We analyzed the prediction accuracy of Back Propagation, Quick Propagation, Back Propagation with Momentum and Resilient Propagation algorithm for the load traces collected from variety of computers connected in a network. Existing reports shows that Back Propagation algorithm exhibits better prediction accuracy compared to statistical approaches like linear regression and polynomial regression. In this paper, we have shown that Resilient Propagation algorithm has better prediction accuracy compared to other ANN algorithms. Application/Improvements: Job scheduling and resource selection algorithms can employ neural network algorithms to predict the load for the sharable resources connected in the network for more accurate and faster scheduling/resource selection decision.

[1]  R. Wolski,et al.  Predicting the CPU availability of time‐shared Unix systems on the computational grid , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[2]  Peter A. Dinda,et al.  Size-based scheduling policies with inaccurate scheduling information , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[3]  Richard Wolski,et al.  Dynamically forecasting network performance using the Network Weather Service , 1998, Cluster Computing.

[4]  Peter A. Dinda,et al.  Host load prediction using linear models , 2000, Cluster Computing.

[5]  Peter A. Dinda,et al.  A prediction-based real-time scheduling advisor , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[6]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[7]  Yoichi Muraoka,et al.  Extended forecast of CPU and network load on computational Grid , 2004, IEEE International Symposium on Cluster Computing and the Grid, 2004. CCGrid 2004..

[8]  Ian T. Foster,et al.  Homeostatic and tendency-based CPU load predictions , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[9]  Seyyed Reza Khaze,et al.  A New Approach in Bloggers Classification with Hybrid of K-Nearest Neighbor and Artificial Neural Network Algorithms , 2015 .

[10]  Xingfu Wu,et al.  Using Performance Prediction to Allocate Grid Resources , 2004 .

[11]  Richard Wolski,et al.  Multivariate Resource Performance Forecasting in the Network Weather Service , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[12]  Peter A. Dinda The Statistical Properties of Hoast Load , 1998, LCR.

[13]  Richard Wolski,et al.  Experiences with predicting resource performance on-line in computational grid settings , 2003, PERV.

[14]  Lingyun Yang,et al.  Conservative Scheduling: Using Predicted Variance to Improve Scheduling Decisions in Dynamic Environments , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[15]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[16]  Klara Nahrstedt,et al.  Adaptive multi-resource prediction in distributed resource sharing environment , 2004, IEEE International Symposium on Cluster Computing and the Grid, 2004. CCGrid 2004..