Learning-aided predictor integration for system performance prediction

Abstract The integration of multiple predictors promises higher prediction accuracy than the accuracy that can be obtained with a single predictor. The challenge is how to select the best predictor at any given moment. Traditionally, multiple predictors are run in parallel and the one that generates the best result is selected for prediction. In this paper, we propose a novel approach for predictor integration based on the learning of historical predictions. Compared with the traditional approach, it does not require running all the predictors simultaneously. Instead, it uses classification algorithms such as k-Nearest Neighbor (k-NN) and Bayesian classification and dimension reduction technique such as Principal Component Analysis (PCA) to forecast the best predictor for the workload under study based on the learning of historical predictions. Then only the forecasted best predictor is run for prediction. Our experimental results show that it achieved 20.18% higher best predictor forecasting accuracy than the cumulative MSE based predictor selection approach used in the popular Network Weather Service system. In addition, it outperformed the observed most accurate single predictor in the pool for 44.23% of the performance traces.

[1]  Lingyun Yang,et al.  Conservative Scheduling: Using Predicted Variance to Improve Scheduling Decisions in Dynamic Environments , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[2]  Ravi Nair,et al.  Introduction to Virtual Machines , 2005 .

[3]  Renato J. O. Figueiredo,et al.  A case for grid computing on virtual machines , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[4]  Ian Pratt,et al.  Xenoservers: accountable execution of untrusted programs , 1999, Proceedings of the Seventh Workshop on Hot Topics in Operating Systems.

[5]  Forest Baskett,et al.  An Algorithm for Finding Nearest Neighbors , 1975, IEEE Transactions on Computers.

[6]  Hiroki Suyari,et al.  Practical chaos time series analysis with financial applications , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[7]  Alfredo R. Huete,et al.  Analysis of the global vegetation dynamic metrics using MODIS Vegetation Index and land cover products , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[8]  Wei Sun,et al.  CPU Load Predictions on the Computational Grid * , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[9]  Philip K. Chan,et al.  Modeling multiple time series for anomaly detection , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[10]  Shlomit S. Pinter,et al.  Improving machine virtualization with 'hotplug memory' , 2005, 17th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD'05).

[11]  Gaurav Jain,et al.  An approach to text classification using dimensionality reduction and combination of classifiers , 2004, Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004. IRI 2004..

[12]  Mahadev Satyanarayanan,et al.  Internet suspend/resume , 2002, Proceedings Fourth IEEE Workshop on Mobile Computing Systems and Applications.

[13]  Yiming Yang,et al.  A scalability analysis of classifiers in text categorization , 2003, SIGIR.

[14]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[15]  Heikki Mannila,et al.  Random projection in dimensionality reduction: applications to image and text data , 2001, KDD '01.

[16]  Klara Nahrstedt,et al.  Adaptive multi-resource prediction in distributed resource sharing environment , 2004, IEEE International Symposium on Cluster Computing and the Grid, 2004. CCGrid 2004..

[17]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[18]  Peter A. Dinda,et al.  An Extensible Toolkit for Resource Prediction In Distributed Systems , 1999 .

[19]  Lawrence Sirovich,et al.  Management and Analysis of Large Scientific Datasets , 1992 .

[20]  Dimitris K. Tasoulis,et al.  Financial forecasting through unsupervised clustering and evolutionary trained neural networks , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[21]  Peter A. Dinda,et al.  The statistical properties of host load , 1999, Sci. Program..

[22]  Ian T. Foster,et al.  Virtual workspaces: Achieving quality of service and quality of life in the Grid , 2005, Sci. Program..

[23]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[24]  David G. Stork,et al.  Pattern Classification , 1973 .

[25]  Jennifer M. Schopf,et al.  Using Disk Throughput Data in Predictions of End-to-End Grid Data Transfers , 2002, GRID.

[26]  J. O. Rawlings,et al.  Applied Regression Analysis , 1998 .

[27]  Ivan Marsic,et al.  Modeling and prediction of session throughput of constant bit rate streams in wireless data networks , 2003, 2003 IEEE Wireless Communications and Networking, 2003. WCNC 2003..

[28]  Robert P. Goldberg,et al.  Survey of virtual machine research , 1974, Computer.

[29]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986, Encyclopedia of Big Data.

[30]  M. Rosenblum,et al.  Optimizing the migration of virtual computers , 2002, OSDI '02.

[31]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[32]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[33]  Renato J. O. Figueiredo,et al.  Autonomic Feature Selection for Application Classification , 2006, 2006 IEEE International Conference on Autonomic Computing.

[34]  Jennifer M. Schopf,et al.  Predicting sporadic grid data transfers , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[35]  Renato J. O. Figueiredo,et al.  Application classification through monitoring and learning of resource consumption patterns , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[36]  Renato J. O. Figueiredo,et al.  VMPlants: Providing and Managing Virtual Machine Execution Environments for Grid Computing , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[37]  Richard Wolski,et al.  Dynamically forecasting network performance using the Network Weather Service , 1998, Cluster Computing.

[38]  Peter A. Dinda,et al.  Host load prediction using linear models , 2000, Cluster Computing.

[39]  Horst Bunke,et al.  An evaluation of ensemble methods in handwritten word recognition based on feature selection , 2004, ICPR 2004.

[40]  Riccardo Bellazzi,et al.  A stochastic model to assess the variability of blood glucose time series in diabetic patients self-monitoring , 2006, IEEE Transactions on Biomedical Engineering.