Towards Non-Stationary Grid Models

Despite intense research on Grid scheduling, differentiated quality of service remains an open question, and no consensus has emerged on the most promising strategy. The difficulties of experimentation might be one of the root causes of this stalling. An alternative to experimenting on real, large, and complex data is to look for well-founded and parsimonious representations, which may also contribute to the a-priori knowledge required for operational Autonomics. The goal of this paper is thus to explore explanatory and generative models rather than predictive ones. As a test case, we address the following issue: is it possible to exhibit and validate consistent models of the Grid workload? Most existing work on modeling the dynamics of Grid behavior describes Grids as complex systems, but assumes a steady-state system (technically stationarity) and concludes to some form of long-range dependence (slowly decaying correlation) in the associated time-series. But the physical (economic and sociologic) processes governing the Grid behavior dispel the stationarity hypothesis. This paper considers an appealing different class of models: a sequence of stationary processes separated by breakpoints. The model selection question is now defined as identifying the breakpoints and fitting the processes in each segment. Experimenting with data from the EGEE/EGI Grid, we found that a non-stationary model can consistently be identified from empirical data, and that limiting the range of models to piecewise affine (autoregressive) time series is sufficiently powerful. We propose and experiment a validation methodology that empirically addresses the current lack of theoretical results concerning the quality of the estimated model parameters. Finally, we present a bootstrapping strategy for building more robust models from the limited samples at hand.

[1]  A. D. Meglio,et al.  Programming the Grid with gLite , 2006 .

[2]  Rajesh Raman,et al.  Matchmaking: distributed resource management for high throughput computing , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[3]  Evgenia Smirni,et al.  Injecting realistic burstiness to a traditional client-server benchmark , 2009, ICAC '09.

[4]  Michèle Sebag,et al.  The Grid Observatory , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[5]  Paul Fearnhead,et al.  Exact Bayesian curve fitting and signal segmentation , 2005, IEEE Transactions on Signal Processing.

[6]  Dick H. J. Epema,et al.  A Realistic Integrated Model of Parallel System Workloads , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[7]  Byoung-Dai Lee,et al.  Run-time prediction of parallel applications on shared environments , 2003, 2003 Proceedings IEEE International Conference on Cluster Computing.

[8]  E. Rogers Diffusion of Innovations , 1962 .

[9]  Murad S. Taqqu,et al.  Robustness of whittle-type estimators for time series with long-range dependence , 1997 .

[10]  Radu Prodan,et al.  Soft Benchmarks-Based Application Performance Prediction Using a Minimum Training Set , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[11]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[12]  R. Bhattacharya,et al.  THE HURST EFFECT UNDER TRENDS , 1983 .

[13]  Lorenza Saitta,et al.  Characterization of a computational grid as a complex system , 2009, GMAC '09.

[14]  Remzi H. Arpaci-Dusseau,et al.  Gathering at the Well: Creating Communities for Grid I/O , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[15]  J. R. M. Hosking,et al.  FRACTIONAL DIFFERENCING MODELING IN HYDROLOGY , 1985 .

[16]  Alexandru Iosup,et al.  Trace-based evaluation of job runtime and queue wait time predictions in grids , 2009, HPDC '09.

[17]  Richard Wolski,et al.  Predicting Bounds on Queuing Delay in Space-shared Computing Environments , 2006, 2006 IEEE International Symposium on Workload Characterization.

[18]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[19]  F. Diebold,et al.  Long Memory and Regime Switching , 2000 .

[20]  Richard Wolski,et al.  Eliciting honest value information in a batch-queue environment , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[21]  R. Wolski,et al.  Predicting the CPU availability of time‐shared Unix systems on the computational grid , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[22]  Yang Yang,et al.  Bagging binary and quantile predictors for time series , 2006 .

[23]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[24]  Anand Raghunathan,et al.  Best-effort parallel execution framework for Recognition and mining applications , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[25]  C. Granger,et al.  Occasional structural breaks and long memory with an application to the S&P 500 absolute stock returns , 2004 .

[26]  B. Efron Bootstrap Methods: Another Look at the Jackknife , 1979 .

[27]  Julie A. McCann,et al.  A survey of autonomic computing—degrees, models, and applications , 2008, CSUR.

[28]  Fabrizio Gagliardi,et al.  Building an infrastructure for scientific Grid computing: status and goals of the EGEE project , 2005, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[29]  Michèle Sebag,et al.  Discovering Piecewise Linear Models of Grid Workload , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[30]  Yi-Ching Yao Estimating the number of change-points via Schwarz' criterion , 1988 .

[31]  Shantenu Jha,et al.  Investigating autonomic behaviours in grid-basedcomputational science applications , 2009, GMAC '09.

[32]  Thomas Fahringer,et al.  Identification, Modelling and Prediction of Non-periodic Bursts in Workloads , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[33]  Clive W. J. Granger,et al.  An introduction to long-memory time series models and fractional differencing , 2001 .

[34]  Balázs Kégl,et al.  Utility-Based Reinforcement Learning for Reactive Grids , 2008, 2008 International Conference on Autonomic Computing.

[35]  Antonio Laganà,et al.  COMPCHEM: Progress Towards GEMS a Grid Empowered Molecular Simulator and Beyond , 2010, Journal of Grid Computing.

[36]  Lingyun Yang,et al.  Conservative Scheduling: Using Predicted Variance to Improve Scheduling Decisions in Dynamic Environments , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[37]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[38]  Balázs Kégl,et al.  Multi-objective Reinforcement Learning for Responsive Grids , 2010, Journal of Grid Computing.

[39]  Johan Montagnat,et al.  Modeling the latency on production grids with respect to the execution context , 2009, Parallel Comput..

[40]  Rajarshi Das,et al.  On the use of hybrid reinforcement learning for autonomic resource allocation , 2007, Cluster Computing.

[41]  Jordi Torres,et al.  Maximizing revenue in Grid markets using an economically enhanced resource manager , 2010 .

[42]  Genshiro Kitagawa,et al.  A procedure for the modeling of non-stationary time series , 1978 .

[43]  Gerald Tesauro,et al.  Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies , 2007, IEEE Internet Computing.

[44]  Murad S. Taqqu,et al.  Testing for long‐range dependence in the presence of shifting means or a slowly declining trend, using a variance‐type estimator , 1997 .

[45]  Richard S. Sutton,et al.  Reinforcement Learning is Direct Adaptive Optimal Control , 1992, 1991 American Control Conference.

[46]  Peter A. Dinda,et al.  Host load prediction using linear models , 2000, Cluster Computing.

[47]  Warren Smith,et al.  Using Run-Time Predictions to Estimate Queue Wait Times and Improve Scheduler Performance , 1999, JSSPP.

[48]  J. Gott Implications of the Copernican principle for our future prospects , 1993, Nature.

[49]  Allen B. Downey,et al.  Using Queue Time Predictions for Processor Allocation , 1997, JSSPP.

[50]  Jordi Torres,et al.  Maximizing revenue in Grid markets using an economically enhanced resource manager , 2010, Concurr. Comput. Pract. Exp..

[51]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[52]  Michèle Sebag,et al.  Toward autonomic grids: analyzing the job flow with affinity streaming , 2009, KDD.

[53]  W. Marsden I and J , 2012 .

[54]  Dag Tjøstheim,et al.  Testing for Serial Independence Using Measures of Distance between Densities , 1996 .

[55]  Jan Beran,et al.  Statistics for long-memory processes , 1994 .

[56]  Patrick Burns,et al.  Robustness of the Ljung-Box Test and its Rank Equivalent , 2002 .

[57]  Michael Muskulus,et al.  Analysis and modeling of job arrivals in a production grid , 2007, PERV.

[58]  Richard A. Davis,et al.  Structural Break Estimation for Nonstationary Time Series Models , 2006 .

[59]  Domenico Talia,et al.  Modeling and Supporting Grid Scheduling , 2008, Journal of Grid Computing.