Global behavior modeling: a new approach to grid autonomic management

Over the last decade, grid computing has paved the way for a new level of large scale distributed systems. The grid can be defined as a set of geographically dispersed, interconnected computational resources, aimed at performing challenging computational tasks. This infrastructure makes it possible to securely and reliably take advantage of widely separated computational resources that are part of several different organizations. Resources can be incorporated to the grid, building a theoretical virtual supercomputer. However, this new step in distributed computing comes along with a completely new level of complexity. Grid management mechanisms play a key role, and a correct analysis and understanding of the grid behavior is needed. Grid systems must be able to self-manage, incorporating autonomic features capable of controlling and optimizing all grid resources and services. Traditional distributed computing management mechanisms analyze each resource separately and adjust specific parameters of each one of them. When trying to adapt the same procedures to grid computing, the vast complexity of the system can make this task extremely complicated. But grid complexity could only be a matter of perspective. It could be possible to understand the grid behavior as a single system, instead of a set of resources. This abstraction could provide a deeper understanding of the system, describing large scale behavior and global events that probably would not be detected analyzing each resource separately. This abstraction could also be a solid, unified basis on top of which advanced grid autonomic management solutions could be developed. In this Ph.D. thesis a specific methodology is presented and described in order to create a global behavior model of the grid, analyzing it as a single entity. The purpose of this model is to serve as the above mentioned abstraction of the grid system, providing an unique global behavior understanding. This global behavior model becomes also an extremely valuable tool for developing autonomic management mechanisms and contributes to a service-oriented, unified vision of the grid. This methodology is strongly based on system monitoring, performance estimation tools, knowledge discovery techniques and advanced scientific visualization. As a whole, it provides a unique and new point of view of grid computing, contributing to enrich and develop this technology. To conclude, the proposed methodology has been tested on a series of typical experimental scenarios, both real and simulated, obtaining statistically meaningful results confirming that a global behavior model of a grid system benefits its understanding and serves as a solid basis to improve its autonomic capabilities.

[1]  Jack J. Dongarra,et al.  NetSolve: Grid enabling scientific computing environments , 2004, High Performance Computing Workshop.

[2]  Jong Sik Lee,et al.  User Demand Prediction-Based Resource Management Model in Grid Computing Environment , 2008, 2008 International Conference on Convergence and Hybrid Information Technology.

[3]  G.E. Moore,et al.  Cramming More Components Onto Integrated Circuits , 1998, Proceedings of the IEEE.

[4]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[5]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[6]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[7]  Rajkumar Buyya,et al.  InterGrid: a case for internetworking islands of Grids , 2008, Concurr. Comput. Pract. Exp..

[8]  Himadeepa Karlapudi Web Application Performance Prediction , 2004 .

[9]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[10]  Pérez Hernández,et al.  Arquitectura multiagente para E/S de alto rendimiento en clusters , 2011 .

[11]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[12]  Randal E. Bryant,et al.  Data-Intensive Supercomputing: The case for DISC , 2007 .

[13]  Julio J. Valdés,et al.  Virtual Reality Representation of Information Systems and Decision Rules: An Exploratory Technique for Understanding Data and Knowledge Structure , 2003, RSFDGrC.

[14]  Rajeev Gandhi,et al.  Ganesha: Black-Box Fault Diagnosis for MapReduce Systems (CMU-PDL-08-112) , 2008 .

[15]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[16]  Brian Tierney,et al.  The DataGrid Architecture Version 2 , 2001 .

[17]  J.J. Valdes,et al.  Virtual reality visual data mining with nonlinear discriminant neural networks: application to leukemia and Alzheimer gene expression data , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[18]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[19]  Aaron Weiss,et al.  Can the PC go green? , 2007, NTWK.

[20]  Eduardo Huedo,et al.  Porting of scientific applications to Grid Computing on GridWay , 2005, Sci. Program..

[21]  Rajkumar Buyya,et al.  Grids and Grid technologies for wide‐area distributed computing , 2002, Softw. Pract. Exp..

[22]  Marios D. Dikaiakos,et al.  GridBench: a tool for benchmarking grids , 2003, Proceedings. First Latin American Web Congress.

[23]  Christine Morin,et al.  XtreemOS: A Grid Operating System Making your Computer Ready for Participating in Virtual Organizations , 2007, 10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07).

[24]  Gregory R. Ganger,et al.  Self-* Storage: Brick-based Storage with Automated Administration (CMU-CS-03-178) , 2003 .

[25]  Hui Li,et al.  Mining performance data for metascheduling decision support in the Grid , 2007, Future Gener. Comput. Syst..

[26]  Laurie J. Heyer,et al.  Exploring expression data: identification and analysis of coexpressed genes. , 1999, Genome research.

[27]  Borja Sotomayor,et al.  Capacity Leasing in Cloud Systems using the OpenNebula Engine , 2008 .

[28]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[29]  David Abramson,et al.  Nimrod/G: an architecture for a resource management and scheduling system in a global computational grid , 2000, Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region.

[30]  Werner Nutt,et al.  Relational Grid Monitoring Architecture (R-GMA) , 2003, ArXiv.

[31]  Franck Cappello,et al.  Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed , 2006, Int. J. High Perform. Comput. Appl..

[32]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[33]  Ian Foster,et al.  Predicting application run times with historical information , 2004, J. Parallel Distributed Comput..

[34]  S. R. Cajal Textura del Sistema Nervioso del Hombre y de los Vertebrados, 1899–1904 , 2019 .

[35]  Joseph N. Wilson,et al.  GridOS: Operating System Services for Grid Architectures , 2003, HiPC.

[36]  Warren Smith,et al.  Scheduling with advanced reservations , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[37]  Anil K. Jain,et al.  Artificial neural network for nonlinear projection of multivariate data , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[38]  Daniel A. Menascé,et al.  Autonomic Virtualized Environments , 2006, International Conference on Autonomic and Autonomous Systems (ICAS'06).

[39]  Darrell D. E. Long,et al.  Theory of finite automata with an introduction to formal languages , 1989 .

[40]  Renato Figueiredo,et al.  Science Clouds: Early Experiences in Cloud Computing for Scientific Applications , 2008 .

[41]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[42]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[43]  Erich Schikuta,et al.  Grid-clustering: an efficient hierarchical clustering method for very large data sets , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[44]  Thomas A. Corbi,et al.  The dawning of the autonomic computing era , 2003, IBM Syst. J..

[45]  Michael J. Lewis,et al.  Multi-state grid resource availability characterization , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[46]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[47]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[48]  Marian Bubak,et al.  Performance Tools for the Grid: State of the Art and Future , 2004 .

[49]  Julio J. Valdés,et al.  Finding order in chaos: a behavior model of the whole grid , 2010, Grid 2010.

[50]  William E. Johnston,et al.  Grids as production computing environments: the engineering aspects of NASA's Information Power Grid , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[51]  Michael J. Lewis,et al.  Resource Availability Prediction for Improved Grid Scheduling , 2008, 2008 IEEE Fourth International Conference on eScience.

[52]  Dongyan Xu,et al.  VioCluster: Virtualization for Dynamic Computational Domains , 2005, 2005 IEEE International Conference on Cluster Computing.

[53]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[54]  Johan Tordsson,et al.  Resource Brokering for Grid Environments , 2004 .

[55]  Oh-Young Kwon,et al.  Web Service Resource Framework Based Computing Service Framework for Computational Grid Applications , 2006, ISPA Workshops.

[56]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[57]  J. Rossier,et al.  Classification of fusiform neocortical interneurons based on unsupervised clustering. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[58]  Donald E. Brown,et al.  A practical application of simulated annealing to clustering , 1990, Pattern Recognit..

[59]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[60]  Heinz Stockinger,et al.  Defining the grid: a snapshot on the current view , 2007, The Journal of Supercomputing.

[61]  L. R. Gabler Economies and Diseconomies of Scale in Urban Public Sectors , 1969 .

[62]  C. Mallows,et al.  A Method for Comparing Two Hierarchical Clusterings , 1983 .

[63]  R. Tryon Cluster Analysis , 1939 .

[64]  Ian T. Foster Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, NPC.

[65]  Wil M. P. van der Aalst,et al.  A Reference Model for Grid Architectures and Its Analysis , 2008, OTM Conferences.

[66]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[67]  I. Borg Multidimensional similarity structure analysis , 1987 .

[68]  H. Markram The Blue Brain Project , 2006, Nature Reviews Neuroscience.

[69]  Richard Wolski,et al.  Experiences with predicting resource performance on-line in computational grid settings , 2003, PERV.

[70]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[71]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[72]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[73]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[74]  Dietmar W. Erwin,et al.  UNICORE—a Grid computing environment , 2002, Concurr. Comput. Pract. Exp..

[75]  Alberto Sánchez Campos Autonomic high performance storage for grid environments based on long term prediction , 2008 .

[76]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[77]  Harvey B Newman,et al.  A Distributed Agent-based Architecture for Dynamic Services , 2001 .

[78]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[79]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[80]  Xuxian Jiang,et al.  VIOLIN: Virtual Internetworking on Overlay Infrastructure , 2004, ISPA.

[81]  Alan L. Cox,et al.  Datacenter Storage Architecture for MapReduce Applications , 2009 .

[82]  David H. Wolpert,et al.  The Lack of A Priori Distinctions Between Learning Algorithms , 1996, Neural Computation.

[83]  Rajkumar Buyya,et al.  GridSim: a toolkit for the modeling and simulation of distributed resource management and scheduling for Grid computing , 2002, Concurr. Comput. Pract. Exp..

[84]  A. Arithmetic Data Cube as a Data Intensive Benchmark , 2003 .

[85]  Armando Fox,et al.  Capturing, indexing, clustering, and retrieving system history , 2005, SOSP '05.

[86]  Iosif Legrand,et al.  MonALISA : A Distributed Monitoring Service Architecture , 2003, ArXiv.

[87]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[88]  R. D'Andrade U-statistic hierarchical clustering , 1978 .

[89]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[90]  Y. Escoufier,et al.  Analyse Typologique. Theories et Applications , 1982 .

[91]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[92]  GhemawatSanjay,et al.  The Google file system , 2003 .

[93]  Yuri Gurevich,et al.  Evolving Algebras: an Attempt to Discover Semantics , 1993, Current Trends in Theoretical Computer Science.

[94]  Michael J. Lewis,et al.  Scheduling on the Grid via multi-state resource availability prediction , 2008, 2008 9th IEEE/ACM International Conference on Grid Computing.

[95]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[96]  Muli Ben-Yehuda,et al.  The Reservoir model and architecture for open federated cloud computing , 2009, IBM J. Res. Dev..

[97]  Eduardo Huedo,et al.  The GridWay Framework for Adaptive Scheduling and Execution on Grids , 2001, Scalable Comput. Pract. Exp..

[98]  Jack J. Dongarra,et al.  Computer benchmarks , 1993 .

[99]  Michael A. Frumkin,et al.  NAS Grid Benchmarks: a tool for Grid space exploration , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[100]  E. Schikuta GRID-CLUSTERING: A FAST HIERARCHICAL CLUSTERING METHOD FOR VERY LARGE DATA SETS , 1993 .

[101]  Nemanja Isailovic An Introspective Approach to Speculative Execution , 2002 .

[102]  Ian Foster,et al.  Monitoring and Discovery in a Web Services Framework: Functionality and Performance of Globus Toolkit MDS4 , 2006 .

[103]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[104]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[105]  Chris Chatfield,et al.  The Analysis of Time Series : An Introduction, Sixth Edition , 2003 .

[106]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[107]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[108]  Gabriel Antoniu,et al.  BlobSeer: how to enable efficient versioning for large object storage under heavy access concurrency , 2009, EDBT/ICDT '09.

[109]  Steven Tuecke,et al.  The Open Grid Services Architecture , 2004, The Grid 2, 2nd Edition.

[110]  Julie A. McCann Adaptivity for improving web streaming application performance , 2003 .

[111]  El-Ghazali Talbi,et al.  A Taxonomy of Hybrid Metaheuristics , 2002, J. Heuristics.

[112]  George Tsouloupas Marios D. Dikaiakos Design and Implementation of GridBench ? , 2005 .

[113]  Thomas Fahringer,et al.  GridARM: Askalon's Grid Resource Management System , 2005, EGC.

[114]  Richard Mortier,et al.  Using Magpie for Request Extraction and Workload Modelling , 2004, OSDI.

[115]  D. Lindley Regression and Correlation Analysis , 1990 .

[116]  Michael A Arbib,et al.  Theories of abstract automata (Prentice-Hall series in automatic computation) , 1969 .

[117]  Gabriel Antoniu,et al.  Enabling High Data Throughput in Desktop Grids through Decentralized Data and Metadata Management: The BlobSeer Approach , 2009, Euro-Par.

[118]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.