A prediction module to optimize scheduling in a grid computing environment

Heterogeneous computing environment such as grid computing allows sharing and aggregation of a wide variety of geographically distributed computational resources (such as supercomputers, clusters, data sources, people and storage systems) and present them as a single, unified resource for solving large-scale and data-intensive computing applications. A common problem arising in grid computing is to select the most efficient resource to run a particular program. Also users are required to reserve in advance the resources needed to run their program on the grid. At present the execution time of any program submission depends on guesswork by the user. This leads to inefficient use of resources, incurring extra operation costs such as idling queues or machines. Thus a prediction module was designed and developed to aid the user. This module estimates the execution time of a program by using aspects of static analysis, analytical benchmarking and compiler based approach. It consists of 4 main stages; each with its own functionality. An incoming program is categorized accordingly, parsed and then broken down into smaller units known as tokens. The complexity and relationship amongst these tokens are then analyzed and finally the execution time is estimated for the entire program that was submitted.

[1]  Alan Jay Smith,et al.  Analysis of benchmark characteristics and benchmark performance prediction , 1996, TOCS.

[2]  Tarek A. El-Ghazawi,et al.  A performance study of job management systems , 2004, Concurr. Pract. Exp..

[3]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[4]  Wei Li,et al.  Software metrics and application domain complexity , 1997, Proceedings of Joint 4th International Computer Science Conference and 4th Asia Pacific Software Engineering Conference.

[5]  Pawel Plaszczak,et al.  Grid Computing: The Savvy Manager's Guide , 2005 .

[6]  V. Lenin,et al.  The United States of America , 2002, Government Statistical Agencies and the Politics of Credibility.

[7]  Stephen A. Jarvis,et al.  Performance prediction technology for agent-based resource management in grid environments , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[8]  Ahmed Karmouch,et al.  Job completion prediction in grid using distributed case-based reasoning , 2005, 14th IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprise (WETICE'05).

[9]  G. Taylor,et al.  Plug in to grid computing , 2004, IEEE Power and Energy Magazine.

[10]  Lucio Grandinetti,et al.  Operations research methods for resource management and scheduling in a computational grid: a survey , 2004, High Performance Computing Workshop.