Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research

While a grid represents a computing infrastructure for cross domain sharing of computational resources, the cyberinfrastructure, proposed by the US NSF Blue - Ribbon advisory panel, is expected to revolutionizing science and engineering by including more computer integrated resources, e.g. telescopes and observatories. As a part of the China national cyberinfrastructure for education and research, resource sharing of expensive scientific instruments is discussed in this work. A layered model of instrument pools is introduced and the process from submitting a job to instrument pools to obtaining results is analyzed. Fuzzy random scheduling algorithms are proposed in instrument pools when a job is submitted to one of instruments within a pool. The randomness lies in the probability which instrument could be chosen for an experiment and the fuzziness origins from vagueness of users' feedback opinions on experimental results. Users' feedback information is utilized to improve overall quality of service (QoS) of an instrument cyberinfrastructure. Several algorithms are provided to increase utilization of instruments providing higher QoS and decrease utilization of those with poor QoS. This is demonstrated in details using quantitative simulation results included in this paper.

[1]  Warren Smith,et al.  Predicting Application Run Times Using Historical Information , 1998, JSSPP.

[2]  R. Kruse,et al.  Statistics with vague data , 1987 .

[4]  Subhash Saini,et al.  GridFlow: workflow management for grid computing , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[5]  F. Berman,et al.  Adaptive Performance Prediction for Distributed Data-Intensive Applications , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[6]  Yian-Kui Liu,et al.  Fuzzy Random Variables: A Scalar Expected Value Operator , 2003, Fuzzy Optim. Decis. Mak..

[7]  Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 14-17 May 2007, Rio de Janeiro, Brazil , 2007, CCGRID.

[8]  Cheng Wu,et al.  Research on Equipment Resource Scheduling in Grids , 2004, GCC.

[9]  Warren Smith,et al.  Using Run-Time Predictions to Estimate Queue Wait Times and Improve Scheduler Performance , 1999, JSSPP.

[10]  Huibert Kwakernaak,et al.  Fuzzy random variables--II. Algorithms and examples for the discrete case , 1979, Inf. Sci..

[11]  María Angeles Gil,et al.  Fuzzy random variables , 2001, Inf. Sci..

[12]  Huibert Kwakernaak,et al.  Fuzzy random variables - I. definitions and theorems , 1978, Inf. Sci..

[13]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[14]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[15]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[16]  Baoding Liu,et al.  Inequalities and Convergence Concepts of Fuzzy and Rough Variables , 2003, Fuzzy Optim. Decis. Mak..

[17]  Subhash Saini,et al.  Local grid scheduling techniques using performance prediction , 2003 .

[18]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[19]  George Emanuel,et al.  Definitions and Theorems , 1986 .

[20]  Lingyun Yang,et al.  Conservative Scheduling: Using Predicted Variance to Improve Scheduling Decisions in Dynamic Environments , 2003, ACM/IEEE SC 2003 Conference (SC'03).