Short Paper: Data Mining-based Fault Prediction and Detection on the Grid

This paper describes a novel approach to fault detection and prediction on the grid based on data mining techniques. Data mining techniques are here applied as a mean to effectively process the significant amount of captured data from grid sites, services, workflows and activities. The paper provides a first approach of proposed techniques in terms of its ability of utilizing relevant information and the fault tolerance requirements. Such approach is one intelligent, distributed framework of fault detection and prediction for anomaly and failed activity by using resource- and workflow-based information. We use fault predictions to improve the performance of the workflow execution by avoiding potential faults of activities

[1]  Thomas Fahringer,et al.  GLARE: A Grid Activity Registration, Deployment and Provisioning Framework , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[2]  Jun Qin,et al.  ASKALON: a Grid application development and computing environment , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[3]  Takashi Chikayama,et al.  A scalable and efficient self-organizing failure detector for grid applications , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[4]  MANISH PARASHAR,et al.  Conceptual and Implementation Models for the Grid , 2005, Proceedings of the IEEE.

[5]  Gregor von Laszewski,et al.  A fault detection service for wide area distributed computations , 2004, Cluster Computing.

[6]  Jie Xu,et al.  Fault Tolerance within a Grid Environment , 2003 .

[7]  Radu Prodan,et al.  DEE: A Distributed Fault Tolerant Workflow Enactment Engine for Grid Computing , 2005, HPCC.