Capturing Workflow Event Data for Monitoring, Performance Analysis, and Management of Scientific Workflows

To effectively support real-time monitoring and performance analysis of scientific workflow execution, varying levels of event data must be captured and made available to interested parties. This paper discusses the creation of an ontology-aware workflow monitoring system for use in the Trident system which utilizes a distributed publish/subscribe event model. The implementation of the publish/subscribe system is discussed and performance results are presented.

[1]  Yogesh L. Simmhan,et al.  Efficient scheduling of scientific workflows in a high performance computing cluster , 2008, CLADE '08.

[2]  Edward D. Lazowska,et al.  Trident: Scientific Workflow Workbench for Oceanography , 2008, 2008 IEEE Congress on Services - Part I.

[3]  Juliana Freire,et al.  Provenance and scientific workflows: challenges and opportunities , 2008, SIGMOD Conference.

[4]  Bartosz Balis,et al.  From Monitoring Data to Experiment Information – Monitoring of Grid Scientific Workflows , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[5]  Roger S. Barga,et al.  Automatic capture and efficient storage of e‐Science experiment provenance , 2008, Concurr. Comput. Pract. Exp..

[6]  Declan O'Sullivan,et al.  Knowledge-based networking , 2010, Principles and Applications of Distributed Event-Based Systems.

[7]  Jing Li,et al.  An Ontology-Based Publish/Subscribe System , 2004, Middleware.

[8]  Jing Hua,et al.  Service-Oriented Architecture for VIEW: A Visual Scientific Workflow Management System , 2008, 2008 IEEE International Conference on Services Computing.

[9]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.