WOODSS and the Web: annotating and reusing scientific workflows

This paper discusses ongoing research on scientific workflows at the Institute of Computing, University of Campinas (IC-UNICAMP), Brazil. Our projects with bio-scientists have led us to develop a scientific workflow infrastructure named WOODSS. The framework has two main objectives: to help scientists specify and annotate their models and experiments, and to document collaborative efforts in scientific activities. In both contexts, workflows are annotated and stored in a database. This "annotated scientific workflow" database is treated as a repository of (sometimes incomplete) approaches to solving scientific problems. It thus serves two purposes: it allows comparison of distinct solutions to a problem and of their designs, and it provides reusable, executable building blocks for constructing new scientific workflows that meet specific needs. Annotations, moreover, give further insight into methodology, success rates, underlying hypotheses and other issues in experimental activities. Our current research challenges include extending this framework to the Web, following Semantic Web standards; providing means of discovering workflow components on the Web for reuse; and using planning techniques from Artificial Intelligence to support composition mechanisms. This paper describes our efforts in these directions, tested in two domains: agro-environmental planning and bioinformatics.
