Efficient services composition for grid-enabled data-intensive applications

Grids technologies enable the deployment of complex data-intensive scientific applications. Nonspecific scientific codes may benefit from grid computing capabilities by (i) assembling codes in workflows (code parallelism) and (ii) processing large amounts of data (data parallelism). We designed MOTEUR, a service-based workflow manager, to describe data-intensive scientific applications in a compact framework and to efficiently process the resulting computations by transparently exploiting different parallelism levels. Theoretical performances are analyzed and results are shown based on a real application to medical image databases processing

[1]  Carole A. Goble,et al.  Feta: A Light-Weight Architecture for User Oriented Semantic Service Discovery , 2005, ESWC.

[2]  Frédéric Suter,et al.  A Scalable Approach to Network Enabled Servers , 2002, ASIAN.

[3]  Rajkumar Buyya,et al.  A taxonomy of scientific workflow systems for grid computing , 2005, SGMD.

[4]  T. Oinn,et al.  Soaplab - a unified Sesame door to analysis tools , 2003 .

[5]  Franck Cappello,et al.  Grid'5000: a large scale, reconfigurable, controlable and monitorable Grid platform , 2005 .

[6]  Kyle A. Gallivan,et al.  The gSOAP Toolkit for Web Services and Peer-to-Peer Computing Networks , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[7]  Ian T. Foster,et al.  Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, Journal of Computer Science and Technology.

[8]  Matthew Shields,et al.  A Parallel Implementation of the Inspiral Search Algorithm using Triana , 2003 .

[9]  Johan Montagnat,et al.  Medical Images Simulation, Storage, and Processing on the European DataGrid Testbed , 2004, Journal of Grid Computing.

[10]  R Pordes,et al.  The Open Science Grid (OSG) , 2005 .

[11]  Emmanuel Jeannot,et al.  Managing data persistence in network enabled servers , 2005, Sci. Program..

[12]  Ken Kennedy,et al.  TaskScheduling Strategies forWorkflow-based Applications inGrids , 2005 .

[13]  Ian J. Taylor,et al.  Distributed computing with Triana on the Grid , 2005, Concurr. Pract. Exp..

[14]  Francine Berman,et al.  Heuristics for scheduling parameter sweep applications in grid environments , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[15]  Johan Montagnat,et al.  Probabilistic and dynamic optimization of job partitioning on a grid infrastructure , 2006, 14th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP'06).

[16]  Johan Montagnat,et al.  Grid-enabled workflows for data intensive medical applications , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[17]  Ian Taylor,et al.  Grid Enabling Applications Using Triana , 2003 .

[18]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[19]  Luc Soler,et al.  Evaluation of a New 3D/2D Registration Criterion for Liver Radio-Frequencies Guided by Augmented Reality , 2003, IS4TH.

[20]  Henri Casanova,et al.  A GridRPC Model and API for End-User Applications , 2004 .

[21]  Satoshi Matsuoka,et al.  Ninf-G: A Reference Implementation of RPC-based Programming Middleware for Grid Computing , 2003, Journal of Grid Computing.

[22]  LudäscherBertram,et al.  Scientific workflow management and the Kepler system , 2006 .

[23]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[24]  Jack Dongarra,et al.  Users' Guide to NetSolve v1.4.1 , 2002 .

[25]  R. Irani,et al.  Axis: The Next Generation of Java Soap , 2002 .

[26]  Carole A. Goble,et al.  Using Semantic Web Technologies for Representing E-science Provenance , 2004, SEMWEB.

[27]  Thierry Delaitre,et al.  High-level grid application environment to use legacy codes as OGSA grid services , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.