Adapters, shims, and glue - service interoperability for in silico experiments

MOTIVATION Computationally, in silico experiments in biology are workflows describing the collaboration of people, data and methods. The Grid and Web services are proposed to be the next generation infrastructure supporting the deployment of bioinformatics workflows. But the growing number of autonomous and heterogeneous services pose challenges to the used middleware w.r.t. composition, i.e. discovery and interoperability of services required within in silico experiments. In the IRIS project, we handle the problem of service interoperability by a semi-automatic procedure for identifying and placing customizable adapters into workflows built by service composition. RESULTS We show the effectiveness and robustness of the software-aided composition procedure by a case study in the field of life science. In this study we combine different database services with different analysis services with the objective of discovering required adapters. Our experiments show that we can identify relevant adapters with high precision and recall.

[1]  Laura M. Haas,et al.  The Clio project: managing heterogeneity , 2001, SGMD.

[2]  Carole A. Goble,et al.  myGrid: personalised bioinformatics on the information grid , 2003, ISMB.

[3]  Meir Cohen Similarity Search , 2008, Encyclopedia of GIS.

[4]  Jaroslav Kypr,et al.  Mapping the B-A conformational transition along plasmid DNA , 2005, Nucleic acids research.

[5]  Matthias Klusch,et al.  Larks: Dynamic Matchmaking Among Heterogeneous Software Agents in Cyberspace , 2002, Autonomous Agents and Multi-Agent Systems.

[6]  AnHai Doan,et al.  Corpus-based schema matching , 2005, 21st International Conference on Data Engineering (ICDE'05).

[7]  Sanjiva Weerawarana,et al.  Unraveling the Web services web: an introduction to SOAP, WSDL, and UDDI , 2002, IEEE Internet Computing.

[8]  Carole A. Goble,et al.  Exploring Williams-Beuren syndrome using myGrid , 2004, ISMB/ECCB.

[9]  Bertram Ludäscher,et al.  An Ontology-Driven Framework for Data Transformation in Scientific Workflows , 2004, DILS.

[10]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[11]  Armin B. Cremers,et al.  IRIS: a framework for mediator-based composition of service-oriented software , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[12]  Ajantha Dahanayake,et al.  Service-Oriented Software System Engineering: Challenges and Practices , 2004 .

[13]  A. Cremers,et al.  IRIS: a framework for mediator-based composition of service-oriented software , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[14]  Takahiro Kawamura,et al.  Semantic Matching of Web Services Capabilities , 2002, SEMWEB.

[15]  Mark D. Wilkinson,et al.  BioMOBY: An Open Source Biological Web Services Proposal , 2002, Briefings Bioinform..

[16]  Matteo Magnani,et al.  Schema Integration Based on Uncertain Semantic Mappings , 2005, ER.

[17]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[18]  Jun Zhang,et al.  Simlarity Search for Web Services , 2004, VLDB.

[19]  Cesare Pautasso,et al.  Programming for dependability in a service-based grid , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[20]  Michael Y. Galperin The Molecular Biology Database Collection: 2005 update , 2004, Nucleic Acids Res..

[21]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[22]  Wei Zhu,et al.  The Institute for Genomic Research Osa1 Rice Genome Annotation Database1 , 2005, Plant Physiology.

[23]  Heiko Schoof,et al.  BioMOBY Successfully Integrates Distributed Heterogeneous Bioinformatics Web Services. The PlaNet Exemplar Case1 , 2005, Plant Physiology.