Service-oriented architecture for integration of bioinformatic data and applications

Service oriented architecture (SOA) is a new paradigm that originated in industry for future distributed computing. It is recognized as a promising architecture for application integration inside and across organizations. Since their introduction, semantic web and web services technologies are increasingly gaining interest in the implementation of e-Science infrastructures. In this dissertation, we survey current research trends and challenges for adopting SOA in general. We present a practical experiment of building a service-oriented system for data integration and analysis using current web services technologies and bioinformatics middleware. The system is enhanced with an ontological model for semantics annotation of services and data. It demonstrates that adopting SOA in the e-Science field can accelerate the scientific research process. A new methodology and an enhanced system design is proposed to facilitate the reuse of workflows and verified knowledge.

[1]  Björn Johansson,et al.  A Web Service Approach for Model Integration in Computational Design , 2003 .

[2]  Claudia Bauzer Medeiros,et al.  A framewok based in Web services orchestration for bioinformatics workflow management , 2005, WOB.

[3]  Natalya Keberle,et al.  Towards a Framework for Agent-Enabled Semantic Web Service Composition , 2004, Int. J. Web Serv. Res..

[4]  Ulf Leser,et al.  Adapters, shims, and glue - service interoperability for in silico experiments , 2006, Bioinform..

[5]  Gregory R. Madey,et al.  Improving the Reuse of Scientific Workflows and Their By-products , 2007 .

[6]  Robert Stevens,et al.  Performing in silico Experiments on the Grid : A Users Perspective , 2003 .

[7]  Michael Winikoff,et al.  Exact functional context matching for web services , 2004, ICSOC '04.

[8]  Ahmed K. Elmagarmid,et al.  Composing Web services on the Semantic Web , 2003, The VLDB Journal.

[9]  Edward A. Lee,et al.  CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 2000; 00:1–7 Prepared using cpeauth.cls [Version: 2002/09/19 v2.02] Taverna: Lessons in creating , 2022 .

[10]  Jian Yang Web service componentization , 2003, CACM.

[11]  W.M.P. van der Aalst,et al.  Don't go with the flow: web services composition standards exposed , 2003 .

[12]  Chris Greenhalgh,et al.  Performing \emph{In Silico} Experiments on the Grid: A Users' Perspective , 2003 .

[13]  L. Stein Integrating biological databases , 2003, Nature Reviews Genetics.

[14]  Carole A. Goble,et al.  Applying Semantic Web Services to Bioinformatics: Experiences Gained, Lessons Learnt , 2004, SEMWEB.

[15]  Heiko Schuldt,et al.  WebService Composition with O'GRAPE and OSIRIS , 2003, VLDB.

[16]  Carole A. Goble,et al.  Feta: A Light-Weight Architecture for User Oriented Semantic Service Discovery , 2005, ESWC.

[17]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[18]  Vipul Kashyap,et al.  OBSERVER: An Approach for Query Processing in Global Information Systems Based on Interoperation Across Pre-Existing Ontologies , 2000, Distributed and Parallel Databases.

[19]  Eleni Stroulia,et al.  Semantic Structure Matching for Assessing Web-Service Similarity , 2003, ICSOC.

[20]  Shuping Ran,et al.  A model for web services discovery with QoS , 2003, SECO.

[21]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[22]  E. Michael Maximilien,et al.  Toward autonomic web services trust and selection , 2004, ICSOC '04.

[23]  Carole A. Goble,et al.  Seven Bottlenecks to Workflow Reuse and Repurposing , 2005, International Semantic Web Conference.

[24]  Asunción Gómez-Pérez,et al.  A Framework for Design and Composition of Semantic Web Services , 2004, AAAI 2004.

[25]  A. B. Cremers,et al.  IRIS: a framework for mediator-based composition of service-oriented software , 2004 .

[26]  Carole A. Goble,et al.  A Suite of Daml+Oil Ontologies to Describe Bioinformatics Web Services and Data , 2003, Int. J. Cooperative Inf. Syst..

[27]  Patricia J. Johnson,et al.  Ancient Invasions: From Endosymbionts to Organelles , 2004, Science.

[28]  Mark Klein,et al.  Massachusetts Institute of Technology Abraham Bernstein University of Zurich Toward High-Precision Service Retrieval , 2022 .

[29]  Anne H. H. Ngu,et al.  QoS computation and policing in dynamic web service selection , 2004, WWW Alt. '04.

[30]  Amit P. Sheth,et al.  Adding Semantics to Web Services Standards , 2003, ICWS.

[31]  Wendy Hall,et al.  Creating a Science of the Web , 2006, Science.

[32]  Yolanda Gil,et al.  Artificial intelligence and grids: workflow planning and beyond , 2004, IEEE Intelligent Systems.

[33]  Nicholas R. Jennings,et al.  The Semantic Grid: Past, Present, and Future , 2005 .

[34]  Vivek Sarkar,et al.  Decentralizing execution of composite web services , 2004, OOPSLA.

[35]  Ian T. Foster,et al.  The Anatomy of the Grid: Enabling Scalable Virtual Organizations , 2001, Int. J. High Perform. Comput. Appl..

[36]  Paul Hofmann,et al.  A Mixed Initiative Approach to Semantic Web Service Discovery and Composition: SAP's Guided Procedures Framework , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[37]  Uwe Zdun,et al.  Design and Implementation of an Asynchronous Invocation Framework for Web Services , 2003, ICWS-Europe.

[38]  J. Palmer,et al.  THE SYMBIOTIC BIRTH AND SPREAD OF PLASTIDS: HOW MANY TIMES AND WHODUNIT? , 2003 .

[39]  Roderic D. M. Page,et al.  TreeView: an application to display phylogenetic trees on personal computers , 1996, Comput. Appl. Biosci..

[40]  John W. Stiller,et al.  A SINGLE ORIGIN OF PLASTIDS REVISITED: CONVERGENT EVOLUTION IN ORGANELLAR GENOME CONTENT 1 , 2003 .

[41]  Yogesh L. Simmhan,et al.  A survey of data provenance in e-science , 2005, SGMD.

[42]  Pedro M. Domingos,et al.  Learning to match ontologies on the Semantic Web , 2003, The VLDB Journal.

[43]  Kei-Hoi Cheung,et al.  A web services choreography scenario for interoperating bioinformatics applications , 2003, BMC Bioinformatics.

[44]  Jérôme Gouzy,et al.  REMORA: a pilot in the ocean of BioMoby web-services , 2006, Bioinform..

[45]  Carole A. Goble,et al.  Pedro Ontology Services: A Framework for Rapid Ontology Markup , 2005, ESWC.

[46]  Sameer Velankar,et al.  SOAP-based services provided by the European Bioinformatics Institute , 2005, Nucleic Acids Res..

[47]  Craig A. Knoblock,et al.  Web service composition as planning , 2003 .

[48]  Gregory R. Madey,et al.  A semantic Web services enabled Web portal architecture , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[49]  Jana Koehler,et al.  Web Service Composition - Current Solutions and Open Problems , 2003 .

[50]  Adam Arbree,et al.  Mapping Abstract Complex Workflows onto Grid Environments , 2003, Journal of Grid Computing.

[51]  Miroslaw Malek,et al.  Current solutions for Web service composition , 2004, IEEE Internet Computing.

[52]  Athman Bouguettaya,et al.  Ontological Approach for Information Discovery in Internet Databases , 2004, Distributed and Parallel Databases.

[53]  Manish Parashar,et al.  A Peer-to-Peer Approach to Web Service Discovery , 2004, World Wide Web.

[54]  Ian J. Taylor,et al.  Visual Grid Workflow in Triana , 2005, Journal of Grid Computing.

[55]  Calton Pu,et al.  Querying multiple bioinformatics information sources: can semantic web research help? , 2002, SGMD.

[56]  Carole A. Goble,et al.  Recycling workflows and services through discovery and reuse , 2007, Concurr. Comput. Pract. Exp..

[57]  Piero A. Bonatti,et al.  On optimal service selection , 2005, WWW '05.

[58]  Geoffrey J. Barton,et al.  The Jalview Java alignment editor , 2004, Bioinform..

[59]  Amit P. Sheth,et al.  Framework for Semantic Web Process Composition , 2003, Int. J. Electron. Commer..

[60]  G. Madey,et al.  AN ONTOLOGY FOR AGENT-BASED MODELING AND SIMULATION , 2004 .

[61]  Kaizar Amin,et al.  GridAnt: a client-controllable grid workflow system , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[62]  Amit P. Sheth,et al.  Investigating ontologies for simulation modeling , 2004, 37th Annual Simulation Symposium, 2004. Proceedings..

[64]  D. Roos,et al.  Nuclear-encoded, plastid-targeted genes suggest a single common origin for apicomplexan and dinoflagellate plastids. , 2001, Molecular biology and evolution.

[65]  Carole A. Goble,et al.  Learning domain ontologies for semantic Web service descriptions , 2005, J. Web Semant..

[66]  Ralph Deters,et al.  Selecting the best web service , 2004, CASCON.

[67]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[68]  James A. Hendler,et al.  Semi-automatic Composition ofWeb Services using Semantic Descriptions , 2003, WSMAI.

[69]  Kosuke Ishii,et al.  Integrating Producibility and Product Performance Tools Within a Web-Service Environment , 2003 .

[70]  Anne H. H. Ngu,et al.  Declarative composition and peer-to-peer provisioning of dynamic Web services , 2002, Proceedings 18th International Conference on Data Engineering.

[71]  Steffen Staab,et al.  What Is an Ontology? , 2009, Handbook on Ontologies.

[72]  L. Stein Creating a bioinformatics nation , 2002, Nature.

[73]  Amit P. Sheth,et al.  Meteor-s web service annotation framework , 2004, WWW '04.

[74]  C. Goble,et al.  The {my}Grid Project: Services, Architecture and Demonstrator , 2003 .

[75]  Mark D. Wilkinson,et al.  BioMOBY: An Open Source Biological Web Services Proposal , 2002, Briefings Bioinform..

[76]  Eric Lease Morgan,et al.  Exploiting "Light-weight" Protocols and Open Source Tools to Implement Digital Library Collections and Services , 2005, D Lib Mag..

[77]  Tran Cao Son,et al.  Semantic Web Services , 2001, IEEE Intell. Syst..

[78]  J. Raven,et al.  Genomics and chloroplast evolution: what did cyanobacteria do for plants? , 2003, Genome Biology.

[79]  Gregory R. Madey,et al.  A Service-Oriented Data Integration and Analysis Environment for In Silico Experiments and Bioinformatics Research , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[80]  Lawrence B. Holder,et al.  Graph-Based Data Mining , 2000, IEEE Intell. Syst..

[81]  Carole A. Goble,et al.  Workflow discovery: the problem, a case study from e-Science and a graph-based solution , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[82]  Mark A. Musen,et al.  PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment , 2000, AAAI/IAAI.

[83]  Yildiray Kabak,et al.  Semantically enriched web services for the travel industry , 2004, SGMD.

[84]  Ian Horrocks,et al.  A software framework for matchmaking based on semantic web technology , 2003, WWW '03.

[85]  Martin Senger,et al.  BioMoby extensions to the Taverna workflow management and enactment software , 2006, BMC Bioinformatics.

[86]  James A. Hendler,et al.  Information Gathering During Planning for Web Service Composition , 2004, SEMWEB.

[87]  Katia P. Sycara,et al.  Semantic Web Service Discovery in the OWL-S IDE , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).