SemanticSCo: A platform to support the semantic composition of services for gene expression analysis

Gene expression studies often require the combined use of a number of analysis tools. However, manual integration of analysis tools can be cumbersome and error prone. To support a higher level of automation in the integration process, efforts have been made in the biomedical domain towards the development of semantic web services and supporting composition environments. Yet, most environments consider only the execution of simple service behaviours and requires users to focus on technical details of the composition process. We propose a novel approach to the semantic composition of gene expression analysis services that addresses the shortcomings of the existing solutions. Our approach includes an architecture designed to support the service composition process for gene expression analysis, and a flexible strategy for the (semi) automatic composition of semantic web services. Finally, we implement a supporting platform called SemanticSCo to realize the proposed composition approach and demonstrate its functionality by successfully reproducing a microarray study documented in the literature. The SemanticSCo platform provides support for the composition of RESTful web services semantically annotated using SAWSDL. Our platform also supports the definition of constraints/conditions regarding the order in which service operations should be invoked, thus enabling the definition of complex service behaviours. Our proposed solution for semantic web service composition takes into account the requirements of different stakeholders and addresses all phases of the service composition process. It also provides support for the definition of analysis workflows at a high-level of abstraction, thus enabling users to focus on biological research issues rather than on the technical details of the composition process. The SemanticSCo source code is available at https://github.com/usplssb/SemanticSCo.

[1]  Sergei L. Kosakovsky Pond,et al.  An Evolutionary Model-Based Algorithm for Accurate Phylogenetic Breakpoint Mapping and Subtype Prediction in HIV-1 , 2009, PLoS Comput. Biol..

[2]  Eduardo Goncalves da Silva,et al.  User-centric Service Composition - Towards Personalised Service Composition and Delivery , 2011 .

[3]  J. Wolf Principles of transcriptome analysis and gene expression quantification: an RNA‐seq tutorial , 2013, Molecular ecology resources.

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  Christopher D. Town,et al.  SSWAP: A Simple Semantic Web Architecture and Protocol for semantic web services , 2009, BMC Bioinformatics.

[6]  Ulf Leser,et al.  Data Management Challenges in Next Generation Sequencing , 2012, Datenbank-Spektrum.

[7]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[8]  Jeremy Leipzig,et al.  A review of bioinformatic pipeline frameworks , 2016, Briefings Bioinform..

[9]  Rui F. Oliveira,et al.  Brain Transcriptomic Response to Social Eavesdropping in Zebrafish (Danio rerio) , 2015, PloS one.

[10]  Alexander R. Pico,et al.  Mining Biological Pathways Using WikiPathways Web Services , 2009, PloS one.

[11]  Oswaldo Trelles,et al.  jORCA and Magallanes Sailing Together towards Integration of Web Services , 2010, JBI.

[12]  H. Kitano,et al.  Software for systems biology: from tools to integrated platforms , 2011, Nature Reviews Genetics.

[13]  Gabriela D. A. Guardia,et al.  A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis , 2015, PloS one.

[14]  C. Ferrer‐Luque,et al.  Antimicrobial residual effects of irrigation regimens with maleic acid in infected root canals , 2015, Journal of Biological Research-Thessaloniki.

[15]  Terri K. Attwood,et al.  The EMBRACE web service collection , 2010, Nucleic Acids Res..

[16]  Rachel Pottinger,et al.  Semi-automatic web service composition for the life sciences using the BioMoby semantic web framework , 2008, J. Biomed. Informatics.

[17]  Steve Pettifer,et al.  Visualising biological data: a semantic approach to tool and database integration , 2009, BMC Bioinformatics.

[18]  D. G. Pinheiro,et al.  Bone Marrow Mesenchymal Stromal Cells Isolated from Multiple Sclerosis Patients have Distinct Gene Expression Profile and Decreased Suppressive Function Compared with Healthy Counterparts , 2015, Cell transplantation.

[19]  Ola Spjuth,et al.  Experiences with workflows for automating data-intensive bioinformatics , 2015, Biology Direct.

[20]  Michael Gruenberger,et al.  Anatomy ontologies and potential users: bridging the gap , 2011, J. Biomed. Semant..

[21]  Anna-Lena Lamprecht The Bio-jETI Framework , 2013 .

[22]  Donna K. Slonim,et al.  Getting Started in Gene Expression Microarray Analysis , 2009, PLoS Comput. Biol..

[23]  Jan Ramon,et al.  A new ensemble coevolution system for detecting HIV-1 protein coevolution , 2015, Biology Direct.

[24]  Luís Ferreira Pires,et al.  A-DynamiCoS: A Flexible Framework for User-centric Service Composition , 2012, 2012 IEEE 16th International Enterprise Distributed Object Computing Conference.

[25]  Takahiro Kawamura,et al.  Semantic Matching of Web Services Capabilities , 2002, SEMWEB.

[26]  Roberto Chinnici,et al.  Web Services Description Language (WSDL) Version 2.0 Part 1: Core Language , 2007 .

[27]  R. Z. Vêncio,et al.  Semantic integration of gene expression analysis tools and data sources using software connectors , 2013, BMC Genomics.

[28]  Feng Luo,et al.  Sesame: A new bioinformatics semantic workflow design system , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[29]  Peter J. Woolf,et al.  GAGE: generally applicable gene set enrichment for pathway analysis , 2009, BMC Bioinformatics.

[30]  Mark D. Wilkinson,et al.  The Semantic Automated Discovery and Integration (SADI) Web service Design-Pattern, API and Reference Implementation , 2011, J. Biomed. Semant..

[31]  Robert D. Finn,et al.  The European Bioinformatics Institute in 2016: Data growth and integration , 2015, Nucleic Acids Res..

[32]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[33]  Mireille Ducassé,et al.  Safe Suggestions Based on Type Convertibility to Guide Workflow Composition , 2015, ISMIS.

[34]  Jochen Göpfert,et al.  Geschäftsprozessmodellierung mit BPMN 2.0: Business Process Model and Notation , 2013 .

[35]  Mark D. Wilkinson,et al.  The Semantic Automated Discovery and Integration (SADI) Web service Design-Pattern, API and Reference Implementation , 2011 .

[36]  Paolo Romano,et al.  Automation of in-silico data analysis processes through workflow management systems , 2007, Briefings Bioinform..

[37]  C. Sanz-Lozano,et al.  Introduction to the Gene Expression Analysis. , 2016, Methods in molecular biology.

[38]  Paolo Falcarin,et al.  Service Creation in the SPICE Service Platform , 2006 .

[39]  D. Edwards,et al.  Bioinformatics tools and databases for analysis of next-generation sequence data. , 2012, Briefings in functional genomics.

[40]  Bruno Pot,et al.  Pseudomonas aeruginosa Population Structure Revisited , 2009, PloS one.

[41]  S. Aerts,et al.  Transcription factor MITF and remodeller BRG1 define chromatin organisation at regulatory elements in melanoma cells , 2015, eLife.

[42]  Rafael C. Jimenez,et al.  Data integration in biological research: an overview , 2015, Journal of Biological Research-Thessaloniki.

[43]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[44]  R. Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures (CHAPTER 5) , 2000 .

[45]  Mark D. Wilkinson,et al.  Semantically-Guided Workflow Construction in Taverna: The SADI and BioMoby Plug-Ins , 2010, ISoLA.

[46]  John A. Miller,et al.  Suggestions for Galaxy Workflow Design Using Semantically Annotated Services , 2012, FOIS.

[47]  Carole A. Goble,et al.  BioCatalogue: a universal catalogue of web services for the life sciences , 2010, Nucleic Acids Res..