UBioLab: a web-laboratory for ubiquitous in-silico experiments.

The huge and dynamic amount of bioinformatic resources (e.g., data and tools) available nowadays in Internet represents a big challenge for biologists –for what concerns their management and visualization– and for bioinformaticians –for what concerns the possibility of rapidly creating and executing in-silico experiments involving resources and activities spread over the WWW hyperspace. Any framework aiming at integrating such resources as in a physical laboratory has imperatively to tackle –and possibly to handle in a transparent and uniform way– aspects concerning physical distribution, semantic heterogeneity, co-existence of different computational paradigms and, as a consequence, of different invocation interfaces (i.e., OGSA for Grid nodes, SOAP for Web Services, Java RMI for Java objects, etc.). The framework UBioLab has been just designed and developed as a prototype following the above objective. Several architectural features –as those ones of being fully Web-based and of combining domain ontologies, Semantic Web and workflow techniques– give evidence of an effort in such a direction. The integration of a semantic knowledge management system for distributed (bioinformatic) resources, a semantic-driven graphic environment for defining and monitoring ubiquitous workflows and an intelligent agent-based technology for their distributed execution allows UBioLab to be a semantic guide for bioinformaticians and biologists providing (i) a flexible environment for visualizing, organizing and inferring any (semantics and computational) "type" of domain knowledge (e.g., resources and activities, expressed in a declarative form), (ii) a powerful engine for defining and storing semantic-driven ubiquitous in-silico experiments on the domain hyperspace, as well as (iii) a transparent, automatic and distributed environment for correct experiment executions.

[1]  Carole A. Goble,et al.  State of the nation in data integration for bioinformatics , 2008, J. Biomed. Informatics.

[2]  Ezio Bartocci,et al.  Enacting Proactive Workflows Engine in e-Science , 2006, International Conference on Computational Science.

[3]  David J. Reiss,et al.  The Gaggle: An open-source software system for integrating bioinformatics software and data sources , 2006, BMC Bioinformatics.

[4]  Maria-Esther Vidal,et al.  Path-based Systems to Guide Scientists in the Maze of Biological Data Sources , 2006, J. Bioinform. Comput. Biol..

[5]  Andreas D Baxevanis,et al.  Searching NCBI Databases Using Entrez , 2004, Current protocols in bioinformatics.

[6]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[7]  Arthur W. Toga,et al.  iTools: A Framework for Classification, Categorization and Integration of Computational Biology Resources , 2008, PloS one.

[8]  Ezio Bartocci,et al.  Model driven design and implementation of activity-based applications in Hermes , 2006, WOA.

[9]  Mario Cannataro,et al.  MS-Analyzer: Intelligent Preprocessing, Management, and Data Mining Analysis of Mass Spectrometry Data on the Grid , 2005, 2005 First International Conference on Semantics, Knowledge and Grid.

[10]  David Maier,et al.  From databases to dataspaces: a new abstraction for information management , 2005, SGMD.

[11]  Ezio Bartocci,et al.  An XML View of the "World" , 2003, ICEIS.

[12]  Andrzej Cichocki,et al.  Migrating workflows and their transactional properties , 1998 .

[13]  Bertram Ludäscher,et al.  Kepler: an extensible system for design and execution of scientific workflows , 2004 .

[14]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[15]  L. Milanesi,et al.  An Agent-Based Multilayer Architecture for Bioinformatics Grids , 2007, IEEE Transactions on NanoBioscience.

[16]  Nicholas R. Jennings,et al.  The Semantic Grid: A Future e‐Science Infrastructure , 2003 .

[17]  Andrea Manconi,et al.  A Survey on Integrating Data in Bioinformatics , 2011, Learning Structure and Schemas from Documents.

[18]  Chris T. A. Evelo,et al.  The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services , 2010, BMC Bioinformatics.

[19]  Kei-Hoi Cheung,et al.  Data Integration in Bioinformatics: Current Efforts and Challenges , 2011 .

[20]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[21]  David Maier,et al.  Principles of dataspace systems , 2006, PODS '06.

[22]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[23]  Stephen A. Jarvis,et al.  Localised workload management using performance prediction and QoS contracts , 2002 .

[24]  Sung Lee,et al.  Bio-STEER: A Semantic Web workflow tool for Grid computing in the life sciences , 2007, Future Gener. Comput. Syst..

[25]  Alfonso Valencia,et al.  Interoperability with Moby 1.0--it's better than sharing your toothbrush! , 2008, Briefings in bioinformatics.

[26]  L. Stein Integrating biological databases , 2003, Nature Reviews Genetics.

[27]  Subhash Saini,et al.  GridFlow: workflow management for grid computing , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[28]  Emanuela Merelli,et al.  Hermes: Agent-Based Middleware for Mobile Computing , 2005, SFM.

[29]  Emanuela Merelli,et al.  Resourceome: A Multilevel Model and a Semantic Web Tool for Managing Domain and Operational Knowledge , 2009, 2009 Third International Conference on Advances in Semantic Processing.

[30]  Subhash Saini,et al.  ARMS: An agent-based resource management system for grid computing , 2002, Sci. Program..

[31]  Jérôme Gouzy,et al.  REMORA: a pilot in the ocean of BioMoby web-services , 2006, Bioinform..

[32]  Ronald H. Perrott,et al.  GeneGrid: grid based solution for bioinformatics application integration and experiment execution , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[33]  Fausto Giunchiglia,et al.  Towards Knowledge in the Cloud , 2008, OTM Workshops.

[34]  Graham R. Nudd,et al.  Pace—A Toolset for the Performance Prediction of Parallel and Distributed Systems , 2000, Int. J. High Perform. Comput. Appl..

[35]  Lincoln D. Stein,et al.  Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges , 2008, Nature Reviews Genetics.

[36]  Ezio Bartocci,et al.  Building a MultiAgent System from a User Workflow Specification , 2006, WOA.

[37]  Ezio Bartocci,et al.  BioWMS: a web-based Workflow Management System for bioinformatics , 2007, BMC Bioinformatics.

[38]  Paolo Romano,et al.  Automation of in-silico data analysis processes through workflow management systems , 2007, Briefings Bioinform..

[39]  Carole A. Goble,et al.  myGrid: personalised bioinformatics on the information grid , 2003, ISMB.

[40]  P. Argos,et al.  SRS: information retrieval system for molecular biology data banks. , 1996, Methods in enzymology.

[41]  L. Stein Creating a bioinformatics nation , 2002, Nature.