Establishing a distributed system for the simple representation and integration of diverse scientific assertions

BackgroundInformation technology has the potential to increase the pace of scientific progress by helping researchers in formulating, publishing and finding information. There are numerous projects that employ ontologies and Semantic Web technologies towards this goal. However, the number of applications that have found widespread use among biomedical researchers is still surprisingly small. In this paper we present the aTag (‘associative tags’) convention, which aims to drastically lower the entry barriers to the biomedical Semantic Web. aTags are short snippets of HTML+RDFa with embedded RDF/OWL based on the Semantically Interlinked Online Communities (SIOC) vocabulary and domain ontologies and taxonomies, such as the Open Biomedical Ontologies and DBpedia. The structure of aTags is very simple: a short piece of human-readable text that is ‘tagged’ with relevant ontological entities. This paper describes our efforts for seeding the creation of a viable ecosystem of datasets, tools and services around aTags.ResultsNumerous biomedical datasets in aTag format and systems for the creation of aTags have been set-up and are described in this paper. Prototypes of some of these systems are accessible at http://hcls.deri.org/atagConclusionsThe aTags convention enables the rapid development of diverse, integrated datasets and semantically interoperable applications. More work needs to be done to study the practicability of this approach in different use-case scenarios, and to encourage uptake of the convention by other groups.

[1]  Dietrich Rebholz-Schuhmann,et al.  Text processing through Web services: calling Whatizit , 2008, Bioinform..

[2]  Alfonso Valencia,et al.  Implementing the iHOP concept for navigation of biomedical literature , 2005, ECCB/JBI.

[3]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[4]  Dietrich Rebholz-Schuhmann,et al.  MedEvi: Retrieving textual evidence of relations between biomedical concepts from Medline , 2008, Bioinform..

[5]  Alan Ruttenberg,et al.  Life sciences on the Semantic Web: the Neurocommons and beyond , 2009, Briefings Bioinform..

[6]  David Wong,et al.  Web Applications , 2002 .

[7]  Eyal Oren,et al.  Sindice.com: Weaving the Open Linked Data , 2007, ISWC/ASWC.

[8]  Kei-Hoi Cheung,et al.  Entrez Neuron RDFa: A Pragmatic Semantic Web Application for Data Integration in Neuroscience Research , 2009, MIE.

[9]  Simon Buckingham Shum,et al.  Hypotheses, evidence and relationships: The HypER approach for representing scientific knowledge claims , 2009, ISWC 2009.

[10]  R. Hoffmann A wiki for the life sciences where authorship matters , 2008, Nature Genetics.

[11]  Giulio Superti-Furga Finally: The digital, democratic age of scientific abstracts , 2008, FEBS letters.

[12]  M A Musen,et al.  BioPortal: ontologies and data resources with the click of a mouse. , 2008, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[13]  Mark A. Musen,et al.  The Open Biomedical Annotator , 2009, Summit on translational bioinformatics.

[14]  Alan Ruttenberg,et al.  The SWAN biomedical discourse ontology , 2008, J. Biomed. Informatics.

[15]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[16]  A. Rector,et al.  Relations in biomedical ontologies , 2005, Genome Biology.

[17]  Holger Stenzhorn,et al.  Das Semantic Web als Werkzeug in der biomedizinischen Forschung , 2009, Social Semantic Web.