TogoTable: cross-database annotation system using the Resource Description Framework (RDF) data model

TogoTable (http://togotable.dbcls.jp/) is a web tool that adds user-specified annotations to a user-uploaded table. The annotations are drawn from several biological databases that use the Resource Description Framework (RDF) data model, and TogoTable uses the database identifiers (IDs) in the table as query keys. RDF data, which form a network called Linked Open Data (LOD), can be queried through SPARQL endpoints using the SPARQL query language. Because TogoTable uses RDF, it can integrate annotations not only from the reference database to which the IDs originally belong, but also from externally linked databases reachable via the LOD network. For example, annotations in the Protein Data Bank can be retrieved using a GeneID through links provided by the UniProt RDF. Because RDF has been standardized by the World Wide Web Consortium, any database whose annotations are based on the RDF data model can be easily incorporated into this tool. We believe that TogoTable is a valuable web tool, particularly for experimental biologists who need to process large volumes of data, such as the output of high-throughput experiments.
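The ID-keyed lookup described above can be illustrated with a minimal Python sketch. It is not TogoTable's implementation: the function names, the tab-separated input, the `identifiers.org` IRI prefix, and the choice of `rdfs:label` as the annotation predicate are all assumptions made for illustration. The sketch only builds the SPARQL query text and merges already-retrieved results into the table; sending the query to an endpoint is left out.

```python
import csv
import io
import re

# Assumed for illustration only: an IRI prefix for gene IDs and one annotation predicate.
ID_PREFIX = "http://identifiers.org/ncbigene/"
LABEL_PREDICATE = "http://www.w3.org/2000/01/rdf-schema#label"

# Accept only simple ID tokens so that table content cannot alter the query structure.
SAFE_ID = re.compile(r"^[A-Za-z0-9_.:-]+$")


def read_ids(table_text, id_column):
    """Collect the distinct, non-empty IDs from one column of a tab-separated table."""
    reader = csv.DictReader(io.StringIO(table_text), delimiter="\t")
    ids = []
    for row in reader:
        value = (row.get(id_column) or "").strip()
        if value and value not in ids:
            ids.append(value)
    return ids


def build_query(ids, predicate=LABEL_PREDICATE, prefix=ID_PREFIX):
    """Build a SPARQL SELECT that fetches one predicate for every ID in a single request."""
    for identifier in ids:
        if not SAFE_ID.match(identifier):
            raise ValueError(f"unsupported ID: {identifier!r}")
    values = " ".join(f"<{prefix}{identifier}>" for identifier in ids)
    return (
        "SELECT ?entry ?annotation WHERE {\n"
        f"  VALUES ?entry {{ {values} }}\n"
        f"  ?entry <{predicate}> ?annotation .\n"
        "}"
    )


def add_column(table_text, id_column, bindings, column_name="annotation", prefix=ID_PREFIX):
    """Append an annotation column; `bindings` is a list of (entry IRI, value) pairs."""
    lookup = {}
    for entry, value in bindings:
        lookup.setdefault(entry, []).append(value)
    rows = list(csv.DictReader(io.StringIO(table_text), delimiter="\t"))
    for row in rows:
        key = prefix + (row.get(id_column) or "").strip()
        # Rows whose ID has no result get an empty cell; multiple values are joined.
        row[column_name] = "; ".join(lookup.get(key, []))
    return rows
```

Retrieving annotations from an externally linked database would, under the same assumptions, amount to adding a further triple pattern to the query that follows a cross-reference link from `?entry` to the linked resource.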
