Annotating scientific data: why it is important and why it is difficult

Annotation of existing data is becoming a standard tool in many branches of e-science. Increasingly, databases are being built to receive annotation, and other tools are being developed to annotate existing databases. Annotation is becoming an important part of communication among scientists. In this paper we review various kinds of annotation systems and describe the importance of designing databases in such a way that they can receive annotation. This includes designing extensible databases and providing some form of co-ordinate system for the attachment of annotations.

1 Annotation: adding to existing structure
