AstroDAS: Sharing Assertions Across Astronomy Catalogues Through Distributed Annotation

As diverse scientific data collections migrate online, researchers want to share their assertions about the entities that span these disparate databases. We focus on a case study provided by the astronomical community's Virtual Observatory effort, and investigate the use of annotation to record and share the celestial object mappings asserted by different research groups. The prototype of our Astronomy Distributed Annotation System (AstroDAS) complements the existing OpenSkyQuery tools for federated database queries. It provides web service methods that allow clients to create mapping annotations and store them as relational database tuples on annotation servers. We expect that the mechanisms for creating and querying annotations in AstroDAS can be extended to tasks other than entity mapping, and to other domains with relational data sources.
