Sub-image data processing in Astro-WISE

Astronomers are most often interested in a single source (e.g., one that is moving, variable, or extreme in some colour index) that occupies only a few pixels of an image. The classical approach in astronomical data processing, however, is to process the entire image or set of images, even when the sole source of interest lies on only a few pixels of one or a few images. This is because pipelines have been designed and written for instruments with fixed detector properties (e.g., image size, calibration frames, and overscan regions), and all metadata and processing parameters are defined per instrument or detector. For a survey comprising many thousands of images, this can lead to unnecessary processing that is both time-consuming and wasteful. We describe the architecture and an implementation of sub-image processing in Astro-WISE. The architecture enables a user to select, retrieve, and process only the relevant pixels of an image, namely those on which the source lies. We show that lineage data collected during the processing and analysis of datasets can be reused to reprocess datasets selectively (at the sub-image level) while leaving the remainder of the dataset untouched, a process that is difficult to automate without lineage.
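
The idea of processing only the pixels around a source can be illustrated with a minimal NumPy sketch. This is not the Astro-WISE implementation or API: the function names (`cutout_bounds`, `process_subimage`, `subtract_local_median`), the square cutout, and the toy background-subtraction step are all assumptions made for illustration.

```python
import numpy as np


def cutout_bounds(shape, x, y, half_size):
    """Return (row, column) slices for a square region centred on pixel
    (x, y), clipped to the image edges. Hypothetical helper."""
    ny, nx = shape
    y0, y1 = max(y - half_size, 0), min(y + half_size + 1, ny)
    x0, x1 = max(x - half_size, 0), min(x + half_size + 1, nx)
    return slice(y0, y1), slice(x0, x1)


def process_subimage(image, x, y, half_size, step):
    """Apply `step` only to the pixels around (x, y).

    All other pixels are copied through unchanged, and the region is
    returned so that it could be recorded alongside the result.
    """
    region = cutout_bounds(image.shape, x, y, half_size)
    result = image.copy()
    result[region] = step(image[region])
    return result, region


def subtract_local_median(pixels):
    """Toy processing step (an assumption): remove the local background."""
    return pixels - np.median(pixels)


# Synthetic 100x100 frame with a flat background and one bright source.
image = np.full((100, 100), 10.0)
image[40, 60] = 110.0  # source at x=60, y=40

processed, region = process_subimage(
    image, x=60, y=40, half_size=2, step=subtract_local_median
)
```

Only the 5x5 region around the source is modified. Returning `region` together with the result hints at how a record of which pixels were processed might be kept for later selective reprocessing, although the lineage mechanism described here is considerably richer than this sketch.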
