Automated data reduction workflows for astronomy

Data from complex modern astronomical instruments often consist of a large number of different science and calibration files, and their reduction requires a variety of software tools. The execution chain of the tools represents a complex workflow that needs to be tuned and supervised, often by individual researchers that are not necessarily experts for any specific instrument. The efficiency of data reduction can be improved by using automatic workflows to organise data and execute the sequence of data reduction steps. To realize such efficiency gains, we designed a system that allows intuitive representation, execution and modification of the data reduction workflow, and has facilities for inspection and interaction with the data. The European Southern Observatory (ESO) has developed Reflex, an environment to automate data reduction workflows. Reflex is implemented as a package of customized components for the Kepler workflow engine. Kepler provides the graphical user interface to create an executable flowchart-like representation of the data reduction process. Key features of Reflex are a rule-based data organiser, infrastructure to re-use results, thorough book-keeping, data progeny tracking, interactive user interfaces, and a novel concept to exploit information created during data organisation for the workflow execution. Reflex includes novel concepts to increase the efficiency of astronomical data processing. While Reflex is a specific implementation of astronomical scientific workflows within the Kepler workflow engine, the overall design choices and methods can also be applied to other environments for running automated science workflows.

[1]  G. A. Renting,et al.  Astronomical Data Analysis Software and Systems XXI , 2012 .

[2]  B. Garilli,et al.  F-VIPGI: a new adapted version of VIPGI for FORS2 spectroscopy. Application to a sample of 16 X-ray selected galaxy clusters at 0.6 < z < 1.2 , 2013, 1301.2327.

[3]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[4]  M. J. Currie,et al.  The ORAC-DR data reduction pipeline , 2008 .

[5]  A. Mazure,et al.  The VVDS Data‐Reduction Pipeline: Introducing VIPGI, the VIMOS Interactive Pipeline and Graphical Interface , 2004, astro-ph/0409248.

[6]  Abhishek Tiwari,et al.  Workflow based framework for life science informatics , 2007, Comput. Biol. Chem..

[7]  Matthew B. Jones,et al.  A metadata-driven approach to loading and querying heterogeneous scientific data , 2010, Ecol. Informatics.

[8]  Jean-Luc Starck,et al.  Astronomical Data Analysis , 2007 .

[9]  Andrea Modigliani,et al.  The common pipeline library: standardizing pipeline processing , 2004, SPIE Astronomical Telescopes + Instrumentation.

[10]  B. Yanny,et al.  The Sloan Digital Sky Survey monitor telescope pipeline , 2006 .

[11]  T. A. Lister,et al.  RoboNet-II: Follow-up observations of microlensing events with a robotic network of telescopes , 2008, 0808.0813.

[12]  C. Leitherer,et al.  Calibrating Hubble Space Telescope. POST Servicing Mission. , 1995 .

[13]  Edwin A. Valentijn,et al.  The Astro-WISE optical image pipeline , 2011, Experimental Astronomy.

[14]  C. Garino,et al.  Computational Biology and Chemistry , 2015 .

[15]  P. S. Bunclark,et al.  Astronomical Data Analysis Software and Systems , 2008 .

[16]  Preben Grosbol,et al.  MIDAS - ESO's new image processing system , 1983 .