Fastr: A Workflow Engine for Advanced Data Flows in Medical Image Analysis

With the increasing number of datasets encountered in imaging studies, the increasing complexity of processing workflows, and a growing awareness for data stewardship, there is a need for managed, automated workflows. In this paper we introduce Fastr, an automated workflow engine with support for advanced data flows. Fastr has built-in data provenance for recording processing trails and ensuring reproducible results. The extensible plugin-based design allows the system to interface with virtually any image archive and processing infrastructure. This workflow engine is designed to consolidate quantitative imaging biomarker pipelines in order to enable easy application to new data.

[1]  Michael J Ackerman,et al.  Engineering and algorithm design for an image processing Api: a technical report on ITK--the Insight Toolkit. , 2002, Studies in health technology and informatics.

[2]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[3]  C. Jack,et al.  Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI) , 2005, Alzheimer's & Dementia.

[4]  Emden R. Gansner,et al.  An open graph visualization system and its applications to software engineering , 2000 .

[5]  Tristan Glatard,et al.  CBRAIN: a web-based, distributed computing platform for collaborative neuroimaging research , 2014, Front. Neuroinform..

[6]  Thorsten Meinl,et al.  KNIME - the Konstanz information miner: version 2.0 and beyond , 2009, SKDD.

[7]  Arthur W Toga,et al.  The LONI Pipeline Processing Environment , 2003, NeuroImage.

[8]  J. Paul Morrison,et al.  Flow-Based Programming, 2nd Edition: A New Approach to Application Development , 2010 .

[9]  Max A. Viergever,et al.  elastix: A Toolbox for Intensity-Based Medical Image Registration , 2010, IEEE Transactions on Medical Imaging.

[10]  Edward A. Lee,et al.  CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 2000; 00:1–7 Prepared using cpeauth.cls [Version: 2002/09/19 v2.02] Taverna: Lessons in creating , 2022 .

[11]  D. Stott Parker,et al.  Neuroimaging Study Designs, Computational Analyses and Data Provenance Using the LONI Pipeline , 2010, PloS one.

[12]  Geert Jan Biessels,et al.  The heart-brain connection: a multidisciplinary approach targeting a missing link in the pathophysiology of vascular cognitive impairment. , 2014, Journal of Alzheimer's disease : JAD.

[13]  Wiro J. Niessen,et al.  The Rotterdam Scan Study: design and update up to 2012 , 2011, European Journal of Epidemiology.

[14]  Johan Montagnat,et al.  Flexible and Efficient Workflow Deployment of Data-Intensive Applications On Grids With MOTEUR , 2008, Int. J. High Perform. Comput. Appl..

[15]  John G. Csernansky,et al.  Open Access Series of Imaging Studies (OASIS): Cross-sectional MRI Data in Young, Middle Aged, Nondemented, and Demented Older Adults , 2007, Journal of Cognitive Neuroscience.

[16]  Daniel S. Marcus,et al.  The extensible neuroimaging archive toolkit , 2007, Neuroinformatics.

[17]  John Chilton,et al.  Common Workflow Language, v1.0 , 2016 .

[18]  Johan Montagnat,et al.  A data-driven workflow language for grids based on array programming principles , 2009, WORKS '09.

[19]  Satrajit S. Ghosh,et al.  Nipype: A Flexible, Lightweight and Extensible Neuroimaging Data Processing Framework in Python , 2011, Front. Neuroinform..