Seamless Utilization of Heterogeneous XSede Resources to Accelerate Processing of a High Value Functional Neuroimaging Dataset

We describe the technical effort used to process a voluminous high value human neuroimaging dataset on the Open Science Grid with opportunistic use of idle HPC resources to boost computing capacity more than 5-fold. With minimal software development effort and no discernable competitive interference with other HPC users, this effort delivered 15,000,000 core hours over 7 months. The CamCAN Lifespan Neuroimaging Dataset [1,2], Cambridge (UK) Centre for Ageing and Neuroscience, was acquired and processed beginning in December 2016. The referee consensus solver deployed to the Open Science Grid [3,4] was used for this task. The dataset includes demographic and screening measures, a high-resolution MRI scan of the brain, and whole-head magnetoencephalographic (MEG) recordings during eyes closed rest (560 sec), a simple task (540 sec), and passive listening/viewing (140 sec). The data were collected from 619 neurologically normal individuals, ages 18-87. The processed results from both resting and task recordings are completed and available for download at http://stash.osgconnect.net/+krieger/. These constitute ≈3.0 TBytes of data including the location within the brain (1 mm resolution), time stamp (1 msec resolution), and 80 msec time course for each of 3.7 billion validated neuroelectric events, i.e. mean 6.1 million events for each of the 619 participants. Primary processing utilized the referee consensus solver deployed on the Open Science Grid (OSG). The pool of computing elements used by the solver was boosted using XSede allocations on Bridges (Pittsburgh Supercomputing Center) and Comet (San Diego Supercomputing Center). Computing elements were added directly to the OSG's HTcondor pool. This required no changes to the solver and only minor changes to the workflow management software which was fully operational in two days. HTcondor's glidein mechanism was used to add the booster computing elements to the HTcondor pool. A script was deployed on each machine which queues Glidein's only when there are idle computing elements. Each Glidein's lifetime is limited to two hours, insuring that these jobs do not significantly delay other users of these important supercomputing resources. This opportunistic use of Brides and Comet delivered more than a 3-fold increase in the number of core hours available for this effort.

[1]  J. Sarvas Basic mathematical and electromagnetic concepts of the biomagnetic inverse problem. , 1987, Physics in medicine and biology.

[2]  Aviva Abosch,et al.  Prediction of STN-DBS Electrode Implantation Track in Parkinson's Disease by Using Local Field Potentials , 2016, Front. Neurosci..

[3]  William D. Marslen-Wilson,et al.  The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: a cross-sectional, lifespan, multidisciplinary examination of healthy cognitive ageing , 2014, BMC Neurology.

[4]  Cam-CAN Group,et al.  The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample , 2017, NeuroImage.

[5]  Walter Schneider,et al.  Referee consensus: a platform technology for nonlinear optimization , 2013, XSEDE.

[6]  M. Bertini,et al.  Sleep in the Human Hippocampus: A Stereo-EEG Study , 2007, PloS one.

[7]  Ronald R. Willis,et al.  Software quality engineering: a total technical and management approach , 1988 .

[8]  D. Lehmann,et al.  Low resolution electromagnetic tomography: a new method for localizing electrical activity in the brain. , 1994, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[9]  J. Vrba,et al.  Signal processing in magnetoencephalography. , 2001, Methods.

[10]  Jorge Luis Rodriguez,et al.  The Open Science Grid , 2005 .

[11]  R. Ilmoniemi,et al.  Interpreting magnetic fields of the brain: minimum norm estimates , 2006, Medical and Biological Engineering and Computing.

[12]  Nadim Joni Shah,et al.  The relationship between BOLD fMRI response and the underlying white matter as measured by fractional anisotropy (FA): A systematic review , 2017, NeuroImage.

[13]  Igor Sfiligoi,et al.  The Pilot Way to Grid Resources Using glideinWMS , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[14]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[15]  Mats Rynge,et al.  The OSG open facility: A sharing ecosystem , 2015 .

[16]  Yousef Qan’ir,et al.  Evidence Based Practice: Transcranial Magnetic Stimulation for Major Depression , 2016 .

[17]  A. Dale,et al.  Cortical Surface-Based Analysis II: Inflation, Flattening, and a Surface-Based Coordinate System , 1999, NeuroImage.