Quantitative cost comparison of on-premise and cloud infrastructure based EEG data processing

High-density, high-sampling rate EEG measurements generate large amounts of measurement data. When coupled with sophisticated processing methods, this presents a storage, computation and system management challenge for research groups and clinical units. Commercial cloud providers offer remote storage and on-demand compute infrastructure services that seem ideal for outsourcing the usually burst-like EEG processing workflow execution. There is little available guidance, however, on whether or when users should migrate to the cloud. The objective of this paper is to investigate the factors that determine the costs of on-premises and cloud execution of EEG workloads, and compare their total costs of ownership. An analytical cost model is developed that can be used for making informed decisions about the long-term costs of on-premises and cloud infrastructures. The model includes the cost-critical factors of the computing systems under evaluation, and expresses the effects of length of usage, system size, computational and storage capacity needs. Detailed cost models are created for on-premises clusters and cloud systems. Using these models, the costs of execution and data storage on clusters and in the cloud are investigated in detail, followed by a break-even analysis to determine when the use of an on-demand cloud infrastructure is preferable to on-premises clusters. The cost models presented in this paper help to characterise the cost-critical infrastructure and execution factors, and can support decision-makers in various scenarios. The analyses showed that cloud-based EEG data processing can reduce execution time considerably and is, in general, more economical when the computational and data storage requirements are relatively low. The cloud becomes competitive even in heavy load case scenarios if expensive, high quality, high-reliability clusters would be used locally. While the paper focuses on EEG processing, the models can be easily applied to CT, MRI, fMRI based neuroimaging workflows as well, which can provide guidance to the wider neuroimaging community for making infrastructure decisions.

[1]  Erwin Laure,et al.  Parallel Real Time Seizure Detection in Large EEG Data , 2016, IoTBD.

[2]  William Gropp,et al.  MPI: The Complete Reference , Vol. 2 - The MPI-2 Extensions , 1998 .

[3]  Claus-Peter Richter,et al.  Evaluation of Electroencephalography Source Localization Algorithms with Multiple Cortical Sources , 2016, PloS one.

[4]  Peter J. Tonellato,et al.  Biomedical Cloud Computing With Amazon Web Services , 2011, PLoS Comput. Biol..

[5]  Shaun A. Hussain,et al.  Visual and semi-automatic non-invasive detection of interictal fast ripples: A potential biomarker of epilepsy in children with tuberous sclerosis complex , 2018, Clinical Neurophysiology.

[6]  Saeed Mian Qaisar,et al.  Cloud-based mobile platform for EEG signal analysis , 2016, 2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA).

[7]  Radu Sion,et al.  To cloud or not to cloud?: musings on costs and viability , 2011, SOCC '11.

[8]  Allen D. Malony,et al.  Parallel ICA methods for EEG neuroimaging , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[9]  Rajkumar Buyya,et al.  Pricing Cloud Compute Commodities: A Novel Financial Economic Model , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[10]  Miron Livny,et al.  The cost of doing science on the cloud: The Montage example , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[11]  Katsuhiro Kobayashi,et al.  Significance of High-frequency Electrical Brain Activity. , 2017, Acta medica Okayama.

[12]  Surya Ganguli,et al.  On simplicity and complexity in the brave new world of large-scale neuroscience , 2015, Current Opinion in Neurobiology.

[13]  Anand Sivasubramaniam,et al.  To Move or Not to Move: The Economics of Cloud Computing , 2011, HotCloud.

[14]  Dimitrios I. Fotiadis,et al.  Epileptic Seizure Detection in EEGs Using Time–Frequency Analysis , 2009, IEEE Transactions on Information Technology in Biomedicine.

[15]  L. Dagum,et al.  OpenMP: an industry standard API for shared-memory programming , 1998 .

[16]  Anand Sivasubramaniam,et al.  Cloudy with a Chance of Cost Savings , 2013, IEEE Transactions on Parallel and Distributed Systems.

[17]  Michael Breakspear,et al.  Exploiting the potential of three dimensional spatial wavelet analysis to explore nesting of temporal oscillations and spatial variance in simultaneous EEG-fMRI data. , 2011, Progress in biophysics and molecular biology.

[18]  Walter Brisken,et al.  To Lease or Not to Lease from Storage Clouds , 2010, Computer.

[19]  Andy B. Yoo,et al.  Approved for Public Release; Further Dissemination Unlimited X-ray Pulse Compression Using Strained Crystals X-ray Pulse Compression Using Strained Crystals , 2002 .

[20]  Han Yuan,et al.  Spatiotemporal dynamics of the brain at rest — Exploring EEG microstates as electrophysiological signatures of BOLD resting state networks , 2012, NeuroImage.

[21]  Dimitri Van De Ville,et al.  The dynamic functional connectome: State-of-the-art and perspectives , 2017, NeuroImage.

[22]  Hossein Pourreza,et al.  Heads in the Cloud: A Primer on Neuroimaging Applications of High Performance Computing , 2016, Magnetic resonance insights.

[23]  U. Rajendra Acharya,et al.  Automated EEG analysis of epilepsy: A review , 2013, Knowl. Based Syst..

[24]  Yu-Te Wang,et al.  Pervasive brain monitoring and data sharing based on multi-tier distributed computing and linked data technology , 2014, Front. Hum. Neurosci..

[25]  Don M. Tucker,et al.  Localizing Acute Stroke-related EEG Changes:: Assessing the Effects of Spatial Undersampling , 2001, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[26]  Hong Bao,et al.  GPGPU-Aided Ensemble Empirical-Mode Decomposition for EEG Analysis During Anesthesia , 2010, IEEE Transactions on Information Technology in Biomedicine.

[27]  Bart Vanrumste,et al.  Journal of Neuroengineering and Rehabilitation Open Access Review on Solving the Inverse Problem in Eeg Source Analysis , 2022 .

[28]  Mohammad S. Eslampanah Sendi,et al.  A Spark-based Analytic Pipeline for Seizure Detection in EEG Big Data Streams , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[29]  William Gropp The MPI-2 extensions , 1998 .

[30]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[31]  Li Wang,et al.  NeuroPigPen: A Scalable Toolkit for Processing Electrophysiological Signal Data in Neuroscience Applications Using Apache Pig , 2016, Front. Neuroinform..

[32]  J. Fermaglich Electric Fields of the Brain: The Neurophysics of EEG , 1982 .

[33]  Allen D. Malony,et al.  Concurrency in electrical neuroinformatics: parallel computation for studying the volume conduction of brain electrical fields in human head tissues , 2016, Concurr. Comput. Pract. Exp..

[34]  G. Buzsáki,et al.  Neuronal Oscillations in Cortical Networks , 2004, Science.

[35]  Thomas J. Grabowski,et al.  Running Neuroimaging Applications on Amazon Web Services: How, When, and at What Cost? , 2017, Front. Neuroinform..

[36]  Christoph M. Michel,et al.  Head model and electrical source imaging: A study of 38 epileptic patients☆ , 2014, NeuroImage: Clinical.

[37]  Takashi Kawashima,et al.  Mapping brain activity at scale with cluster computing , 2014, Nature Methods.

[38]  Shrideep Pallickara,et al.  Analyzing Electroencephalograms Using Cloud Computing Techniques , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[39]  Mikael Persson,et al.  Evaluation of a finite-element reciprocity method for epileptic EEG source localization: Accuracy, computational complexity and noise robustness , 2013 .

[40]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[41]  Petr Jezek,et al.  Cloud Infrastructure for Storing and Processing EEG and ERP Experimental Data , 2019, ICT4AWE.

[42]  D. Tucker,et al.  EEG source localization: Sensor density and head surface coverage , 2015, Journal of Neuroscience Methods.

[43]  Jaakko Malmivuo,et al.  Effect of measurement noise and electrode density on the spatial resolution of cortical potential distribution with different resistivity values for the skull , 2006, IEEE Transactions on Biomedical Engineering.

[44]  Yiannakis Sazeides,et al.  An analytical framework for estimating TCO and exploring data center design space , 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[45]  William Gropp,et al.  Mpi - The Complete Reference: Volume 2, the Mpi Extensions , 1998 .

[46]  Jack Dongarra,et al.  MPI: The Complete Reference , 1996 .

[47]  Ramesh Srinivasan,et al.  Estimating the spatial Nyquist of the human EEG , 1998 .

[48]  Konrad Paul Kording,et al.  To the Cloud! A Grassroots Proposal to Accelerate Brain Science Discovery , 2016, Neuron.

[49]  Benjamin H. Brinkmann,et al.  Large-scale electrophysiology: Acquisition, compression, encryption, and storage of big data , 2009, Journal of Neuroscience Methods.

[50]  Ravishankar K. Iyer,et al.  Integrating artificial intelligence with real-time intracranial EEG monitoring to automate interictal identification of seizure onset zones in focal epilepsy , 2018, Journal of neural engineering.

[51]  Rajiv Ranjan,et al.  Parallel Processing of Massive EEG Data with MapReduce , 2012, 2012 IEEE 18th International Conference on Parallel and Distributed Systems.

[52]  SalmanAdnan,et al.  Concurrency in electrical neuroinformatics , 2016 .

[53]  Brian Litt,et al.  Science in the cloud (SIC): A use case in MRI connectomics , 2016, GigaScience.

[54]  Edward Walker,et al.  The Real Cost of a CPU Hour , 2009, Computer.

[55]  Hasan Ocak,et al.  Automatic detection of epileptic seizures in EEG using discrete wavelet transform and approximate entropy , 2009, Expert Syst. Appl..

[56]  Vince D. Calhoun,et al.  Moving Beyond ERP Components: A Selective Review of Approaches to Integrate EEG and Behavior , 2018, Front. Hum. Neurosci..

[57]  Mohamed Adel Serhani,et al.  New algorithms for processing time-series big EEG data within mobile health monitoring systems , 2017, Comput. Methods Programs Biomed..

[58]  Terrence J. Sejnowski,et al.  Enhanced detection of artifacts in EEG data using higher-order statistics and independent component analysis , 2007, NeuroImage.