Low-power manycore accelerator for personalized biomedical applications

Wearable personal health monitoring systems can offer a cost effective solution for human healthcare. These systems must provide both highly accurate, secured and quick processing and delivery of vast amount of data. In addition, wearable biomedical devices are used in inpatient, outpatient, and at home e-Patient care that must constantly monitor the patient's biomedical and physiological signals 24/7. These biomedical applications require sampling and processing multiple streams of physiological signals with strict power and area footprint. The processing typically consists of feature extraction, data fusion, and classification stages that require a large number of digital signal processing and machine learning kernels. In response to these requirements, in this paper, a low-power, domain-specific manycore accelerator named Power Efficient Nano Clusters (PENC) is proposed to map and execute the kernels of these applications. Experimental results show that the manycore is able to reduce energy consumption by up to 80% and 14% for DSP and machine learning kernels, respectively, when optimally parallelized. The performance of the proposed PENC manycore when acting as a coprocessor to an Intel Atom processor is compared with existing commercial off-the-shelf embedded processing platforms including Intel Atom, Xilinx Artix-7 FPGA, and NVIDIA TK1 ARM-A15 with GPU SoC. The results show that the PENC manycore architecture reduces the energy by as much as 10X while outperforming all off-the-shelf embedded processing platforms across all studied machine learning classifiers.

[1]  Naveen Verma,et al.  A Low-Power Processor With Configurable Embedded Machine-Learning Accelerators for High-Order and Adaptive Analysis of Medical-Sensor Signals , 2013, IEEE Journal of Solid-State Circuits.

[2]  Maysam Ghovanloo,et al.  Live demonstration: Towards an ultra low power on-board processor for Tongue Drive System , 2015, 2015 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[3]  Tinoosh Mohsenin,et al.  A many-core platform implemented for multi-channel seizure detection , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[4]  Dean M. Tullsen,et al.  Harnessing ISA diversity: Design of a heterogeneous-ISA chip multiprocessor , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[5]  Maysam Ghovanloo,et al.  Toward an Ultralow-Power Onboard Processor for Tongue Drive System , 2015, IEEE Transactions on Circuits and Systems II: Express Briefs.

[6]  Hassan Ghasemzadeh,et al.  Ultra low-power signal processing in wearable monitoring systems: A tiered screening architecture with optimal bit resolution , 2013, TECS.

[7]  Tinoosh Mohsenin,et al.  A low power seizure detection processor based on direct use of compressively-sensed data and employing a deterministic random matrix , 2015, 2015 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[8]  Houman Homayoun,et al.  A 64-core platform for biomedical signal processing , 2013, International Symposium on Quality Electronic Design (ISQED).

[9]  Tim Oates,et al.  A Flexible Multichannel EEG Feature Extractor and Classifier for Seizure Detection , 2015, IEEE Transactions on Circuits and Systems II: Express Briefs.

[10]  Ray-Jade Chen,et al.  A sub-100µW multi-functional cardiac signal processor for mobile healthcare applications , 2012, 2012 Symposium on VLSI Circuits (VLSIC).

[11]  Mario Konijnenburg,et al.  ULP-SRP: Ultra low power Samsung Reconfigurable Processor for biomedical applications , 2012, 2012 International Conference on Field-Programmable Technology.

[12]  Tinoosh Mohsenin,et al.  Sketching-based high-performance biomedical big data processing accelerator , 2016, 2016 IEEE International Symposium on Circuits and Systems (ISCAS).

[13]  Tim Oates,et al.  An ultra low power feature extraction and classification system for wearable seizure detection , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[14]  Tinoosh Mohsenin,et al.  Real-Time Anomaly Detection Framework for Many-Core Router through Machine-Learning Techniques , 2016, JETC.

[15]  Houman Homayoun,et al.  Power and performance characterization, analysis and tuning for energy-efficient edge detection on atom and ARM based platforms , 2015, 2015 33rd IEEE International Conference on Computer Design (ICCD).

[16]  Tinoosh Mohsenin,et al.  Wearable seizure detection using convolutional neural networks with transfer learning , 2016, 2016 IEEE International Symposium on Circuits and Systems (ISCAS).

[17]  Houman Homayoun,et al.  Energy-efficient mapping of biomedical applications on domain-specific accelerator under process variation , 2014, 2014 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED).

[18]  Anantha Chandrakasan,et al.  An 8-Channel Scalable EEG Acquisition SoC With Patient-Specific Seizure Classification and Recording Processor , 2013, IEEE Journal of Solid-State Circuits.