Fast body part segmentation and tracking of neonatal video data using deep learning

Photoplethysmography imaging (PPGI) for non-contact monitoring of preterm infants in the neonatal intensive care unit (NICU) is a promising technology, as it could reduce medical adhesive-related skin injuries and associated complications. For practical implementations of PPGI, a region of interest has to be detected automatically in real time. As the neonates’ body proportions differ significantly from adults, existing approaches may not be used in a straightforward way, and color-based skin detection requires RGB data, thus prohibiting the use of less-intrusive near-infrared (NIR) acquisition. In this paper, we present a deep learning-based method for segmentation of neonatal video data. We augmented an existing encoder-decoder semantic segmentation method with a modified version of the ResNet-50 encoder. This reduced the computational time by a factor of 7.5, so that 30 frames per second can be processed at 960 × 576 pixels. The method was developed and optimized on publicly available databases with segmentation data from adults. For evaluation, a comprehensive dataset consisting of RGB and NIR video recordings from 29 neonates with various skin tones recorded in two NICUs in Germany and India was used. From all recordings, 643 frames were manually segmented. After pre-training the model on the public adult data, parts of the neonatal data were used for additional learning and left-out neonates are used for cross-validated evaluation. On the RGB data, the head is segmented well (82% intersection over union, 88% accuracy), and performance is comparable with those achieved on large, public, non-neonatal datasets. On the other hand, performance on the NIR data was inferior. By employing data augmentation to generate additional virtual NIR data for training, results could be improved and the head could be segmented with 62% intersection over union and 65% accuracy. The method is in theory capable of performing segmentation in real time and thus it may provide a useful tool for future PPGI applications. Graphical Abstract This work presents the development of a customized, real-time capable Deep Learning architecture for segmenting of neonatal videos recorded in the intensive care unit. In addition to hand-annotated data, transfer learning is exploited to improve performance.

[1]  Lorenzo Scalise,et al.  Heart rate measurement in neonatal patients using a webcamera , 2012, 2012 IEEE International Symposium on Medical Measurements and Applications Proceedings.

[2]  Vladimir Blazek,et al.  Photoplethysmography imaging: a new noninvasive and noncontact method for mapping of the dermal perfusion changes , 2000, European Conference on Biomedical Optics.

[3]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Steffen Leonhardt,et al.  A Broader Look: Camera-Based Vital Sign Estimation across the Spectrum , 2019, Yearbook of Medical Informatics.

[5]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[6]  M Paul,et al.  Non-contact sensing of neonatal pulse rate using camera-based imaging: a clinical feasibility study , 2020, Physiological measurement.

[7]  Carolyn H. Lund,et al.  Medical Adhesives in the NICU , 2014 .

[8]  W. Verkruysse,et al.  Non-contact heart rate monitoring utilizing camera photoplethysmography in the neonatal intensive care unit - a pilot study. , 2013, Early human development.

[9]  Hagen Malberg,et al.  Cardiovascular assessment by imaging photoplethysmography – a review , 2018, Biomedizinische Technik. Biomedical engineering.

[10]  João Jorge,et al.  Non-Contact Monitoring of Respiration in the Neonatal Intensive Care Unit , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[11]  Arindam Sikdar,et al.  Contactless vision-based pulse rate detection of Infants Under Neurological Examinations , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[12]  Wolfram Burgard,et al.  Deep learning for human part discovery in images , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[14]  Ann-Beth Moller,et al.  National, regional, and worldwide estimates of preterm birth rates in the year 2010 with time trends since 1990 for selected countries: a systematic analysis and implications , 2012, The Lancet.

[15]  S. Thornton,et al.  Preterm Birth: Causes, Consequences and Prevention , 2008 .

[16]  L. Tarassenko,et al.  Continuous non-contact vital sign monitoring in neonatal intensive care unit , 2014, Healthcare technology letters.

[17]  Mohamed Abderrahim,et al.  Non-Contact, Simple Neonatal Monitoring by Photoplethysmography , 2018, Sensors.

[18]  Sanja Fidler,et al.  Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Yang Wang,et al.  Gated Feedback Refinement Network for Coarse-to-Fine Dense Semantic Image Labeling , 2018, ArXiv.

[20]  Assuring Healthy Outcomes,et al.  Preterm Birth : Causes , Consequences , and Prevention , 2005 .

[21]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Steffen Leonhardt,et al.  Remote vital parameter monitoring in neonatology – robust, unobtrusive heart rate detection in a realistic clinical scenario , 2016, Biomedizinische Technik. Biomedical engineering.

[25]  Lorenzo Scalise,et al.  Assessment of cardio-respiratory rates by non-invasive measurement methods in hospitalized preterm neonates , 2018, 2018 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[26]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[27]  Ian D. Reid,et al.  RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[29]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[30]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Andrew Zisserman,et al.  Localised photoplethysmography imaging for heart rate estimation of pre-term infants in the clinic , 2018, BiOS.

[32]  Andrew Zisserman,et al.  Multi-Task Convolutional Neural Network for Patient Detection and Skin Segmentation in Continuous Non-Contact Vital Sign Monitoring , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).