Imputing Missing Data In Large-Scale Multivariate Biomedical Wearable Recordings Using Bidirectional Recurrent Neural Networks With Temporal Activation Regularization

Miniaturized and wearable sensor-based measurements offer unprecedented opportunities to study and assess human behavior in natural settings with wide ranging applications including in healthcare, wellness tracking and entertainment. However, wearable sensors are vulnerable to data loss due to body movement, sensor displacement, software malfunctions, etc. This generally hinders advanced data analytics including for clustering, data summarization, and pattern recognition requiring robust solutions for handling missing data to obtain accurate and unbiased analysis. Conventional data imputation strategies to address the challenges of missing data, including statistical fill-in, matrix factorization and traditional machine learning approaches, are inadequate in capturing temporal variations in multivariate time series. In this paper, we investigate data imputation using bidirectional recurrent neural networks with temporal activation regularization, which can directly learn and fill in the missing data. We evaluate the method on a large-scale multimodal wearable recording data-set of bio-behavioral signals we recently collected from over 100 hospital staff for a period of 10 weeks. Experimental results on these multimodal time series show the superiority of the proposed RNN-based method in terms of imputation accuracy.

[1]  Subhas Chandra Mukhopadhyay,et al.  Wearable Sensors for Human Activity Monitoring: A Review , 2015, IEEE Sensors Journal.

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Amay J Bandodkar,et al.  Non-invasive wearable electrochemical sensors: a review. , 2014, Trends in biotechnology.

[4]  Zhikui Chen,et al.  A weighted kernel possibilistic c‐means algorithm based on cloud computing for clustering big data , 2014, Int. J. Commun. Syst..

[5]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[6]  K. Anandakumar,et al.  A comprehensive review on usage of Internet of Things (IoT) in healthcare system , 2015, 2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT).

[7]  Ali Farhadi,et al.  Unsupervised Deep Embedding for Clustering Analysis , 2015, ICML.

[8]  Joseph G. Ibrahim,et al.  Missing data methods in longitudinal studies: a review , 2009 .

[9]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[10]  Robert Tibshirani,et al.  Spectral Regularization Algorithms for Learning Large Incomplete Matrices , 2010, J. Mach. Learn. Res..

[11]  R. DarshanK,et al.  A comprehensive review on usage of Internet of Things (IoT) in healthcare system , 2015 .

[12]  Wei Cao,et al.  BRITS: Bidirectional Recurrent Imputation for Time Series , 2018, NeurIPS.

[13]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[14]  Constantine Frangakis,et al.  Multiple imputation by chained equations: what is it and how does it work? , 2011, International journal of methods in psychiatric research.

[15]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[16]  Inderjit S. Dhillon,et al.  Temporal Regularized Matrix Factorization for High-dimensional Time Series Prediction , 2016, NIPS.

[17]  Chengqi Zhang,et al.  Missing Value Imputation Based on Data Clustering , 2008, Trans. Comput. Sci..

[18]  Aníbal R. Figueiras-Vidal,et al.  Pattern classification with missing data: a review , 2010, Neural Computing and Applications.

[19]  Shyamal Patel,et al.  A review of wearable sensors and systems with application in rehabilitation , 2012, Journal of NeuroEngineering and Rehabilitation.

[20]  Amy Loutfi,et al.  Data Mining for Wearable Sensors in Health Monitoring Systems: A Review of Recent Trends and Challenges , 2013, Sensors.

[21]  J. Shao,et al.  Nearest Neighbor Imputation for Survey Data , 2000 .

[22]  Trevor J. Hastie,et al.  Matrix completion and low-rank SVD via fast alternating least squares , 2014, J. Mach. Learn. Res..

[23]  Dan Jackson,et al.  What Is Meant by "Missing at Random"? , 2013, 1306.2812.