Self-supervised transfer learning of physiological representations from free-living wearable data

Wearable devices such as smartwatches are becoming increasingly popular tools for objectively monitoring physical activity in free-living conditions. To date, research has primarily focused on the purely supervised task of human activity recognition, demonstrating limited success in inferring high-level health outcomes from low-level signals. Here, we present a novel self-supervised representation learning method using activity and heart rate (HR) signals without semantic labels. With a deep neural network, we set HR responses as the supervisory signal for the activity data, leveraging their underlying physiological relationship. In addition, we propose a custom quantile loss function that accounts for the long-tailed HR distribution present in the general population. We evaluate our model in the largest free-living combined-sensing dataset (comprising >280k hours of wrist accelerometer & wearable ECG data). Our contributions are two-fold: i) the pre-training task creates a model that can accurately forecast HR based only on cheap activity sensors, and ii) we leverage the information captured through this task by proposing a simple method to aggregate the learnt latent representations (embeddings) from the window-level to user-level. Notably, we show that the embeddings can generalize in various downstream tasks through transfer learning with linear classifiers, capturing physiologically meaningful, personalized information. For instance, they can be used to predict variables associated with individuals' health, fitness and demographic characteristics, outperforming unsupervised autoencoders and common bio-markers. Overall, we propose the first multimodal self-supervised method for behavioral and physiological data with implications for large-scale health and lifestyle monitoring.

[1]  John V. Guttag,et al.  EXTRACT: Strong Examples from Weakly-Labeled Sensor Data , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[2]  Robert J. Piechocki,et al.  Online Heart Rate Prediction using Acceleration from a Wrist Worn Wearable , 2018, KDD 2018.

[3]  Scott Lundberg,et al.  Deep Transfer Learning for Physiological Signals , 2020, ArXiv.

[4]  Qi Liu,et al.  Multi-Task Self-Supervised Learning for Disfluency Detection , 2019, AAAI.

[5]  Walter Karlen,et al.  PhoneMD: Learning to Diagnose Parkinson's Disease from Smartphone Data , 2018, AAAI.

[6]  Steven E. Nissen,et al.  Association of Cardiorespiratory Fitness With Long-term Mortality Among Adults Undergoing Exercise Treadmill Testing , 2018, JAMA network open.

[7]  Akane Sano,et al.  Predicting Tomorrow's Mood, Health, and Stress Level using Personalized Multitask Learning and Domain Adaptation , 2017, AffComp@IJCAI.

[8]  Bernt Schiele,et al.  A tutorial on human activity recognition using body-worn inertial sensors , 2014, CSUR.

[9]  Geoffrey H. Tison,et al.  DeepHeart: Semi-Supervised Sequence Learning for Cardiovascular Risk Prediction , 2018, AAAI.

[10]  Alan L. Yuille,et al.  Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images , 2016, NIPS.

[11]  Ashish Sharma,et al.  Insights from the Long-Tail: Learning Latent Representations of Online User Behavior in the Presence of Skew and Sparsity , 2018, CIKM.

[12]  Ian Janssen,et al.  Metabolic syndrome, obesity, and mortality: impact of cardiorespiratory fitness. , 2005, Diabetes care.

[13]  Shaohan Hu,et al.  DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing , 2016, WWW.

[14]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[15]  Marc G. Bellemare,et al.  Distributional Reinforcement Learning with Quantile Regression , 2017, AAAI.

[16]  Hazem Elzarka,et al.  Advanced machine learning techniques for building performance simulation: a comparative analysis , 2018, Journal of Building Performance Simulation.

[17]  U. Ekelund,et al.  Branched equation modeling of simultaneous accelerometry and heart rate monitoring improves estimate of directly measured physical activity energy expenditure. , 2004, Journal of applied physiology.

[18]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[19]  Nils Y. Hammerla,et al.  Large Scale Population Assessment of Physical Activity Using Wrist Worn Accelerometers: The UK Biobank Study , 2017, PloS one.

[20]  Wenzhong Li,et al.  AttnSense: Multi-level Attention Mechanism For Multimodal Human Activity Recognition , 2019, IJCAI.

[21]  Tamara B Harris,et al.  Daily activity energy expenditure and mortality among older adults. , 2006, JAMA.

[22]  Andrew M. Jones,et al.  The Effect of Endurance Training on Parameters of Aerobic Fitness , 2000, Sports medicine.

[23]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[24]  Jean-Claude Tardif,et al.  Resting heart rate in cardiovascular disease. , 2007, Journal of the American College of Cardiology.

[25]  Ali Etemad,et al.  Self-Supervised Learning for ECG-Based Emotion Recognition , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[27]  Thomas Plötz,et al.  Deep, Convolutional, and Recurrent Models for Human Activity Recognition Using Wearables , 2016, IJCAI.

[28]  Xian Wu,et al.  Representation Learning on Variable Length and Incomplete Wearable-Sensory Time Series , 2020, ArXiv.

[29]  Kevin Gimpel,et al.  ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.

[30]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[31]  Rainer Rauramaa,et al.  Heart rate response during exercise test and cardiovascular mortality in middle-aged men. , 2006, European heart journal.

[32]  Luca Foschini,et al.  Learning Individualized Cardiovascular Responses from Large-scale Wearable Sensors Data , 2018, ArXiv.

[33]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[34]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[35]  Francisco C. Pereira,et al.  Beyond Expectation: Deep Joint Mean and Quantile Regression for Spatiotemporal Problems. , 2018, IEEE transactions on neural networks and learning systems.

[36]  Julian J. McAuley,et al.  Modeling Heart Rate and Activity Data for Personalized Fitness Recommendation , 2019, WWW.

[37]  Paolo Favaro,et al.  Self-Supervised Feature Learning by Learning to Spot Artifacts , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Johan Lukkien,et al.  Multi-task Self-Supervised Learning for Human Activity Detection , 2019, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[39]  Xiaoli Li,et al.  Deep Convolutional Neural Networks on Multichannel Time Series for Human Activity Recognition , 2015, IJCAI.

[40]  Vera Maljkovic,et al.  Developing Measures of Cognitive Impairment in the Real World from Consumer-Grade Multimodal Sensor Streams , 2019, KDD.

[41]  Ming-Hsuan Yang,et al.  Unsupervised Representation Learning by Sorting Sequences , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[42]  Hwee Pink Tan,et al.  Deep Activity Recognition Models with Triaxial Accelerometers , 2015, AAAI Workshop: Artificial Intelligence Applied to Assistive Technologies and Smart Environments.

[43]  Richard C. Gerkin,et al.  Machine Learning for Scent: Learning Generalizable Perceptual Representations of Small Molecules , 2019, ArXiv.

[44]  VALENTIN RADU,et al.  Multimodal Deep Learning for Activity and Context Recognition , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[45]  J. Leskovec,et al.  Large-scale physical activity data reveal worldwide activity inequality , 2017, Nature.

[46]  Hirofumi Tanaka,et al.  Age-predicted maximal heart rate revisited. , 2001, Journal of the American College of Cardiology.

[47]  Andrew Owens,et al.  Ambient Sound Provides Supervision for Visual Learning , 2016, ECCV.

[48]  Shafiq R. Joty,et al.  Adversarial Unsupervised Representation Learning for Activity Time-Series , 2018, AAAI.