Towards prediction of Quality of Life aspects using wearable data with limited ground truth

Today people live with a variety of diseases and health conditions that can impact their daily lives, from chronic illnesses such as cancer to mental health issues such as anxiety and depression. With the advancement of technology and healthcare, quality of life (QoL) prediction and improvement is becoming increasingly important to help individuals live better. In addition to data collected by questionnaires, machine learning can be employed to predict aspects of QoL, such as fatigue and anxiety, based on data from wearable devices. However, this requires ground truth information, which, in real-world cases is not always present, or is scarce. This paper addresses the problem of fatigue prediction, which is part of the overall QoL prediction. We extend previous literature by viewing the problem as regression, with the ultimate goal of transferring a model trained with a complete dataset of the literature to a real-world dataset with no ground truth, facing the challenge of only a few common features. We compare several regression and classification techniques to select the best-performing ones. We then explore the effect of using only limited important features on accuracy, which allow us to assess the applicability of transferring the models to a real-world dataset. We apply the trained model on data from the ONCORELIEF project and discuss the identified challenges.

[1]  M. Bolic,et al.  Handling Massive Proportion of Missing Labels in Multivariate Long-Term Time Series Forecasting , 2021, Journal of Physics: Conference Series.

[2]  Aurélie Boisbunon,et al.  PaZoe: classifying time series with few labels , 2021, 2021 29th European Signal Processing Conference (EUSIPCO).

[3]  Hannah Kerner,et al.  Learning to predict crop type from heterogeneous sparse labels using meta-learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[4]  Hongyu Luo,et al.  Assessment of Fatigue Using Wearable Sensors: A Pilot Study , 2020, Digital Biomarkers.

[5]  Pramod Bobade,et al.  Stress Detection with Machine Learning and Deep Learning using Multimodal Physiological Data , 2020, 2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA).

[6]  Rosalind W. Picard,et al.  Objective assessment of depressive symptoms with machine learning and wearable sensors data , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[7]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[8]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Michael B. Miller Linear Regression Analysis , 2013 .

[10]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[11]  B. Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[12]  Yang Liu,et al.  An introduction to decision tree modeling , 2004 .

[13]  C. Y. Peng,et al.  An Introduction to Logistic Regression Analysis and Reporting , 2002 .

[14]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[15]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[16]  L. Breiman Random Forests , 2001, Machine Learning.