Machine Learning for Biomedical Time Series Classification: From Shapelets to Deep Learning

With the biomedical field generating large quantities of time series data, there has been a growing interest in developing and refining machine learning methods that allow its mining and exploitation. Classification is one of the most important and challenging machine learning tasks related to time series. Many biomedical phenomena, such as the brain's activity or blood pressure, change over time. The objective of this chapter is to provide a gentle introduction to time series classification. In the first part we describe the characteristics of time series data and challenges in its analysis. The second part provides an overview of common machine learning methods used for time series classification. A real-world use case, the early recognition of sepsis, demonstrates the applicability of the methods discussed.

[1]  R. Fisher On the Interpretation of χ2 from Contingency Tables, and the Calculation of P , 2010 .

[2]  James R. Bence,et al.  Analysis of Short Time Series: Correcting for Autocorrelation , 1995 .

[3]  M. Levy,et al.  Empiric Antibiotic Treatment Reduces Mortality in Severe Sepsis and Septic Shock From the First Hour: Results From a Guideline-Based Performance Improvement Program* , 2014, Critical care medicine.

[4]  Andrey Kormilitzin,et al.  A Primer on the Signature Method in Machine Learning , 2016, ArXiv.

[5]  S. Dudoit,et al.  Multiple Testing Procedures with Applications to Genomics , 2007 .

[6]  Gregory D. Hager,et al.  Temporal Convolutional Networks for Action Segmentation and Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Eamonn J. Keogh,et al.  Time Series Chains: A Novel Tool for Time Series Data Mining , 2018, IJCAI.

[8]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[9]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[10]  G. Miller,et al.  Science Perspectives on Psychological the Smartphone Psychology Manifesto on Behalf Of: Association for Psychological Science the Smartphone Psychology Manifesto Previous Research Using Mobile Electronic Devices What Smartphones Can Do Now and Will Be Able to Do in the near Future , 2022 .

[11]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[12]  Mirjana Ivanovic,et al.  A Framework for Time-Series Analysis , 2010, AIMSA.

[13]  Roberto Refinetti,et al.  The circadian rhythm of body temperature , 1992, Physiology & Behavior.

[14]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[15]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[16]  R. Bellomo,et al.  The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[17]  Eamonn J. Keogh,et al.  Matrix Profile I: All Pairs Similarity Joins for Time Series: A Unifying View That Includes Motifs, Discords and Shapelets , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[18]  Tim Salimans,et al.  Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.

[19]  Peng Zhang,et al.  Data Mining on ICU Mortality Prediction Using Early Temporal Data: A Survey , 2017, Int. J. Inf. Technol. Decis. Mak..

[20]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[21]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[22]  M. Emre Celebi,et al.  Unsupervised Learning Algorithms , 2016 .

[23]  Samy Bengio,et al.  Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Michael E. Symonds,et al.  Brown Adipose Tissue and Seasonal Variation in Humans , 2009, Diabetes.

[25]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[26]  Tarone Re A modified Bonferroni method for discrete data. , 1990 .

[27]  De Gooijer,et al.  Elements of nonlinear time series analysis and forecasting , 2017 .

[28]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[29]  Katherine A. Heller,et al.  Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier , 2017, ICML.

[30]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[31]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Andreas Stolcke,et al.  The Microsoft 2017 Conversational Speech Recognition System , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[33]  Shamim Nemati,et al.  Early Prediction of Sepsis From Clinical Data: The PhysioNet/Computing in Cardiology Challenge 2019 , 2019, 2019 Computing in Cardiology (CinC).

[34]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[35]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[36]  Jason Lines,et al.  Classification of time series by shapelet transformation , 2013, Data Mining and Knowledge Discovery.

[37]  Antonino Fiannaca,et al.  A Deep Learning Approach to DNA Sequence Classification , 2015, CIBB.

[38]  Uli K. Chettipally,et al.  Prediction of Sepsis in the Intensive Care Unit With Minimal Electronic Health Record Data: A Machine Learning Approach , 2016, JMIR medical informatics.

[39]  Quoc V. Le,et al.  Searching for Activation Functions , 2018, arXiv.

[40]  Claude Sammut,et al.  Classification of Multivariate Time Series and Structured Data Using Constructive Induction , 2005, Machine Learning.

[41]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[42]  Eamonn J. Keogh,et al.  Fast Shapelets: A Scalable Algorithm for Discovering Time Series Shapelets , 2013, SDM.

[43]  Mohamed F. Ghalwash,et al.  Early classification of multivariate temporal observations by extraction of interpretable shapelets , 2012, BMC Bioinformatics.

[44]  Karsten M. Borgwardt,et al.  Association mapping in biomedical time series via statistically significant shapelet mining , 2018, Bioinform..

[45]  Suman V. Ravuri,et al.  A Clinically Applicable Approach to Continuous Prediction of Future Acute Kidney Injury , 2019, Nature.

[46]  Gebhard Kirchgässner,et al.  Introduction to Modern Time Series Analysis , 2007 .

[47]  Aram Galstyan,et al.  Multitask learning and benchmarking with clinical time series data , 2017, Scientific Data.

[48]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[50]  S. Cochi,et al.  Measles mortality reduction contributes substantially to reduction of all cause mortality among children less than five years of age, 1990-2008. , 2011, The Journal of infectious diseases.

[51]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[52]  Christopher W. Barton,et al.  A computational approach to early sepsis detection , 2016, Comput. Biol. Medicine.

[53]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[54]  Jianqing Wang,et al.  Motion artefact removals for wearable ECG using stationary wavelet transform , 2017, Healthcare technology letters.

[55]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[56]  Li Wei,et al.  Fast time series classification using numerosity reduction , 2006, ICML.

[57]  K. Tsuda,et al.  Statistical significance of combinatorial regulations , 2013, Proceedings of the National Academy of Sciences.

[58]  T. Shajina,et al.  Human Gait Recognition and Classification Using Time Series Shapelets , 2012, 2012 International Conference on Advances in Computing and Communications.

[59]  Germain Forestier,et al.  Optimizing dynamic time warping’s window width for time series data mining applications , 2018, Data Mining and Knowledge Discovery.

[60]  Lars Schmidt-Thieme,et al.  Learning time-series shapelets , 2014, KDD.

[61]  G. C. Tiao,et al.  A Course in Time Series Analysis , 2000 .

[62]  Katherine A. Heller,et al.  An Improved Multi-Output Gaussian Process RNN with Real-Time Validation for Early Sepsis Detection , 2017, MLHC.

[63]  John V. Guttag,et al.  Multiple Instance Learning for ECG Risk Stratification , 2019, MLHC.

[64]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[65]  Germain Forestier,et al.  Deep learning for time series classification: a review , 2018, Data Mining and Knowledge Discovery.

[66]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[67]  R. Fisher On the Interpretation of χ2 from Contingency Tables, and the Calculation of P , 2018, Journal of the Royal Statistical Society Series A (Statistics in Society).

[68]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[69]  Mohamed F. Ghalwash,et al.  Early Diagnosis and Its Benefits in Sepsis Blood Purification Treatment , 2013, 2013 IEEE International Conference on Healthcare Informatics.

[70]  J. Durbin,et al.  Testing for serial correlation in least squares regression. II. , 1950, Biometrika.

[71]  Ritankar Das,et al.  Effect of a machine learning-based severe sepsis prediction algorithm on patient survival and hospital length of stay: a randomised clinical trial , 2017, BMJ Open Respiratory Research.

[72]  Lovekesh Vig,et al.  Long Short Term Memory Networks for Anomaly Detection in Time Series , 2015, ESANN.

[73]  Vladlen Koltun,et al.  An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[74]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[75]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[76]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[77]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[78]  Claude E. Shannon,et al.  The mathematical theory of communication , 1950 .

[79]  Eamonn J. Keogh,et al.  The UCR time series archive , 2018, IEEE/CAA Journal of Automatica Sinica.

[80]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[81]  Karsten Borgwardt,et al.  Early Recognition of Sepsis with Gaussian Process Temporal Convolutional Networks and Dynamic Time Warping , 2019, MLHC.

[82]  Björn W. Schuller,et al.  Social signal classification using deep blstm recurrent neural networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[83]  Luca Maria Gambardella,et al.  Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks , 2013, MICCAI.

[84]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[85]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.