Outlier Detection for Multidimensional Time Series Using Deep Neural Networks

Due to the continued digitization of industrial and societal processes, including the deployment of networked sensors, we are witnessing a rapid proliferation of time-ordered observations, known as time series. For example, the behavior of drivers can be captured by GPS or accelerometer as a time series of speeds, directions, and accelerations. We propose a framework for outlier detection in time series that, for example, can be used for identifying dangerous driving behavior and hazardous road locations. Specifically, we first propose a method that generates statistical features to enrich the feature space of raw time series. Next, we utilize an autoencoder to reconstruct the enriched time series. The autoencoder performs dimensionality reduction to capture, using a small feature space, the most representative features of the enriched time series. As a result, the reconstructed time series only capture representative features, whereas outliers often have non-representative features. Therefore, deviations of the enriched time series from the reconstructed time series can be taken as indicators of outliers. We propose and study autoencoders based on convolutional neural networks and long-short term memory neural networks. In addition, we show that embedding of contextual information into the framework has the potential to further improve the accuracy of identifying outliers. We report on empirical studies with multiple time series data sets, which offers insight into the design properties of the proposed framework, indicating that it is effective at detecting outliers.

[1]  Christian S. Jensen,et al.  Enabling Time-Dependent Uncertain Eco-Weights For Road Networks , 2014, GeoRich'14.

[2]  Thomas Bäck,et al.  Online anomaly detection on the webscope S5 dataset: A comparative study , 2017, 2017 Evolving and Adaptive Intelligent Systems (EAIS).

[3]  Charu C. Aggarwal,et al.  Outlier Detection for Temporal Data: A Survey , 2014, IEEE Transactions on Knowledge and Data Engineering.

[4]  Mohamed Nadif,et al.  Denoising Autoencoder as an Effective Dimensionality Reduction and Clustering of Text Data , 2017, PAKDD.

[5]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[6]  Jian Dai,et al.  Personalized route recommendation using big trajectory data , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[7]  Sridhar Ramaswamy,et al.  Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.

[8]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[9]  Christian S. Jensen,et al.  EcoSky: Reducing vehicular environmental impact through eco-routing , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[10]  Ralf Hartmut Güting,et al.  Network-Matched Trajectory-Based Moving-Object Database: Models and Applications , 2015, IEEE Transactions on Intelligent Transportation Systems.

[11]  Mohammed J. Zaki,et al.  ADMIT: anomaly-based data mining for intrusions , 2002, KDD.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Malik Yousef,et al.  One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..

[14]  Niels Agerholm,et al.  Identification of Hazardous Road Locations on the basis of jerks , 2015 .

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[17]  Eric Horvitz,et al.  A Deep Hybrid Model for Weather Forecasting , 2015, KDD.

[18]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Yu Cheng,et al.  Unsupervised Sequential Outlier Detection With Deep Architectures , 2017, IEEE Transactions on Image Processing.

[20]  Raman K. Mehra,et al.  Detection and classification of intrusions and faults using sequences of system calls , 2001, SGMD.

[21]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[22]  Takehisa Yairi,et al.  Anomaly Detection Using Autoencoders with Nonlinear Dimensionality Reduction , 2014, MLSDA'14.

[23]  Bin Yang,et al.  Enabling Smart Transportation Systems: A Parallel Spatio-Temporal Database Approach , 2016, IEEE Transactions on Computers.

[24]  Gang Hua,et al.  Learning Discriminative Reconstructions for Unsupervised Outlier Removal , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[25]  Cheng Guo,et al.  Entity Embeddings of Categorical Variables , 2016, ArXiv.

[26]  Changsheng Li,et al.  Autoencoder Regularized Network For Driving Style Representation Learning , 2017, IJCAI.

[27]  Luis Miguel Bergasa,et al.  Need data for driver behaviour analysis? Presenting the public UAH-DriveSet , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[28]  Lovekesh Vig,et al.  LSTM-based Encoder-Decoder for Multi-sensor Anomaly Detection , 2016, ArXiv.

[29]  Christian S. Jensen,et al.  Path Cost Distribution Estimation Using Trajectory Data , 2016, Proc. VLDB Endow..

[30]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[31]  Christian S. Jensen,et al.  Risk-aware path selection with time-varying, uncertain travel costs: a time series approach , 2018, The VLDB Journal.

[32]  R. Tsay,et al.  Outlier Detection in Multivariate Time Series by Projection Pursuit , 2006 .

[33]  Christian S. Jensen,et al.  Learning to Route with Sparse Trajectory Sets , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[34]  Christian S. Jensen,et al.  Toward personalized, context-aware routing , 2015, The VLDB Journal.

[35]  Christian S. Jensen,et al.  PACE: a PAth-CEntric paradigm for stochastic path finding , 2017, The VLDB Journal.

[36]  Christian S. Jensen,et al.  Travel Cost Inference from Sparse, Spatio-Temporally Correlated Time Series Using Markov Models , 2013, Proc. VLDB Endow..

[37]  Andrew McCallum,et al.  Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.

[38]  Sven Behnke,et al.  Evaluation of Pooling Operations in Convolutional Architectures for Object Recognition , 2010, ICANN.

[39]  Asher Bender,et al.  A Flexible System Architecture for Acquisition and Storage of Naturalistic Driving Data , 2016, IEEE Transactions on Intelligent Transportation Systems.

[40]  Aoying Zhou,et al.  Finding Top-k Shortest Paths with Diversity , 2018, IEEE Transactions on Knowledge and Data Engineering.

[41]  Aoying Zhou,et al.  TRUSTER: TRajectory Data Processing on ClUSTERs , 2009, DASFAA.

[42]  Subutai Ahmad,et al.  Evaluating Real-Time Anomaly Detection Algorithms -- The Numenta Anomaly Benchmark , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[43]  Rajat Raina,et al.  Large-scale deep unsupervised learning using graphics processors , 2009, ICML '09.

[44]  Christian S. Jensen,et al.  Towards Total Traffic Awareness , 2014, SGMD.

[45]  Aoying Zhou,et al.  Finding Top-k Optimal Sequenced Routes , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[46]  Yasushi Sakurai Mining and Forecasting of Big Time-Series Data , 2019, PerCom Workshops.

[47]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.