Symbolic Analysis of Machine Behaviour and the Emergence of the Machine Language

This paper takes a fundamental new approach to symbolic time series analysis of real time data acquired from human driven mining equipment, which can be seen as stochastic physical systems with non analytic human interaction (hybrid systems). The developed framework uses linear differential operators (LDO) to include the system dynamics within the analysis, whereas the metaphor of language is used to mimic the human interaction. After applying LDO, the multidimensional data stream is converted into a single symbolic time series yielding a more abstract but highly condense representation of the original data. Inspired by natural language, the presented algorithm combines iteratively symbol pairs (word pairs) which occur frequently to new symbols/words; a machine-specific language emerges in a hierarchical manner, which automatically structures the dataset into segments and sub-segments. As a demonstration, the automatic recognition of operation modes of a bucket-wheel excavator is presented, proving the metaphor of language to be valuable in such hybrid systems.

[1]  János Abonyi,et al.  Dynamic Principal Component Analysis in Multivariate Time-Series Segmentation , 2011 .

[2]  Claude Sammut,et al.  Variance-wise Segmentation for a Temporal-Adaptive SAX , 2012, AusDM.

[3]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[4]  Ian H. Witten,et al.  Identifying Hierarchical Structure in Sequences: A linear-time algorithm , 1997, J. Artif. Intell. Res..

[5]  Bernhard Sick,et al.  Online Segmentation of Time Series Based on Polynomial Least-Squares Approximations , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Marina Milanović,et al.  ALGORITHMIC METHODS FOR SEGMENTATION OF TIME SERIES: AN OVERVIEW , 2014 .

[7]  Tim Oates,et al.  GrammarViz 2.0: A Tool for Grammar-Based Pattern Discovery in Time Series , 2014, ECML/PKDD.

[8]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[9]  Matthew Harker,et al.  Advanced Symbolic Time Series Analysis in Cyber Physical Systems , 2018, ArXiv.

[10]  Ramin Zabih,et al.  The 30th Anniversary of the IEEE Transactions on Pattern Analysis and Machine Intelligence , 2010, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Irfan A. Essa,et al.  Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[12]  Sahin Albayrak,et al.  Pattern recognition and classification for multivariate time series , 2011, SensorKDD '11.

[13]  Haifeng Chen,et al.  Time Series Segmentation to Discover Behavior Switching in Complex Physical Systems , 2015, 2015 IEEE International Conference on Data Mining.

[14]  Heikki Mannila,et al.  Time series segmentation for context recognition in mobile devices , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[15]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[16]  Irfan A. Essa,et al.  Discovering Characteristic Actions from On-Body Sensor Data , 2006, 2006 10th IEEE International Symposium on Wearable Computers.

[17]  Jean Gotman,et al.  Adaptive segmentation of electroencephalographic data using a nonlinear energy operator , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).

[18]  Xiaodong Liu,et al.  Dynamic programming approach for segmentation of multivariate time series , 2014, Stochastic Environmental Research and Risk Assessment.

[19]  Nagiza F. Samatova,et al.  Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[20]  Lars Schmidt-Thieme,et al.  Scalable Classification of Repetitive Time Series Through Frequencies of Local Polynomials , 2015, IEEE Transactions on Knowledge and Data Engineering.

[21]  J. Gillis,et al.  Linear Differential Operators , 1963 .

[22]  Gerhard Thonhauser,et al.  Multivariate Time Series Classification by Combining Trend-Based and Value-Based Approximations , 2012, ICCSA.

[23]  Dan Lusthaus Buddhist Phenomenology: A Philosophical Investigation of Yogacara Buddhism and the Ch'eng Wei-shih Lun , 2002 .

[24]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[25]  János Abonyi,et al.  Monitoring process transitions by Kalman filtering and time-series segmentation , 2005, Comput. Chem. Eng..

[26]  Tim Oates,et al.  GrammarViz 3.0 , 2018, ACM Trans. Knowl. Discov. Data.

[27]  Asok Ray,et al.  Pattern identification in dynamical systems via symbolic time series analysis , 2007, Pattern Recognit..

[28]  Nagiza F. Samatova,et al.  Theory-Guided Data Science for Climate Change , 2014, Computer.

[29]  Shimon Ullman,et al.  Combined Top-Down/Bottom-Up Segmentation , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[31]  Matthew Harker,et al.  An inverse problem approach to approximating sensor data in cyber physical systems , 2015, 2015 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) Proceedings.