Queries on Synthetic Images for Large Multivariate Engineering Data Base Searches

Engineers are often interested in searching through historical engine testing data sets to explore systems behaviours across experiments or simply to validate the integrity of data by reference to past experience. Engine tests usually consist of design of experiments (DoE) tests carried out at different operating conditions on an engine. While DoE tests record a time or measurement field so could fall under the multivariate time series domain, however, the engineering experiments have no important connection with order. Using this knowledge we have developed a method of visualising the engineering data as a set of normalised histograms to manufacture synthetic images for every test. From these images and their pixel rows we propose a list base indexing method to efficiently index the engine test images and a fast algorithm to search both the univariate and multivariate cases. Querying the data for similar behaviours thus becomes much simpler for the engineer, compared to manually searching files. Our data is of a heterogeneous form which can often mean the absence of columns where they are present in other data sets. It can even sometimes mean the comparison between different data types. This data inconsistency is something that from our research is rarely reported or attempted to traverse, but is consistently occurring in real multivariate data case studies. In order to overcome this realistic problem our method will allow for comparison of univariate data with differing quality scores.

[1]  Georges G. Grinstein,et al.  Iconographic Displays For Visualizing Multidimensional Data , 1988, Proceedings of the 1988 IEEE International Conference on Systems, Man, and Cybernetics.

[2]  D. Lu,et al.  Change detection techniques , 2004 .

[3]  W. Malila Change Vector Analysis: An Approach for Detecting Forest Changes with Landsat , 1980 .

[4]  Daniel B. Carr,et al.  Scatterplot matrix techniques for large N , 1986 .

[5]  Jiju Antony,et al.  Design of experiments for engineers and scientists , 2003 .

[6]  Macarena Espinilla,et al.  Sensor-Based Datasets for Human Activity Recognition – A Systematic Review of Literature , 2018, IEEE Access.

[7]  Tullio Vernazza,et al.  Human motion modelling and recognition: A computational approach , 2012, 2012 IEEE International Conference on Automation Science and Engineering (CASE).

[8]  Kanishka Bhaduri,et al.  Fast and Flexible Multivariate Time Series Subsequence Search , 2010, 2010 IEEE International Conference on Data Mining.

[9]  F. Chaudoye,et al.  Comparison of Engine Calibration Methods Based on Design of Experiments (DoE) , 2008 .

[10]  Max A. Little,et al.  Highly comparative time-series analysis: the empirical structure of time series and their methods , 2013, Journal of The Royal Society Interface.

[11]  Ashbindu Singh,et al.  Review Article Digital change detection techniques using remotely-sensed data , 1989 .

[12]  Claudio Gallicchio,et al.  Human activity recognition using multisensor data fusion based on Reservoir Computing , 2016, J. Ambient Intell. Smart Environ..

[13]  Li Wei,et al.  Intelligent Icons: Integrating Lite-Weight Data Mining and Visualization into GUI Operating Systems , 2006, Sixth International Conference on Data Mining (ICDM'06).

[14]  Wojciech Basalaj Proximity visualisation of abstract data , 2001 .