Anomaly Detections for Manufacturing Systems Based on Sensor Data—Insights into Two Challenging Real-World Production Settings

To build, run, and maintain reliable manufacturing machines, the condition of their components has to be monitored continuously. Fine-grained monitoring of these machines raises two challenges: (1) feeding large amounts of sensor data to downstream processing components and (2) analyzing the produced data in a meaningful way. Regarding the latter, practitioners and researchers pursue a wide range of purposes. This paper discusses two analyses of real-world datasets that were generated in production settings. More specifically, the analyses aimed (1) to detect sensor data anomalies for further analysis in a pharma packaging scenario and (2) to predict unfavorable temperature values in a 3D printing machine environment. Based on the results of these analyses, the paper shows that anomaly detection can efficiently support the proper management of machines and their components in industrial manufacturing environments. This, in turn, should help production companies support their technical evangelists more effectively.
