Machine Learning: Anomaly Detection

It is important to identify deviation from the nominally healthy behavior of the product and detect the onset of the product's potential faults for achieving prognostics and health management (PHM). This chapter offers a comprehensive overview of the research on anomaly detection and discusses the challenges in anomaly detection. For anomaly detection, methods can be categorized into distance‐based, clustering based, classification‐based, and statistical anomaly detection methods. The chapter provides the underlying background of the type of anomalies that can be classified into one of the following categories: point anomalies, contextual anomalies, and collective anomalies. Clustering is the partitioning of a dataset into clusters by maximizing inter‐cluster distances and minimizing intra‐cluster distances. The chapter summarizes the advantages and disadvantages of clustering‐based anomaly detection methods. A self‐organizing maps (SOM), also known as a Kohonen neural network, is a type of unsupervised learning.

[1]  K. Worden,et al.  Statistical Damage Classification Using Sequential Probability Ratio Tests , 2002 .

[2]  Michael G. Pecht,et al.  Anomaly Detection of Light-Emitting Diodes Using the Similarity-Based Metric Test , 2014, IEEE Transactions on Industrial Informatics.

[3]  Andrew Kusiak,et al.  Monitoring Wind Turbine Vibration Based on SCADA Data , 2012 .

[4]  M.G. Pecht,et al.  In situ temperature measurement of a notebook computer - a case study in health and usage monitoring of electronics , 2004, IEEE Transactions on Device and Materials Reliability.

[5]  Sushil Jajodia,et al.  ADAM: a testbed for exploring the use of data mining in intrusion detection , 2001, SGMD.

[6]  Iren Valova,et al.  Initialization Issues in Self-organizing Maps , 2013, Complex Adaptive Systems.

[7]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[8]  Qiang Miao,et al.  Online Anomaly Detection for Hard Disk Drives Based on Mahalanobis Distance , 2013, IEEE Transactions on Reliability.

[9]  Andreas Theissler,et al.  Detecting known and unknown faults in automotive systems using ensemble-based anomaly detection , 2017, Knowl. Based Syst..

[10]  M. Pecht,et al.  A Wireless Sensor System for Prognostics and Health Management , 2010, IEEE Sensors Journal.

[11]  Ruggero G. Pensa,et al.  From Context to Distance: Learning Dissimilarity for Categorical Data Clustering , 2012, TKDD.

[12]  Mario Vento,et al.  To reject or not to reject: that is the question-an answer in case of neural classifiers , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[13]  Frank H. Clarke,et al.  A New Approach to Lagrange Multipliers , 1976, Math. Oper. Res..

[14]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[15]  F. J. Anscombe,et al.  Rejection of Outliers , 1960 .

[16]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[17]  Jun Zhou,et al.  Anomaly detection for satellite power subsystem with associated rules based on Kernel Principal Component Analysis , 2015, Microelectron. Reliab..

[18]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[19]  Michael G. Pecht,et al.  A prognostics and health management roadmap for information and electronics-rich systems , 2010, Microelectron. Reliab..

[20]  Enrico Zio,et al.  Transients Analysis of a Nuclear Power Plant Component for Fault Diagnosis , 2013 .

[21]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[22]  Geoffrey E. Hinton Deep belief networks , 2009, Scholarpedia.

[23]  Tommy W. S. Chow,et al.  Anomaly detection of cooling fan and fault classification of induction motor using Mahalanobis-Taguchi system , 2013, Expert Syst. Appl..

[24]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[25]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[26]  Haibin Zhu,et al.  An Improved kNN Algorithm - Fuzzy kNN , 2005, CIS.

[27]  Michael G. Pecht,et al.  Health Monitoring of Cooling Fans Based on Mahalanobis Distance With mRMR Feature Selection , 2012, IEEE Transactions on Instrumentation and Measurement.

[28]  Felix Naumann,et al.  Data fusion , 2009, CSUR.

[29]  Chang Liu,et al.  Clustering diagnosis of rolling element bearing fault based on integrated Autoregressive/Autoregressive Conditional Heteroscedasticity model , 2012 .

[30]  C. P. Gupta,et al.  Nonlinear elliptic boundary value problems in Lp-spaces and sums of ranges of accretive operators , 1978 .

[31]  K. Gross,et al.  Sequential probability ratio test for nuclear plant component surveillance , 1991 .

[32]  Marion R. Reynolds,et al.  The SPRT control chart for the process mean with samples starting at fixed times , 2001 .