The data learning and anomaly detection based on the rudder system testing facility

Abstract For the data analysis of existing rudder system testing facility (RSTF) being a manual process now, a machine learning (ML) method for fault diagnosis based on RSTF is proposed to realize intelligent data analysis. For this purpose, we have focused on developing a new decision-points-distribution and weight-assignment-oversampling method integrated with optimized Support Vector Machine (SVM) to conduct anomaly detection based on RSTF in this paper, which takes advantage of decision making derived from SVM and is combined with the cluster-based synthetic samples generation mechanism. It is proposed to solve the problem caused by imbalanced data collected from RSTF. Additionally, the SVM classifier is optimized by Perturbed Particle Swarm Optimization (PPSO) while avoiding the risk of falling into local optimization. Experiments are conducted on the imbalanced dataset collected from RSTF and the proposed strategy exhibits its superiority over some existing algorithms.

[1]  S. Nie,et al.  Rapid identification of edible oil species using supervised support vector machine based on low-field nuclear magnetic resonance relaxation features. , 2019, Food chemistry.

[2]  David A. Cieslak,et al.  Start Globally, Optimize Locally, Predict Globally: Improving Performance on Imbalanced Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[3]  Jun Liu,et al.  GA-Based Feature Selection Method for Imbalanced Data with Application in Radio Signal Recognition , 2015, Int. J. Comput. Intell. Syst..

[4]  Dursun Delen,et al.  A synthetic informative minority over-sampling (SIMO) algorithm leveraging support vector machine to enhance learning from imbalanced datasets , 2018, Decis. Support Syst..

[5]  Lior Rokach,et al.  Fast-CBUS: A fast clustering-based undersampling method for addressing the class imbalance problem , 2017, Neurocomputing.

[6]  Marti A. Hearst Trends & Controversies: Support Vector Machines , 1998, IEEE Intell. Syst..

[7]  Xin Yao,et al.  MWMOTE--Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning , 2014 .

[8]  Zhao Xinchao A perturbed particle swarm algorithm for numerical optimization , 2010 .

[9]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[10]  Longbing Cao,et al.  Effective detection of sophisticated online banking fraud on extremely imbalanced data , 2012, World Wide Web.

[11]  Seyed Mohammad Hosseini,et al.  A Novel Weighted Support Vector Machine Based on Particle Swarm Optimization for Gene Selection and Tumor Classification , 2012, Comput. Math. Methods Medicine.

[12]  Jian Gao,et al.  A new sampling method for classifying imbalanced data based on support vector machine ensemble , 2016, Neurocomputing.

[13]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[14]  A. Alavudeen Basha,et al.  Classification of mammogram for early detection of breast cancer using SVM classifier and Hough transform , 2019, Measurement.

[15]  Xue-wen Chen,et al.  Combating the Small Sample Class Imbalance Problem Using Feature Selection , 2010, IEEE Transactions on Knowledge and Data Engineering.

[16]  Jing Zhang,et al.  Cost-Sensitive Large margin Distribution Machine for classification of imbalanced data , 2016, Pattern Recognit. Lett..

[17]  Zhen Liu,et al.  A hybrid method based on ensemble WELM for handling multi class imbalance in cancer microarray data , 2017, Neurocomputing.

[18]  Xue Wang,et al.  Wearable biosensor network enabled multimodal daily-life emotion recognition employing reputation-driven imbalanced fuzzy classification , 2017 .

[19]  Zhong Liu,et al.  A Novel Ensemble Method for Imbalanced Data Learning: Bagging of Extrapolation-SMOTE SVM , 2017, Comput. Intell. Neurosci..

[20]  Changyin Sun,et al.  Support vector machine-based optimized decision threshold adjustment strategy for classifying imbalanced data , 2015, Knowl. Based Syst..

[21]  Chen Qiu,et al.  A Novel Minority Cloning Technique for Cost-Sensitive Learning , 2015, Int. J. Pattern Recognit. Artif. Intell..

[22]  Guanghua Xu,et al.  Learning deep representation of imbalanced SCADA data for fault detection of wind turbines , 2019, Measurement.