A GS-MPSO-WKNN method for missing data imputation in wireless sensor networks monitoring manufacturing conditions

Wireless sensor networks have been utilized to monitor complex manufacturing processes but missing data from sensors cause problems for data-based applications. In this paper, a missing data estimation algorithm, GS-MPSO-WKNN (Gaussian mutation and simulated annealing-based memetic particle swarm optimization for weighted K nearest neighbours), based on a weighted K nearest neighbour (WKNN) and memetic computing is proposed. The GS-MPSO developed in our previous work is adopted in order to adjust the feature weights for the WKNN. A real world data set from a semiconductor manufacturing process is used to evaluate GS-MPSO-WKNN. Experimental results show that GS-MPSO-WKNN can reach a higher estimation accuracy, and GS-MPSO-WKNN is also robust to a high missing data ratio.

[1]  Natalio Krasnogor,et al.  Studies on the theory and design space of memetic algorithms , 2002 .

[2]  L.E. Parker,et al.  Classification with missing data in a wireless sensor network , 2008, IEEE SoutheastCon 2008.

[3]  Jianzhong Li,et al.  K-Nearest Neighbor Based Missing Data Estimation Algorithm in Wireless Sensor Networks , 2010, Wirel. Sens. Netw..

[4]  Paul K. Wright,et al.  Trends in wireless sensor networks for manufacturing , 2006, Int. J. Manuf. Res..

[5]  Werasak Kurutach,et al.  Cluster-based KNN missing value imputation for DNA microarray data , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[6]  Yuhua Li,et al.  Causality Challenge: Benchmarking relevant signal components for effective monitoring and process control , 2008, NIPS Causality: Objectives and Assessment.

[7]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[8]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[9]  Fei Qiao,et al.  A novel memetic algorithm and its application to data clustering , 2013, Memetic Comput..

[10]  Leonardo Franco,et al.  Missing data imputation using statistical and machine learning methods in a real breast cancer problem , 2010, Artif. Intell. Medicine.

[11]  James Kennedy,et al.  The particle swarm: social adaptation of knowledge , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).

[12]  Ahmet Arslan,et al.  A hybrid method for imputation of missing values using optimized fuzzy c-means with support vector regression and a genetic algorithm , 2013, Inf. Sci..

[13]  Hitoshi Iba,et al.  Particle swarm optimization with Gaussian mutation , 2003, Proceedings of the 2003 IEEE Swarm Intelligence Symposium. SIS'03 (Cat. No.03EX706).

[14]  Michel Verleysen,et al.  K nearest neighbours with mutual information for simultaneous classification and missing data imputation , 2009, Neurocomputing.

[15]  Giovanni Iacca,et al.  Ockham's Razor in memetic computing: Three stage optimal memetic exploration , 2012, Inf. Sci..

[16]  Ren Jiang PSO Based Feature Weighting Algorithm for KNN , 2007 .

[17]  Paul S. Andrews,et al.  An Investigation into Mutation Operators for Particle Swarm Optimization , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[18]  R. Little A Test of Missing Completely at Random for Multivariate Data with Missing Values , 1988 .

[19]  Lawrence Davis,et al.  A Hybrid Genetic Algorithm for Classification , 1991, IJCAI.

[20]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[21]  Li Li,et al.  Data-based scheduling framework and adaptive dispatching rule of complex manufacturing systems , 2013 .

[22]  Yew-Soon Ong,et al.  Memetic Computation—Past, Present & Future [Research Frontier] , 2010, IEEE Computational Intelligence Magazine.

[23]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[24]  Maurice Clerc,et al.  The particle swarm - explosion, stability, and convergence in a multidimensional complex space , 2002, IEEE Trans. Evol. Comput..

[25]  Michael N. Vrahatis,et al.  Memetic particle swarm optimization , 2007, Ann. Oper. Res..

[26]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[27]  James Kennedy,et al.  Small worlds and mega-minds: effects of neighborhood topology on particle swarm performance , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[28]  Shi Gao,et al.  A method for missing data interpolation by SVR , 2012, 2012 IEEE Symposium on Electrical & Electronics Engineering (EEESYM).

[29]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[30]  Xin Yao,et al.  Evolutionary programming made faster , 1999, IEEE Trans. Evol. Comput..

[31]  James Smith,et al.  A tutorial for competent memetic algorithms: model, taxonomy, and design issues , 2005, IEEE Transactions on Evolutionary Computation.