Missing Attribute Value Prediction Based on Artificial Neural Network and Rough Set Theory

In this research, an artificial neural network (ANN) combined with rough set theory (RST), named ANNRST, is proposed to predict missing attribute values. The method is applied to heart disease data from the UCI datasets. The ANN is a multilayer perceptron (MLP) trained with resilient back-propagation. RST reduces the dimensionality of the attribute set through its reduct, and the reduct, together with the decision attribute, is used as the input of the ANN. Using simulated missing values, the prediction accuracy of the ANN is compared with that of ANNRST. The accuracy of ANNRST is also compared with three missing-data imputation methods: k-Nearest Neighbor (k-NN), the most common attribute value method, and an ANN with piecewise linear network-orthonormal least square (PLN-OLS) feature selection. Simulation results show that ANNRST predicts missing values with a maximum accuracy close to that of the ANN without dimensionality reduction (pure ANN) and outperforms k-NN, the most common attribute value method, and the ANN with PLN-OLS.
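
To illustrate how a reduct lowers the number of input attributes, the following minimal sketch finds the smallest attribute subsets that preserve the rough-set dependency of the decision attribute on the full condition set. It is an illustration only and not the procedure used in this research: the brute-force search, the function names (`partition`, `dependency`, `minimal_reducts`), and the toy table with its attribute names are all assumptions and are not taken from the UCI heart disease data.

```python
from itertools import combinations

def partition(rows, attrs):
    """Group row indices into indiscernibility classes over the given attributes."""
    blocks = {}
    for i, row in enumerate(rows):
        key = tuple(row[a] for a in attrs)
        blocks.setdefault(key, []).append(i)
    return list(blocks.values())

def dependency(rows, attrs, decision):
    """Fraction of rows in the positive region, i.e. rows whose
    indiscernibility class carries a single decision value."""
    positive = 0
    for block in partition(rows, attrs):
        if len({rows[i][decision] for i in block}) == 1:
            positive += len(block)
    return positive / len(rows)

def minimal_reducts(rows, conditions, decision):
    """Smallest attribute subsets that keep the dependency of the full
    condition set. Brute force, so only suitable for a few attributes."""
    full = dependency(rows, conditions, decision)
    for size in range(1, len(conditions) + 1):
        found = [subset for subset in combinations(conditions, size)
                 if dependency(rows, subset, decision) == full]
        if found:
            return found
    return [tuple(conditions)]

# Hypothetical toy decision table (made-up values, for illustration only).
rows = [
    {"age_group": 0, "chest_pain": 0, "high_bp": 1, "disease": "no"},
    {"age_group": 0, "chest_pain": 1, "high_bp": 1, "disease": "yes"},
    {"age_group": 1, "chest_pain": 0, "high_bp": 0, "disease": "no"},
    {"age_group": 1, "chest_pain": 1, "high_bp": 0, "disease": "yes"},
    {"age_group": 0, "chest_pain": 1, "high_bp": 0, "disease": "yes"},
]
conditions = ["age_group", "chest_pain", "high_bp"]

reducts = minimal_reducts(rows, conditions, "disease")
print(reducts)
```

In this toy table a single attribute is enough to discern the decision, so a network fed with the reduct would need one condition input instead of three. For realistic attribute counts an exhaustive search becomes too expensive, and a heuristic reduct search would normally replace it.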