Performance Evaluations of Diagnostic Prediction with Neural Networks with Data Filters in Different Types

The advent of the information age and the rapid development of IT skills have led to the construction of massive databases, thus the current research focus is shifting to the efficient utilization of these vast volumes of stored information. Among the data mining algorithms that have been applied to this problem, neural networks can be used with various types, qualities, distributions, or volumes of data and they have high predictive power. Thus, neural networks are known to be the most useful and extensible algorithms, whereas logistic analysis has many constraints. In addition, neural networks obtain better results when the assumptions of linear discriminant analysis cannot be satisfied. The present study evaluated a multilayer perceptron (MLP) and radial-basis function network (RBFN), and their performance levels were compared with logistic regression based on cross-validation using the same data. The experiments showed that MLP delivered better performance than other methods in medical diagnostic applications where numerical data are used. MLP also performed better with the heart disease dataset using finely specified data types compared with the diabetes dataset using simple data types.

[1]  Claudio Conversano,et al.  Generalized additive multi-mixture model for data mining , 2002 .

[2]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[3]  Bogdan Wilamowski,et al.  Comparison between traditional neural networks and radial basis function networks , 2011, 2011 IEEE International Symposium on Industrial Electronics.

[4]  David R. Cox,et al.  The Oxford Dictionary of Statistical Terms , 2006 .

[5]  Leonardo Noriega,et al.  Multilayer Perceptron Tutorial , 2005 .

[6]  Suresh Babu Changalasetty,et al.  OPTIMUM LEARNING RATE FOR CLASSIFICATION PROBLEM WITH MLP IN DATA MINING , 2013 .

[7]  Richard S. Johannes,et al.  Using the ADAP Learning Algorithm to Forecast the Onset of Diabetes Mellitus , 1988 .

[8]  A. W. Jayawardena,et al.  Comparison of Multilayer Perceptron and radial Basis Function networks as tools for flood forecasting , 1996 .

[9]  Iñaki Albisua,et al.  The quest for the optimal class distribution: an approach for enhancing the effectiveness of learning via resampling methods for imbalanced data sets , 2013, Progress in Artificial Intelligence.

[10]  Yongjian Fu,et al.  Data mining , 1997 .

[11]  Haijia Shi Best-first Decision Tree Learning , 2007 .

[12]  Taghi M. Khoshgoftaar,et al.  A comparative study of filter-based feature ranking techniques , 2010, 2010 IEEE International Conference on Information Reuse & Integration.

[13]  Taeho Jo,et al.  A Multiple Resampling Method for Learning from Imbalanced Data Sets , 2004, Comput. Intell..

[14]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.