Colon Tumor Microarray Classification Using Neural Network with Feature Selection and Rule-Based Classification

The efficient feature subset selection for predictive and accurate classification is highly desirable in bioinformatic datasets. This paper proposes a method to apply our previously proposed neural network to microarray classification problem. The adjustable linguistic features are embedded in the network structure. After the training process, the informative features are selected. The network performs classification task either by the direct calculation or by rule-based approach. The structure of the three-layer feedforward neural network is designed with the consideration of useful information during the training process. The hidden layer is embedded with the linguistic feature tuning and mechanism for rule extraction. The colon tumor microarray dataset is used in the experiments. Good results from both direct calculation and from logical rules are achieved using the 10-fold cross validation. The results demonstrate the importance of the linguistic features selected by the network. The results show that the proposed method achieves better classification performance than the other previously proposed methods.

[1]  Dimitris K. Tasoulis,et al.  Differential Evolution Algorithms for Finding Predictive Gene Subsets in Microarray Data , 2006, AIAI.

[2]  Jude Shavlik,et al.  Refinement ofApproximate Domain Theories by Knowledge-Based Neural Networks , 1990, AAAI.

[3]  Jacek M. Zurada,et al.  Computational intelligence methods for rule-based data understanding , 2004, Proceedings of the IEEE.

[4]  Wlodzislaw Duch,et al.  A new methodology of extraction, optimization and application of crisp and fuzzy logical rules , 2001, IEEE Trans. Neural Networks.

[5]  Seyed Mostafa Fakhrahmad,et al.  Efficient Fuzzy Rule Generation: A New Approach Using Data Mining Principles and Rule Weighting , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[6]  Enrique Alba,et al.  Hybrid DE-SVM Approach for Feature Selection: Application to Gene Expression Datasets , 2009, 2009 2nd International Symposium on Logistics and Industrial Informatics.

[7]  S. Hashimoto,et al.  An Interpretable Neural Network Ensemble , 2007, IECON 2007 - 33rd Annual Conference of the IEEE Industrial Electronics Society.

[8]  Kwong-Sak Leung,et al.  Classification of Heterogeneous Fuzzy Data by Choquet Integral With Fuzzy-Valued Integrand , 2007, IEEE Transactions on Fuzzy Systems.

[9]  Sun I. Kim,et al.  Nonlinear Support Vector Machine Visualization for Risk Factor Analysis Using Nomograms and Localized Radial Basis Function Kernels , 2008, IEEE Transactions on Information Technology in Biomedicine.

[10]  M. Teshnehlab,et al.  Hierarchical Takagi-Sugeno type fuzzy system for diabetes mellitus forecasting , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[11]  Nikhil R. Pal,et al.  A neuro-fuzzy scheme for simultaneous feature selection and fuzzy rule-based classification , 2004, IEEE Transactions on Neural Networks.

[12]  Xi-Zhao Wang,et al.  A new approach to weighted fuzzy production rule extraction from neural networks , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[13]  Sansanee Auephanwiriyakul,et al.  A novel neuro-fuzzy method for linguistic feature selection and rule-based classification , 2010, 2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE).

[14]  W. Wettayaprasit,et al.  Linguistic Knowledge Extraction from Neural Networks Using Maximum Weight and Frequency Data Representation , 2006, 2006 IEEE Conference on Cybernetics and Intelligent Systems.