Thalassaemia classification by neural networks and genetic programming

This paper presents the use of a neural network and a decision tree, which is evolved by genetic programming (GP), in thalassaemia classification. The aim is to differentiate between thalassaemic patients, persons with thalassaemia trait and normal subjects by inspecting characteristics of red blood cells, reticulocytes and platelets. A structured representation on genetic algorithms for non-linear function fitting or STROGANOFF is the chosen architecture for genetic programming implementation. For comparison, multilayer perceptrons are explored in classification via a neural network. The classification results indicate that the performance of the GP-based decision tree is approximately equal to that of the multilayer perceptron with one hidden layer. But the multilayer perceptron with two hidden layers, which is proven to have the most suitable architecture among networks with different number of hidden layers, outperforms the GP-based decision tree. Nonetheless, the structure of the decision tree reveals that some input features have no effects on the classification performance. The results confirm that the classification accuracy of the multilayer perceptron with two hidden layers can still be maintained after the removal of the redundant input features. Detailed analysis of the classification errors of the multilayer perceptron with two hidden layers, in which a reduced feature set is used as the network input, is also included. The analysis reveals that the classification ambiguity and misclassification among persons with minor thalassaemia trait and normal subjects is the main cause of classification errors. These results suggest that a combination of a multilayer perceptron with a blood cell analysis may give rise to a guideline/hint for further investigation of thalassaemia classification.

[1]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[2]  C. Jiménez,et al.  New indices from the H*2 analyser improve differentiation between heterozygous β or δβ thalassaemia and iron‐deficiency anaemia , 1995 .

[3]  A. G. Ivakhnenko,et al.  Polynomial Theory of Complex Systems , 1971, IEEE Trans. Syst. Man Cybern..

[4]  Hitoshi Iba,et al.  Genetic programming using a minimum description length principle , 1994 .

[5]  Hitoshi Iba,et al.  Inductive genetic programming of polynomial learning networks , 2000, 2000 IEEE Symposium on Combinations of Evolutionary Computation and Neural Networks. Proceedings of the First IEEE Symposium on Combinations of Evolutionary Computation and Neural Networks (Cat. No.00.

[6]  P R Lund,et al.  Automated classification of anaemia using image analysis. , 1972, Lancet.

[7]  Cheng-Seen Ho,et al.  A new hybrid case-based architecture for medical diagnosis , 2004, Inf. Sci..

[8]  Oliver Nelles,et al.  Genetic programming for model selection of TSK-fuzzy systems , 2001, Inf. Sci..

[9]  M Stefanelli,et al.  A performance evaluation of the expert system ANEMIA. , 1988, Computers and biomedical research, an international journal.

[10]  D J Weatherall,et al.  The thalassemia syndromes. , 2016, Texas reports on biology and medicine.

[11]  K A Spackman,et al.  An expert system to diagnose anemia and report results directly on hematology forms. , 1996, Computers and biomedical research, an international journal.

[12]  Hitoshi Iba,et al.  Regularization approach to inductive genetic programming , 2001, IEEE Trans. Evol. Comput..

[13]  Abdurrahman Kara,et al.  Most reliable indices in differentiation between thalassemia trait and iron deficiency anemia , 2002, Pediatrics international : official journal of the Japan Pediatric Society.

[14]  M Stefanelli,et al.  Classification of anaemia on the basis of ferrokinetic parameters , 1985, British journal of haematology.

[15]  Lalit M. Patnaik,et al.  Genetic programming based pattern classification with feature space partitioning , 2001, Inf. Sci..

[16]  Giovanni Luca Christian Masala,et al.  A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening , 2003 .

[17]  Marian B. Gorzalczany,et al.  Neuro-fuzzy Approach versus Rough-Set Inspired Methodology for Intelligent Decision Support , 1999, Inf. Sci..

[18]  B. Golosio,et al.  A Real-Time Classification System of Thalassemic Pathologies Based on Artificial Neural Networks , 2002, Medical decision making : an international journal of the Society for Medical Decision Making.

[19]  Hitoshi Iba,et al.  Overfitting avoidance in genetic programming of polynomials , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[20]  Inés Couso,et al.  Combining GP operators with SA search to evolve fuzzy rule based classifiers , 2001, Inf. Sci..

[21]  B J Flehinger,et al.  HEME: a computer aid to diagnosis of hematologic disease. , 1976, Bulletin of the New York Academy of Medicine.

[22]  M Stefanelli,et al.  NEOANEMIA: a knowledge-based system emulating diagnostic reasoning. , 1990, Computers and biomedical research, an international journal.

[23]  J. K. Kinnear,et al.  Advances in Genetic Programming , 1994 .

[24]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[25]  S. Fucharoen,et al.  Hemoglobinopathies in Southeast Asia: molecular biology and clinical medicine. , 1997, Hemoglobin.

[26]  M Stefanelli,et al.  ANEMIA: an expert consultation system. , 1986, Computers and biomedical research, an international journal.

[27]  Athanasios Tsakonas,et al.  A comparison of classification accuracy of four genetic programming-evolved intelligent structures , 2006, Inf. Sci..