Performance evaluation of deep neural ensembles toward malaria parasite detection in thin-blood smear images

Background Malaria is a life-threatening disease caused by Plasmodium parasites that infect the red blood cells (RBCs). Manual identification and counting of parasitized cells in microscopic thick/thin-film blood examination remains the common, but burdensome method for disease diagnosis. Its diagnostic accuracy is adversely impacted by inter/intra-observer variability, particularly in large-scale screening under resource-constrained settings. Introduction State-of-the-art computer-aided diagnostic tools based on data-driven deep learning algorithms like convolutional neural network (CNN) has become the architecture of choice for image recognition tasks. However, CNNs suffer from high variance and may overfit due to their sensitivity to training data fluctuations. Objective The primary aim of this study is to reduce model variance, improve robustness and generalization through constructing model ensembles toward detecting parasitized cells in thin-blood smear images. Methods We evaluate the performance of custom and pretrained CNNs and construct an optimal model ensemble toward the challenge of classifying parasitized and normal cells in thin-blood smear images. Cross-validation studies are performed at the patient level to ensure preventing data leakage into the validation and reduce generalization errors. The models are evaluated in terms of the following performance metrics: (a) Accuracy; (b) Area under the receiver operating characteristic (ROC) curve (AUC); (c) Mean squared error (MSE); (d) Precision; (e) F-score; and (f) Matthews Correlation Coefficient (MCC). Results It is observed that the ensemble model constructed with VGG-19 and SqueezeNet outperformed the state-of-the-art in several performance metrics toward classifying the parasitized and uninfected cells to aid in improved disease screening. Conclusions Ensemble learning reduces the model variance by optimally combining the predictions of multiple models and decreases the sensitivity to the specifics of training data and selection of training algorithms. The performance of the model ensemble simulates real-world conditions with reduced variance, overfitting and leads to improved generalization.

[1]  Chandan Chakraborty,et al.  Machine learning approach for automated screening of malaria parasite using light microscopic images. , 2013, Micron.

[2]  P. L. DaviesFebruary THE ONE-WAY ANALYSIS OF VARIANCE , 1997 .

[3]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[5]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[6]  Honglak Lee,et al.  Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units , 2016, ICML.

[7]  Xiangji Huang,et al.  CNN-based image analysis for malaria diagnosis , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[8]  P. Lakhani,et al.  Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks. , 2017, Radiology.

[9]  David M. Rubin,et al.  Automated image processing method for the diagnosis and classification of malaria on thin blood smears , 2006, Medical and Biological Engineering and Computing.

[10]  G. R. K. Sai Subrahmanyam,et al.  Convolutional neural network‐based malaria diagnosis from focus stack of blood smear images acquired using custom‐built slide scanner , 2018, Journal of biophotonics.

[11]  Heinrich Magnus Manske,et al.  LookSeq: a browser-based viewer for deep sequencing data. , 2009, Genome research.

[12]  Hilal Olgun Kucuk,et al.  Importance of using proper post hoc test with ANOVA. , 2016, International journal of cardiology.

[13]  C. Dolea,et al.  World Health Organization , 1949, International Organization.

[14]  Madhu S. Nair,et al.  Malaria Parasite Detection From Peripheral Blood Smear Images Using Deep Belief Networks , 2017, IEEE Access.

[15]  Yuhang Dong,et al.  Evaluations of deep convolutional neural networks for automatic identification of malaria infected cells , 2017, 2017 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[16]  J. Gastwirth,et al.  The impact of Levene’s test of equality of variances on statistical theory and practice , 2009, 1010.0308.

[17]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[18]  K. Mitiku,et al.  The reliability of blood film examination for malaria at the peripheral health unit , 2003 .

[19]  Joseph S. Rossi One-Way Anova from Summary Statistics , 1987 .

[20]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[21]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[22]  Mahdieh Poostchi,et al.  Image analysis and machine learning for detecting malaria , 2018, Translational research : the journal of laboratory and clinical medicine.

[23]  George R. Thoma,et al.  A novel stacked generalization of models for improved TB detection in chest radiographs , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[24]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[25]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[26]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[27]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[28]  George R. Thoma,et al.  Pre-trained convolutional neural networks as feature extractors toward improved malaria parasite detection in thin blood smear images , 2018, PeerJ.