Application of extreme learning machine in plant disease prediction for highly imbalanced dataset

Abstract Plant diseases are responsible for global economic losses due to degradation in the quality and productivity of plants. Therefore, plant disease prediction has become an essential area of research for agricultural scientists. The current study implements Extreme Learning Machine (ELM) algorithm for plant disease prediction based on a dataset collected in real time scenario namely Tomato Powdery Mildew Disease (TPMD) dataset. Since, the collected TPMD dataset was imbalanced thus; various resampling techniques namely Importance Sampling (IMPS), Synthetic Minority Over-sampling Technique (SMOTE), Random under Sampling (RUS), and Random over Sampling (ROS) have been used here for balancing the dataset before using it in the specified prediction model. ELM models have been developed for each of the balanced TPMD datasets obtained from these resampling techniques as well as for the imbalanced TPMD dataset. Area under Curve (AUC) and Classification Accuracy (CA) are considered as performance measures to analyze the performance of ELM model. Results show that ELM performed better for TPMD dataset using resampling techniques. Best results are obtained with IMPS technique having maximum values for CA and AUC i.e. 89.19% and 88.57% respectively.

[1]  Amit Prakash Singh,et al.  Hybrid SVM-LR Classifier for Powdery Mildew Disease Prediction in Tomato Plant , 2020, 2020 7th International Conference on Signal Processing and Integrated Networks (SPIN).

[2]  Zhuang Fengqing,et al.  Patients’ Responsibilities in Medical Ethics , 2016 .

[3]  Cristina E. Davis,et al.  Advanced methods of plant disease detection. A review , 2014, Agronomy for Sustainable Development.

[4]  Sang Cheol Kim,et al.  A Robust Deep-Learning-Based Detector for Real-Time Tomato Plant Diseases and Pests Recognition , 2017, Sensors.

[5]  Gajendra P. S. Raghava,et al.  BMC Bioinformatics BioMed Central Methodology article Machine learning techniques in disease forecasting: a case study on rice blast prediction , 2006 .

[6]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[7]  L. Plümer,et al.  Original paper: Early detection and classification of plant diseases with Support Vector Machines based on hyperspectral reflectance , 2010 .

[8]  Amit Prakash Singh,et al.  Application of convolutional neural networks for evaluation of disease severity in tomato plant , 2020, Journal of Discrete Mathematical Sciences and Cryptography.

[9]  S. Mukherjee,et al.  Deep learning with perspective modeling for early detection of malignancy in mammograms , 2019, Journal of Discrete Mathematical Sciences and Cryptography.

[10]  Mike Preuss,et al.  Forecasting Soybean Diseases from Symptoms by Means of Evolutionary Support Vector Machines , 2006 .

[11]  Shikha Gupta,et al.  Software maintainability prediction using an enhanced random forest algorithm , 2020 .

[12]  Raj Kamal,et al.  An improved random forest classifier for multi-class classification , 2016 .

[13]  Donghui Yang,et al.  Predicting non-performing loan of business bank by multiple classifier fusion algorithms , 2016 .

[14]  S. Chakraborty,et al.  Weather-based prediction of anthracnose severity using artificial neural network models , 2004 .

[15]  Amit Prakash Singh,et al.  Prediction Models for Identification and Diagnosis of Tomato Plant Diseases , 2018, 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[16]  Amit Prakash Singh,et al.  Deep Learning-Based Mobile Application for Plant Disease Diagnosis , 2019, Advances in Environmental Engineering and Green Technologies.

[17]  Hesham Hefny,et al.  Tomato leaves diseases detection approach based on Support Vector Machines , 2015, 2015 11th International Computer Engineering Conference (ICENCO).

[18]  Dumitru Dumitrescu,et al.  Evolutionary Support Vector Regression Machines , 2006, 2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[19]  Anuradha Chug,et al.  Recent Advancements in Multimedia Big Data Computing for IoT Applications in Precision Agriculture: Opportunities, Issues, and Challenges , 2019, Intelligent Systems Reference Library.

[20]  Virendra P. Vishwakarma,et al.  An adaptive illumination normalization using non-linear regression for robust person identification under varying illuminations , 2020 .

[21]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.