Improving model robustness for soybean iron deficiency chlorosis rating by unsupervised pre-training on unmanned aircraft system derived images

Abstract Iron deficiency chlorosis (IDC) is a major yield-limiting factor for soybean production in the mid-western USA. The most practical solution in mitigating losses due to IDC is the development and characterization of IDC tolerant varieties. Leveraging the advanced technique of unmanned aircraft system (UAS) and the thriving deep learning methodology, a convolutional neural network (CNN) could be trained to assist breeders with IDC resistance selection. However, a known difficulty in IDC screening is that the symptoms often vary across diverse genetic backgrounds and spatial or temporal soil heterogeneities. A robust CNN model is desired to mitigate such difficulty. While high robustness usually relies on a sufficiently large labeled training data, the available labeled samples in most breeding programs are normally not enough. Under this limitation, it is critical to find an alternative way to train a robust model. The solution proposed in this study was to apply unsupervised pre-training on the unlabeled aerial images that are much easier to obtain by the UAS. Specifically, a convolutional autoencoder (CAE) was pre-trained on unlabeled sub-images clipped from aerial RGB images; then, the pre-trained weights were reused to initialize the CNN model that was trained on labeled plot-wise sub-images clipped from stitched RGB maps. To test the robustness of this CAE initialized model (CAE1-CNN), two baseline models were equally trained: the first was CAE2-CNN, where the CAE2 was pre-trained with three times of unlabeled data as that of CAE1, by adding wniter wheat and sorghum aerial images; the second was Ran-CNN where the CNN was randomly initialized. Three conditions were considered for testing model robustness: different soybean trials, field locations and vegetative growth stages. Results revealed that both the CAE1-CNN and the CAE2-CNN had relatively better robustness than the Ran-CNN model, i.e., higher R2 and lower RMSE values, especially on different soybean trials and growth stages, which proved that the unsupervised pre-training added gains to the model robustness across diverse trials and growth stages. Similar performances were found between the CAE1-CNN andthe CAE2-CNN model, suggesting that augmenting the unlabled data did not bring significant improvement to model robustness. Additionally, during robustness test on different soybean trials, the unsupervised pre-training seemly showed the potential of alleviating the required number of labeled training samples. These promising findings could contribute to the research on crop stresses by providing a potential path towards developing a robust system for classifying or predicting stress severities under more varied conditions.

[1]  Y. Ge,et al.  Field-Based Scoring of Soybean Iron Deficiency Chlorosis Using RGB Imaging and Statistical Learning , 2018, Front. Plant Sci..

[2]  Shunming Li,et al.  Construction of a batch-normalized autoencoder network and its application in mechanical intelligent fault diagnosis , 2018, Measurement Science and Technology.

[3]  Baskar Ganapathysubramanian,et al.  Computer vision and machine learning for robust phenotyping in genome-wide studies , 2017, Scientific Reports.

[4]  Baskar Ganapathysubramanian,et al.  A real-time phenotyping framework using machine learning for plant stress severity rating in soybean , 2017, Plant Methods.

[5]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[6]  Nikolaos Doulamis,et al.  Deep Learning for Computer Vision: A Brief Review , 2018, Comput. Intell. Neurosci..

[7]  S. Naeve,et al.  Iron deficiency chlorosis in soybean: Soybean seeding rate and companion crop effects , 2006 .

[8]  S. Naeve,et al.  Genotype × Environment Interactions within Iron Deficiency Chlorosis-Tolerant Soybean Genotypes , 2006 .

[9]  Y. Ge,et al.  Principal variable selection to explain grain yield variation in winter wheat from features extracted from UAV imagery , 2019, Plant Methods.

[10]  Blaise Hanczar,et al.  Phenotypes Prediction from Gene Expression Data with Deep Multilayer Perceptron and Unsupervised Pre-training , 2018 .

[11]  S. Z. Gürbüz,et al.  Deep convolutional autoencoder for radar-based classification of similar aided and unaided human activities , 2018, IEEE Transactions on Aerospace and Electronic Systems.

[12]  A. Lorenz,et al.  Soybean iron deficiency chlorosis high throughput phenotyping using an unmanned aircraft system , 2019, Plant Methods.

[13]  E. Santen,et al.  Iron Chelates Alleviate Iron Chlorosis in Soybean on High pH Soils , 2014 .

[14]  Flavio Esposito,et al.  UAV-Based High Resolution Thermal Imaging for Vegetation Monitoring, and Plant Phenotyping Using ICI 8640 P, FLIR Vue Pro R 640, and thermoMap Cameras , 2019, Remote. Sens..

[15]  Fei Wang,et al.  Deep learning for healthcare: review, opportunities and challenges , 2018, Briefings Bioinform..

[16]  Ruqiang Yan,et al.  A sparse auto-encoder-based deep neural network approach for induction motor faults classification , 2016 .

[17]  David W. Franzen,et al.  Soybean Iron-Deficiency Chlorosis Tolerance and Yield Decrease on Calcareous Soils , 2010 .

[18]  R. Shoemaker,et al.  Mapping genetic loci for iron deficiency chlorosis in soybean , 1997, Molecular Breeding.

[19]  Pascual Campoy Cervera,et al.  A Review of Deep Learning Methods and Applications for Unmanned Aerial Vehicles , 2017, J. Sensors.

[20]  Yoshua Bengio,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2010, AISTATS.

[21]  Forrest W. Nutter,et al.  Assessing the accuracy, intra-rater repeatability, and inter-rater reliability of disease assessment systems , 1993 .

[22]  Abeer Alsadoon,et al.  Deep Learning for Aspect-Based Sentiment Analysis: A Comparative Review , 2019, Expert Syst. Appl..

[23]  Klaus H. Maier-Hein,et al.  Exploiting the potential of unlabeled endoscopic video data with self-supervised learning , 2017, International Journal of Computer Assisted Radiology and Surgery.

[24]  S. Sankaran,et al.  Low-altitude, high-resolution aerial imaging systems for row and field crop phenotyping: A review , 2015 .

[25]  D. Fairbanks Development of genetic resistance to iron‐deficiency chlorosis in soybean , 2000 .

[26]  David W. Franzen,et al.  Soil factors affecting iron chlorosis of soybean in the red river valley of North Dakota and Minnesota , 2000 .

[27]  Baskar Ganapathysubramanian,et al.  An explainable deep machine vision framework for plant stress phenotyping , 2018, Proceedings of the National Academy of Sciences.

[28]  R. J. Goos,et al.  A comparison of three methods for reducing iron-deficiency chlorosis in soybean. , 2000 .

[29]  Wei Xiong,et al.  Stacked Convolutional Denoising Auto-Encoders for Feature Representation , 2017, IEEE Transactions on Cybernetics.

[30]  Binhua Tang,et al.  Recent Advances of Deep Learning in Bioinformatics and Computational Biology , 2019, Front. Genet..

[31]  I. C. Anderson,et al.  Genotypic Evaluation for Iron Deficiency Chlorosis in Soybeans by Visual Scores and Chlorophyll Concentration 1 , 1979 .

[32]  William D. Philpot,et al.  Toward the Discrimination of Manganese, Zinc, Copper, and Iron Deficiency in ‘Bragg’ Soybean Using Spectral Detection Methods , 2000 .

[33]  Yansheng Li,et al.  Unsupervised Spectral–Spatial Feature Learning With Stacked Sparse Autoencoder for Hyperspectral Imagery Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[34]  Jun Li,et al.  Structural damage identification based on autoencoder neural networks and deep learning , 2018, Engineering Structures.