Testing the Suitability of Automated Machine Learning for Weeds Identification

In the past years, several machine-learning-based techniques have arisen for providing effective crop protection. For instance, deep neural networks have been used to identify different types of weeds under different real-world conditions. However, these techniques usually require extensive involvement of experts working iteratively in the development of the most suitable machine learning system. To support this task and save resources, a new technique called Automated Machine Learning has started being studied. In this work, a complete open-source Automated Machine Learning system was evaluated with two different datasets, (i) The Early Crop Weeds dataset and (ii) the Plant Seedlings dataset, covering the weeds identification problem. Different configurations, such as the use of plant segmentation, the use of classifier ensembles instead of Softmax and training with noisy data, have been compared. The results showed promising performances of 93.8% and 90.74% F1 score depending on the dataset used. These performances were aligned with other related works in AutoML, but they are far from machine-learning-based systems manually fine-tuned by human experts. From these results, it can be concluded that finding a balance between manual expert work and Automated Machine Learning will be an interesting path to work in order to increase the efficiency in plant protection.

[1]  Marcel Salathé,et al.  Using Deep Learning for Image-Based Plant Disease Detection , 2016, Front. Plant Sci..

[2]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[3]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, ICANN.

[4]  Lars Kotthoff,et al.  Auto-WEKA 2.0: Automatic model selection and hyperparameter optimization in WEKA , 2017, J. Mach. Learn. Res..

[5]  Nikos Mylonas,et al.  Towards weeds identification assistance through transfer learning , 2020, Comput. Electron. Agric..

[6]  Darko Stefanovic,et al.  Deep Neural Networks Based Recognition of Plant Diseases by Leaf Image Classification , 2016, Comput. Intell. Neurosci..

[7]  Spyros Fountas,et al.  Improving weeds identification with a repository of agricultural pre-trained deep neural networks , 2020, Comput. Electron. Agric..

[8]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[9]  Aaron Klein,et al.  Efficient and Robust Automated Machine Learning , 2015, NIPS.

[10]  Brian C. Ross Mutual Information between Discrete and Continuous Data Sets , 2014, PloS one.

[11]  Nathan Halko,et al.  Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions , 2009, SIAM Rev..

[12]  Henrik Skov Midtiby,et al.  Plant species classification using deep convolutional neural network , 2016 .

[13]  Spyros Fountas,et al.  Big Data for weed control and crop protection , 2017 .

[14]  O. Mutanga,et al.  Automated classification of a tropical landscape infested by Parthenium weed (Parthenium hyterophorus) , 2020 .

[15]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[16]  Masayuki Hayashi,et al.  Automated machine learning for identification of pest aphid species (Hemiptera: Aphididae) , 2019, Applied Entomology and Zoology.

[17]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[18]  Bo Fu,et al.  A salt and pepper noise image denoising method based on the generative classification , 2018, Multimedia Tools and Applications.

[19]  Gonzalo Pajares,et al.  Fleets of robots for environmentally-safe pest control in agriculture , 2017, Precision Agriculture.

[20]  Jayme Garcia Arnal Barbedo,et al.  Factors influencing the use of deep learning for plant disease recognition , 2018, Biosystems Engineering.

[21]  Rasmus Nyholm Jørgensen,et al.  A Public Image Database for Benchmark of Plant Seedling Classification Algorithms , 2017, ArXiv.

[22]  German Spangenberg,et al.  Automated Machine Learning for High-Throughput Image-Based Plant Phenotyping , 2020, bioRxiv.

[23]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[24]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[25]  F. López-Granados,et al.  Is the current state of the art of weed monitoring suitable for site-specific weed management in arable crops? , 2018 .

[26]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[27]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[28]  Joris IJsselmuiden,et al.  Transfer learning for the classification of sugar beet and volunteer potato under field conditions , 2018, Biosystems Engineering.

[29]  Daniele Nardi,et al.  Fast and Accurate Crop and Weed Identification with Summarized Train Sets for Precision Agriculture , 2016, IAS.

[30]  Bernhard Schölkopf,et al.  Kernel Principal Component Analysis , 1997, International Conference on Artificial Neural Networks.

[31]  Mostafa Rahimi Azghadi,et al.  DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning , 2018, Scientific Reports.