Assessing Active Learning Strategies to Improve the Quality Control of the Soybean Seed Vigor

Seed companies increasingly seek excellence in production quality through rigorous processes, such as the tetrazolium test (TZ test) and the vigor definition. However, these are extremely laborious processes since it necessitates the experience of a specialist and the visual analysis of a considerable quantity of seeds as sampling for determining the vigor of the seed lot.Moreover, although the TZ test has a defined protocol, this analysis may vary from analyst to analyst because it is a subjective human process. In this context, several efforts have been carried out in an attempt to automate the analysis process, in order to reduce their intrinsic problems. Thus, this article presents approaches for the learning and classification of the soybean seed vigor. In addition, alternative active learning strategies are proposed to improve the selection of the most informative samples for the learning process. An extensive experimental evaluation is performed considering different datasets and state-of-the-art learning techniques. Based on the obtained results, it is possible to observe that active learning approaches lead to more robust classifiers, which reach higher accuracies faster (in less learning iterations) than traditional supervised learning approaches. We also obtained a reduction of $\text{95.22}{\%}$ of labeled samples used in the learning process.

[1]  Naif Alajlan,et al.  Large-Scale Image Classification Using Active Learning , 2014, IEEE Geoscience and Remote Sensing Letters.

[2]  Pedro Henrique Bugatti,et al.  Content-based image retrieval towards the automatic characterization of soybean seed vigor , 2014, SAC.

[3]  Pedro Jussieu de Rezende,et al.  Robust active learning for the diagnosis of parasites , 2015, Pattern Recognit..

[4]  Umaa Rebbapragada,et al.  Using Ensemble Decisions and Active Selection to Improve Low-Cost Labeling for Multi-View Data , 2011 .

[5]  Pedro Henrique Bugatti,et al.  An image analysis framework for effective classification of seed damages , 2016, SAC.

[6]  Xin Li,et al.  Adaptive Active Learning for Image Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[8]  F. C. Krzyzanowski,et al.  The tetrazolium test for soybean seeds. , 1998 .

[9]  A. L. Filho,et al.  Physiological quality of dynamically cooled and stored soybean seeds , 2013 .

[10]  João Paulo Papa,et al.  Efficient supervised optimum-path forest classification for large datasets , 2012, Pattern Recognit..

[11]  Pedro Jussieu de Rezende,et al.  An active learning paradigm based on a priori data reduction and organization , 2014, Expert Syst. Appl..

[12]  Nikolaos Papanikolopoulos,et al.  Scalable Active Learning for Multiclass Image Classification , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  K. Fujimura,et al.  An automated system for vigor testing three-day-old soybean seedlings , 2003 .

[14]  Tao Jiang,et al.  Column subset selection for active learning in image classification , 2011, Neurocomputing.

[15]  Moacir Ponti,et al.  An incremental linear-time learning algorithm for the Optimum-Path Forest classifier , 2016, Inf. Process. Lett..

[16]  Alexandre X. Falcão,et al.  Choosing the Most Effective Pattern Classification Model under Learning-Time Constraint , 2015, PloS one.

[17]  Pedro H. Bugatti,et al.  Contributing to agriculture by using soybean seed data from the tetrazolium test , 2019, Data in brief.