Computer vision approach to characterize size and shape phenotypes of horticultural crops using high-throughput imagery

For many horticultural crops, variation in quality (e.g., shape and size) contribute significantly to the crop’s market value. Metrics characterizing less subjective harvest quantities (e.g., yield and total biomass) are routinely monitored. In contrast, metrics quantifying more subjective crop quality characteristics such as ideal size and shape remain difficult to characterize objectively at the production-scale due to the lack of modular technologies for high-throughput sensing and computation. Several horticultural crops are sent to packing facilities after having been harvested, where they are sorted into boxes and containers using high-throughput scanners. These scanners capture images of each fruit or vegetable being sorted and packed, but the images are typically used solely for sorting purposes and promptly discarded. With further analysis, these images could offer unparalleled insight on how crop quality metrics vary at the industrial production-scale and provide further insight into how these characteristics translate to overall market value. At present, methods for extracting and quantifying quality characteristics of crops using images generated by existing industrial infrastructure have not been developed. Furthermore, prior studies that investigated horticultural crop quality metrics, specifically of size and shape, used a limited number of samples, did not incorporate deformed or non-marketable samples, and did not use images captured from high-throughput systems. In this work, using sweetpotato (SP) as a use case, we introduce a computer vision algorithm for quantifying shape and size characteristics in a high-throughput manner. This approach generates 3D model of SPs from two 2D images captured by an industrial sorter 90 degrees apart and extracts 3D shape features in a few hundred milliseconds. We applied the 3D reconstruction and feature extraction method to thousands of image samples to demonstrate how variations in shape features across sweetptoato cultivars can be quantified. We created a sweetpotato shape dataset containing sweetpotato images, extracted shape features, and qualitative shape types (U.S. No. 1 or Cull). We used this dataset to develop a neural network-based shape classifier that was able to predict Cull vs. U.S. No. 1 sweetpotato with 84.59% accuracy. In addition, using univariate Chi-squared tests and random forest, we identified the most important features for determining qualitative shape (U.S. No. 1 or Cull) of the sweetpotatoes. Our study serves as the first step towards enabling big data analytics for sweetpotato agriculture. The methodological framework is readily transferable to other horticultural crops, particularly those that are sorted using commercial imaging equipment.

[1]  Arthur Villordon,et al.  Direct Measurement of Sweetpotato Surface Area and Volume Using a Low-cost 3D Scanner for Identification of Shape Features Related to Processing Product Recovery , 2020, HortScience.

[2]  I. Dan,et al.  CLASSIFICATION OF STRAWBERRY FRUIT SHAPE BY MACHINE LEARNING , 2018, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[3]  Richard H. Moore Nasty Weather and Ugly Produce: Climate Change, Agricultural Adaptation, and Food Waste , 2017 .

[4]  I. D. Hooge,et al.  This apple is too ugly for me!: Consumer preferences for suboptimal food products in the supermarket and at home , 2017 .

[5]  Sapan Naik,et al.  Color, Size and Shape Feature Extraction Techniques for Fruits: A Technical Review , 2015 .

[6]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[7]  A. Kumar,et al.  Automatic Fruit Grading and Classification System Using Computer Vision: A Review , 2015, 2015 Second International Conference on Advances in Computing and Communication Engineering.

[8]  Samina Khalid,et al.  A survey of feature selection and feature extraction techniques in machine learning , 2014, 2014 Science and Information Conference.

[9]  Francisco Manzano-Agugliaro,et al.  An active contour computer algorithm for the classification of cucumbers , 2013 .

[10]  Syed Khaleel Ahmed,et al.  Classification of fruits using Probabilistic Neural Networks - Improvement using color features , 2011, TENCON 2011 - 2011 IEEE Region 10 Conference.

[11]  N. Pappa,et al.  GMC Design Based on SVM Model of Batch Reactor , 2010 .

[12]  K. S. Hareesh,et al.  Prospects of Computer Vision Automated Grading and Sorting Systems in Agricultural and Food Products for Quality Evaluation , 2010 .

[13]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[14]  Yong-Xianga Sun,et al.  Shape feature extraction of fruit image based on chain code , 2007, 2007 International Conference on Wavelet Analysis and Pattern Recognition.

[15]  Sing Kiong Nguang,et al.  Low cost sensor for volume and surface area computation of axi-symmetric agricultural products , 2007 .

[16]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[17]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[18]  A. Villordon,et al.  Digital Image Analysis of Sweetpotato Storage Roots in Herbicide Trials , 2002 .

[19]  M. K. Upadhyaya,et al.  Comparison of Three Nondestructive Methods for Determination of Vegetable Surface Area , 2002 .

[20]  B. Farkas,et al.  Physical and thermal properties of three sweetpotato cultivars (Ipomoea Batatas L.) , 2000 .

[21]  I. Paulus,et al.  Shape Characterization of New Apple Cultivars by Fourier Expansion of Digitized Images , 1999 .

[22]  P. Bosland,et al.  Relationships between postharvest water loss and physical properties of pepper fruit (Capsicum annuum L.) , 1993 .

[23]  Stephen W. Searcy,et al.  Estimation of tip shape for carrot classification by machine vision , 1992 .

[24]  Stephen W. Searcy,et al.  Fresh market carrot inspection by machine vision , 1991, Other Conferences.

[25]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[26]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[27]  Sue Nokes,et al.  The American Society of Agricultural and Biological Engineers , 2020, Scientia.

[28]  Michael D. Boyette,et al.  Evaluating the Shape and Size Characteristics of Sweetpotatoes Using Digital Image Analysis , 2017 .

[29]  Margarita Ruiz-Altisent,et al.  Shape determination of horticultural produce using two-dimensional computer vision – A review , 2012 .

[30]  L. Ladha,et al.  FEATURE SELECTION METHODS AND ALGORITHMS , 2011 .

[31]  Sri Ramakrishna,et al.  FEATURE SELECTION METHODS AND ALGORITHMS , 2011 .

[32]  G. NarendraV PROSPECTS OF COMPUTER VISION AUTOMATED GRADING AND SORTING SYSTEMS IN AGRICULTURAL AND FOOD PRODUCTS FOR QUALITY EVALUATION , 2010 .

[33]  Liu Gang,et al.  Recognition and shape features extraction of apples based on machine vision. , 2009 .

[34]  Zhang Na,et al.  Fractionation of soybean proteins based on technique of Ca2+ induced-precipitation. , 2009 .

[35]  Naoshi Kondo,et al.  Classification of Shape of Bell Pepper by Machine Vision System , 2006 .

[36]  F. E. Sistler,et al.  The Size and Shape of Typical Sweet Potatoes , 1986 .