CLASSIFICATION OF STRAWBERRY FRUIT SHAPE BY MACHINE LEARNING

Abstract. Shape is one of the most important traits of agricultural products due to its relationships with the quality, quantity, and value of the products. For strawberries, the nine types of fruit shape were defined and classified by humans based on the sampler patterns of the nine types. In this study, we tested the classification of strawberry shapes by machine learning in order to increase the accuracy of the classification, and we introduce the concept of computerization into this field. Four types of descriptors were extracted from the digital images of strawberries: (1) the Measured Values (MVs) including the length of the contour line, the area, the fruit length and width, and the fruit width/length ratio; (2) the Ellipse Similarity Index (ESI); (3) Elliptic Fourier Descriptors (EFDs), and (4) Chain Code Subtraction (CCS). We used these descriptors for the classification test along with the random forest approach, and eight of the nine shape types were classified with combinations of MVs + CCS + EFDs. CCS is a descriptor that adds human knowledge to the chain codes, and it showed higher robustness in classification than the other descriptors. Our results suggest machine learning's high ability to classify fruit shapes accurately. We will attempt to increase the classification accuracy and apply the machine learning methods to other plant species.

[1]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[2]  Herbert Freeman,et al.  Computer Processing of Line-Drawing Images , 1974, CSUR.

[3]  Charles R. Giardina,et al.  Elliptic Fourier features of a closed contour , 1982, Comput. Graph. Image Process..

[4]  F. Rohlf,et al.  A COMPARISON OF FOURIER METHODS FOR THE DESCRIPTION OF WING SHAPE IN MOSQUITOES (DIPTERA: CULICIDAE) , 1984 .

[5]  The workloads of farmers who sort and pack strawberries in accordance with standards of shipment and their awareness of standards of shipment. , 1989 .

[6]  Seishi Ninomiya,et al.  Quantitative Evaluation of Soybean (Glycine max L. Merr.) Leaflet Shape by Principal Component Scores Based on Elliptic Fourier Descriptor , 1995 .

[7]  Qixin Cao,et al.  Studies on Automatic Sorting System for Strawberry (Part 2) , 1996 .

[8]  Masafumi Mitarai,et al.  Study on Grade Judgment of Fruit Vegetables Using Machine Vision (Part 2). Judgment for Several Varieties of Strawberry by Developed Software. , 1996 .

[9]  Y. Hashimoto,et al.  Pattern recognition of fruit shape based on the concept of chaos and neural networks , 2000 .

[10]  Application of Coefficient of Reliability to the Studies of Physical Therapy. , 2002 .

[11]  H. Iwata,et al.  SHAPE: a computer program package for quantitative evaluation of biological shapes based on elliptic Fourier descriptors. , 2002, The Journal of heredity.

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  樹 今井,et al.  理学療法研究における“評価の信頼性”の検査法 , 2004 .

[14]  Claire Anderson,et al.  Tomato Fruit Shape Analysis Using Morphometric and Morphology Attributes Implemented in Tomato Analyzer Software Program , 2009 .

[15]  Jacques Wainer,et al.  Automatic fruit and vegetable classification from images , 2010 .

[16]  Guido Sanguinetti,et al.  LEAFPROCESSOR: a new leaf phenotyping tool using contour bending energy and shape cluster analysis. , 2010, The New phytologist.

[17]  R. N. Shebiah,et al.  Fruit Recognition using Color and Texture Features , 2010 .

[18]  Xu Liming,et al.  Automated strawberry grading system based on image processing , 2010 .

[19]  Mahmoud Omid,et al.  Development of a lemon sorting system based on color and size. , 2010 .

[20]  Yudong Zhang,et al.  Classification of Fruits Using Computer Vision and a Multiclass Support Vector Machine , 2012, Sensors.

[21]  M. Yano,et al.  SmartGrain: High-Throughput Phenotyping Software for Measuring Seed Shape through Image Analysis1[C][W][OA] , 2012, Plant Physiology.

[22]  M. Sorrells,et al.  Comparison of digital image analysis using elliptic Fourier descriptors and major dimensions to phenotype seed shape in hexaploid wheat (Triticum aestivum L.) , 2012, Euphytica.

[23]  Aboul Ella Hassenian,et al.  Automatic Fruit Image Recognition System Based on Shape and Color Features , 2014, AMLTA.

[24]  Yoshua Bengio,et al.  Marathi Handwritten Numeral Recognition using Fourier Descriptors and Normalized Chain Code , 2017 .

[25]  Upasana Singh,et al.  Automatic Recognition of Medicinal Plants using Machine Learning Techniques , 2017 .

[26]  K. Oku,et al.  Development and characterization of a strawberry MAGIC population derived from crosses with six strawberry cultivars , 2017, Breeding science.