A mango picking vision algorithm on instance segmentation and key point detection from RGB images in an open orchard

The vision system of a fruit picking robot must perform two difficult tasks, the accurate pixel-wise instance segmentation of fruit and the correct detection of picking points. Due to the unsatisfactory performance of the vision system on these tasks, the picking performance of the robot is insufficient and its application is not prevalent. In this study, performing fruit instance segmentation and the localization of picking points, a vision algorithm using RGB images was designed in a deep learning framework for a potential visual system of mango picking robots. The algorithm not only performed these two tasks successfully, but also integrated them into an end-to-end network, in which parallel branches performed the two tasks simultaneously. The average precision (at IoU of 0.75) and the average recall of instance segmentation reached 0.947 and 0.929 respectively, and the best precision and recall of picking-point detection reached 0.984 and 0.908 respectively. In addition, the tasks face various illumination and background interference in outdoor orchards, along with complex problems in terms of occlusion, overlap and object scale. In this study, the performance of the vision system was analysed in detail on several datasets and subsets of the various complex conditions, and the major factors that affect the performance were discussed. The results demonstrated that this system was robust against various illuminations and complex backgrounds, and yielded satisfactory segmentation and picking-point detection performances for minor and medium occlusion or overlap, and for medium and large mangoes. The model visualization and the influence analysis of model training demonstrated the training process and modelling effect of the deep learning network.

[1]  Volkan Isler,et al.  Registering Reconstructions of the Two Sides of Fruit Tree Rows , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Chao Chen,et al.  Fruit Detection and Segmentation for Apple Harvesting Using Visual Sensor in Orchards , 2019, Sensors.

[3]  Simon Parsons,et al.  Towards Automated Strawberry Harvesting: Identifying the Picking Point , 2017, TAROS.

[4]  Xiong Chen,et al.  A vision methodology for harvesting robot to detect cutting points on peduncles of double overlapping grape clusters in a vineyard , 2018, Comput. Ind..

[5]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Makgafele Lucia Ntsoane,et al.  Quality assesment and postharvest technology of mango: A review of its current status and future perspectives , 2019, Scientia Horticulturae.

[8]  Volkan Isler,et al.  Apple Counting using Convolutional Neural Networks , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[10]  Alexander Wendel,et al.  Ground based hyperspectral imaging for extensive mango yield estimation , 2019, Comput. Electron. Agric..

[11]  Bruce MacDonald,et al.  Robotic kiwifruit harvesting using machine vision, convolutional neural networks, and robotic arms , 2019, Biosystems Engineering.

[12]  Yi Li,et al.  Fully Convolutional Instance-Aware Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yael Edan,et al.  Computer vision for fruit harvesting robots - state of the art and challenges ahead , 2012, Int. J. Comput. Vis. Robotics.

[14]  Tristan Perez,et al.  DeepFruits: A Fruit Detection System Using Deep Neural Networks , 2016, Sensors.

[15]  R. E. Litz The Mango: Botany, Production and Uses , 1997 .

[16]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[17]  K. Walsh,et al.  Deep learning for real-time fruit detection and orchard fruit load estimation: benchmarking of ‘MangoYOLO’ , 2019, Precision Agriculture.

[18]  Vijay Kumar,et al.  Counting Apples and Oranges With Deep Learning: A Data-Driven Approach , 2017, IEEE Robotics and Automation Letters.

[19]  Volkan Isler,et al.  Vision-Based Apple Counting and Yield Estimation , 2016, ISER.

[20]  Dennis Jarvis,et al.  Estimation of mango crop yield using image analysis - Segmentation method , 2013 .

[21]  James Patrick Underwood,et al.  Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards , 2016, J. Field Robotics.

[22]  Maryam Rahnemoonfar,et al.  Deep Count: Fruit Counting Based on Deep Simulated Learning , 2017, Sensors.

[23]  Jia Deng,et al.  Stacked Hourglass Networks for Human Pose Estimation , 2016, ECCV.

[24]  Q. Zhang,et al.  Sensors and systems for fruit detection and localization: A review , 2015, Comput. Electron. Agric..

[25]  Hedvig Kjellstrom,et al.  Determinantal Point Processes for Mini-Batch Diversification , 2017, UAI 2017.

[26]  Yuanshen Zhao,et al.  A review of key techniques of vision-based control for harvesting robot , 2016, Comput. Electron. Agric..

[27]  Dahua Lin,et al.  Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition , 2018, AAAI.

[28]  Avadesh Meduri,et al.  MangoNet: A deep semantic segmentation architecture for a method to detect and count mangoes in an open orchard , 2019, Eng. Appl. Artif. Intell..

[29]  Volkan Isler,et al.  Semantic mapping for orchard environments by merging two‐sides reconstructions of tree rows , 2018, J. Field Robotics.

[30]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Xiangjun Zou,et al.  Vision-based extraction of spatial information in grape clusters for harvesting robots , 2016 .

[32]  Andreas Kamilaris,et al.  Deep learning in agriculture: A survey , 2018, Comput. Electron. Agric..

[33]  Zhen Liu,et al.  Green Grape Detection and Picking-Point Calculation in a Night-Time Natural Environment Using a Charge-Coupled Device (CCD) Vision Sensor with Artificial Illumination , 2018, Sensors.

[34]  Lars Grimstad,et al.  An autonomous strawberry‐harvesting robot: Design, development, integration, and field evaluation , 2019, J. Field Robotics.

[35]  Weidong Hu,et al.  Diversity in Machine Learning , 2018, IEEE Access.

[36]  Min Ye,et al.  Fault-Tolerant Design of a Limited Universal Fruit-Picking End-Effector Based on Vision-Positioning Error , 2016 .

[37]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[38]  James Patrick Underwood,et al.  Deep fruit detection in orchards , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[40]  Zhen Liu,et al.  The recognition of litchi clusters and the calculation of picking point in a nocturnal natural environment , 2018 .

[41]  Vijay Kumar,et al.  Robust Fruit Counting: Combining Deep Learning, Tracking, and Structure from Motion , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[42]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  EdanYael,et al.  Harvesting Robots for High-value Crops , 2014 .

[44]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Changki Mo,et al.  Design, integration, and field evaluation of a robotic apple harvester , 2017, J. Field Robotics.

[46]  En Li,et al.  Apple detection during different growth stages in orchards using the improved YOLO-V3 model , 2019, Comput. Electron. Agric..

[47]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[48]  Volkan Isler,et al.  A comparative study of fruit detection and counting methods for yield mapping in apple orchards , 2018, J. Field Robotics.

[49]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[50]  Chris McCarthy,et al.  Deep learning - Method overview and review of use for fruit detection and yield estimation , 2019, Comput. Electron. Agric..

[51]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[52]  Zhen Liu,et al.  Visual positioning technology of picking robots for dynamic litchi clusters with disturbance , 2018, Comput. Electron. Agric..