Passion fruit detection and counting based on multiple scale faster R-CNN using RGB-D images

The accurate and reliable fruit detection in orchards is one of the most crucial tasks for supporting higher level agriculture tasks such as yield mapping and robotic harvesting. However, detecting and counting small fruit is a very challenging task under variable lighting conditions, low-resolutions and heavy occlusion by neighboring fruits or foliage. To robustly detect small fruits, an improved method is proposed based on multiple scale faster region-based convolutional neural networks (MS-FRCNN) approach using the color and depth images acquired with an RGB-D camera. The architecture of MS-FRCNN is improved to detect lower-level features by incorporating feature maps from shallower convolution feature maps for regions of interest (ROI) pooling. The detection framework consists of three phases. Firstly, multiple scale feature extractors are used to extract low and high features from RGB and depth images respectively. Then, RGB-detector and depth-detector are trained separately using MS-FRCNN. Finally, late fusion methods are explored for combining the RGB and depth detector. The detection framework was demonstrated and evaluated on two datasets that include passion fruit images under variable illumination conditions and occlusion. Compared with the faster R-CNN detector of RGB-D images, the recall, the precision and F1-score of MS-FRCNN method increased from 0.922 to 0.962, 0.850 to 0.931 and 0.885 to 0.946, respectively. Furthermore, the MS-FRCNN method effectively improves small passion fruit detection by achieving 0.909 of the F1 score. It is concluded that the detector based on MS-FRCNN can be applied practically in the actual orchard environment.

[1]  Verónica Vilaplana,et al.  Multi-modal deep learning for Fuji apple detection using RGB-D cameras and their radiometric capabilities , 2019, Comput. Electron. Agric..

[2]  Won Suk Lee,et al.  Immature citrus fruit detection based on local binary pattern feature and hierarchical contour analysis , 2018, Biosystems Engineering.

[3]  James Patrick Underwood,et al.  Deep fruit detection in orchards , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[4]  C. Glasbey,et al.  Automatic fruit recognition and counting from multiple images , 2014 .

[5]  Suraj Amatya,et al.  Apple crop load estimation with over-the-row machine vision system , 2014 .

[6]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Jidong Lv,et al.  Recognition method for apple fruit based on SUSAN and PCNN , 2018, Multimedia Tools and Applications.

[8]  S. Singh,et al.  Physiological and quality changes during postharvest ripening of purple passion fruit ( Passiflora edulis Sims ) , 2014 .

[9]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[10]  Marios Savvides,et al.  Multiple Scale Faster-RCNN Approach to Driver’s Cell-Phone Usage and Hands on Steering Wheel Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[11]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[12]  K. Walsh,et al.  Deep learning for real-time fruit detection and orchard fruit load estimation: benchmarking of ‘MangoYOLO’ , 2019, Precision Agriculture.

[13]  Chris McCarthy,et al.  Deep learning - Method overview and review of use for fruit detection and yield estimation , 2019, Comput. Electron. Agric..

[14]  Billy V. Koen,et al.  Definition of the engineering method , 1985 .

[15]  Yael Edan,et al.  Adaptive thresholding with fusion using a RGBD sensor for red sweet-pepper detection , 2016 .

[16]  Won Suk Lee,et al.  Immature green citrus fruit detection and counting based on fast normalized cross correlation (FNCC) using natural outdoor colour images , 2016, Precision Agriculture.

[17]  Lin Yang,et al.  Evaluating and Improving the Depth Accuracy of Kinect for Windows v2 , 2015, IEEE Sensors Journal.

[18]  Avadesh Meduri,et al.  MangoNet: A deep semantic segmentation architecture for a method to detect and count mangoes in an open orchard , 2019, Eng. Appl. Artif. Intell..

[19]  Dan Zecha,et al.  A closer look: Small object detection in faster R-CNN , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[20]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Kyunghyun Cho,et al.  Augmentation for small object detection , 2019, 9th International Conference on Advances in Computing and Information Technology (ACITY 2019).

[22]  Volkan Isler,et al.  A comparative study of fruit detection and counting methods for yield mapping in apple orchards , 2018, J. Field Robotics.

[23]  D. W. Lamb,et al.  Risk mapping of redheaded cockchafer (Adoryphorus couloni) (Burmeister) infestations using a combination of novel k-means clustering and on-the-go plant and soil sensing technologies , 2015, Precision Agriculture.

[24]  Yueju Xue,et al.  Detection of passion fruits and maturity classification using Red-Green-Blue Depth images , 2018, Biosystems Engineering.

[25]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[26]  James Patrick Underwood,et al.  Image Based Mango Fruit Detection, Localisation and Yield Estimation Using Multiple View Geometry , 2016, Sensors.

[27]  Md Zahangir Alom,et al.  Recurrent residual U-Net for medical image segmentation , 2019, Journal of medical imaging.

[28]  Vijay Kumar,et al.  Counting Apples and Oranges With Deep Learning: A Data-Driven Approach , 2017, IEEE Robotics and Automation Letters.

[29]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[30]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[32]  Won Suk Lee,et al.  Immature green citrus fruit detection using color and thermal images , 2018, Comput. Electron. Agric..

[33]  Tristan Perez,et al.  DeepFruits: A Fruit Detection System Using Deep Neural Networks , 2016, Sensors.

[34]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[35]  Rainer Lienhart,et al.  Scalable logo recognition in real-world images , 2011, ICMR.

[36]  Q. Zhang,et al.  Sensors and systems for fruit detection and localization: A review , 2015, Comput. Electron. Agric..

[37]  Shengen Yan,et al.  Deep Image: Scaling up Image Recognition , 2015, ArXiv.