Agricultural Greenhouses Detection in High-Resolution Satellite Images Based on Convolutional Neural Networks: Comparison of Faster R-CNN, YOLO v3 and SSD

Agricultural greenhouses (AGs) are an important facility for the development of modern agriculture, and detecting them accurately and efficiently is necessary for strategic agricultural planning. With the advent of deep learning, various convolutional neural network (CNN)-based models have been proposed for object detection in high-spatial-resolution images. In this paper, we conduct a comparative assessment of three well-established CNN-based models for detecting AGs: Faster R-CNN, You Only Look Once v3 (YOLO v3), and Single Shot MultiBox Detector (SSD). The models were trained using transfer learning and fine-tuning. The accuracy and efficiency evaluation shows that YOLO v3 achieved the best performance in terms of mean average precision (mAP), frames per second (FPS), and visual inspection. SSD showed an advantage in detection speed, with an FPS twice that of Faster R-CNN, although the two models' mAP values on the test set are close. The trained models were also applied to two independent test sets, which showed that the models have a certain degree of transferability and that higher-resolution images contribute significantly to accuracy. Our study suggests that YOLO v3, with its advantages in both accuracy and computational efficiency, can be applied operationally to detect AGs in high-resolution satellite images.
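
For readers unfamiliar with the accuracy metric named above, the following is a minimal sketch of how average precision (AP) can be computed for a single class on a single image. It is not the evaluation code used in the study: the box format `(x1, y1, x2, y2)`, the IoU threshold of 0.5, the all-point interpolation, and the function names `iou` and `average_precision` are all assumptions made for illustration. With only one object class, mAP reduces to the AP of that class.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def average_precision(detections, ground_truths, iou_threshold=0.5):
    """AP for one class on one image (illustrative assumption, see lead-in).

    detections: list of (score, box); ground_truths: list of boxes.
    """
    if not ground_truths:
        return 0.0

    # Greedily match detections to ground truths in descending score order.
    matched = [False] * len(ground_truths)
    tp_flags = []
    for _, box in sorted(detections, key=lambda d: d[0], reverse=True):
        best_iou, best_idx = 0.0, -1
        for idx, gt in enumerate(ground_truths):
            overlap = iou(box, gt)
            if overlap > best_iou:
                best_iou, best_idx = overlap, idx
        is_tp = best_iou >= iou_threshold and not matched[best_idx]
        if is_tp:
            matched[best_idx] = True
        tp_flags.append(is_tp)

    # Precision and recall after each ranked detection.
    precisions, recalls, tp = [], [], 0
    for rank, flag in enumerate(tp_flags, start=1):
        tp += int(flag)
        precisions.append(tp / rank)
        recalls.append(tp / len(ground_truths))

    # Make precision non-increasing from right to left (all-point interpolation).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Area under the interpolated precision-recall curve.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


if __name__ == "__main__":
    gts = [(0, 0, 10, 10), (20, 20, 30, 30)]
    dets = [(0.9, (0, 0, 10, 10)), (0.8, (50, 50, 60, 60)), (0.7, (20, 20, 30, 30))]
    print(average_precision(dets, gts))
```

In a full evaluation, the matching would be pooled over all test images before the precision-recall curve is built. FPS is simply the number of images processed divided by the elapsed inference time.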
