Paper Information - Multi-task Enhanced Dam Crack Image Detection Based on Faster R-CNN

To improve the detection accuracy of the Faster R-CNN model for multiple small targets, we propose a multi-task enhanced dam crack image detection method based on Faster R-CNN (ME-Faster R-CNN) that adapts to dam cracks of different lengths under different lighting conditions. To address the shortage of dam crack samples, transfer learning is used to assist network training and data enhancement. In ME-Faster R-CNN, a ResNet-50 network is first used to extract features from the original images and produce the feature map. The feature map is then fed into a multi-task enhanced RPN module, which generates candidate regions using anchor boxes of appropriate size and dimension. Finally, the feature map and the candidate regions are processed to detect the dam cracks. Experimental results show that ME-Faster R-CNN with transfer learning achieves an average IoU of 82.52% and a mean average precision (mAP) of 80.08%. Compared with the Faster R-CNN detection method using the same parameters, the average IoU and mAP increase by 1.06% and 1.56%, respectively.
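The abstract reports results as an average IoU. As a rough illustration of that metric only, the following Python sketch computes the intersection over union of two axis-aligned boxes and averages it over paired boxes. The function names, the `(x_min, y_min, x_max, y_max)` box layout, and the one-to-one pairing of predictions with ground-truth boxes are assumptions made for this example; the abstract does not state how the authors match or average boxes.

```python
from typing import Sequence, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    # Width and height of the overlap, clamped at zero for disjoint boxes.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter

    # Degenerate (zero-area) boxes give a union of zero; report 0.0.
    return inter / union if union > 0 else 0.0


def average_iou(preds: Sequence[Box], truths: Sequence[Box]) -> float:
    """Mean IoU over boxes paired by position (hypothetical pairing rule)."""
    if len(preds) != len(truths):
        raise ValueError("preds and truths must have the same length")
    if not preds:
        return 0.0
    return sum(iou(p, t) for p, t in zip(preds, truths)) / len(preds)
```

For instance, a predicted box `(0, 0, 2, 2)` against a ground-truth box `(1, 1, 3, 3)` overlaps in a unit square, so the IoU is 1 / (4 + 4 - 1) = 1/7.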

Yingchi Mao | Jing Wang | Longbao Wang | Jianghong Tang
