TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios
暂无分享,去创建一个
Qi Zhao | Shuchang Lyu | Xingkui Zhu | Xu Wang | Shuchang Lyu | Qi Zhao | Xu Wang | Xingkui Zhu
[1] Mohsen Guizani,et al. Multiple Moving Targets Surveillance Based on a Cooperative Network for Multi-UAV , 2018, IEEE Communications Magazine.
[2] Yunhong Wang,et al. Receptive Field Block Net for Accurate and Fast Object Detection , 2017, ECCV.
[3] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] In-So Kweon,et al. CBAM: Convolutional Block Attention Module , 2018, ECCV.
[5] Roman Solovyev,et al. Weighted boxes fusion: Ensembling boxes from different object detection models , 2021, Image Vis. Comput..
[6] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[7] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Seong Joon Oh,et al. CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[9] Kaiming He,et al. Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[10] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.
[11] Zeming Li,et al. YOLOX: Exceeding YOLO Series in 2021 , 2021, ArXiv.
[12] Hong-Yuan Mark Liao,et al. YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.
[13] Songtao Liu,et al. Learning Spatial Fusion for Single-Shot Object Detection , 2019, ArXiv.
[14] Chien-Yao Wang,et al. Scaled-YOLOv4: Scaling Cross Stage Partial Network , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Quoc V. Le,et al. EfficientDet: Scalable and Efficient Object Detection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[16] Jun-Wei Hsieh,et al. CSPNet: A New Backbone that can Enhance Learning Capability of CNN , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[17] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.
[18] Xin Sun,et al. RRNet: A Hybrid Detector for Object Detection in Drone-Captured Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).
[19] Xiangyu Zhang,et al. DetNet: A Backbone network for Object Detection , 2018, ArXiv.
[20] Nuno Vasconcelos,et al. Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[21] Quoc V. Le,et al. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Iasonas Kokkinos,et al. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Xingyi Zhou,et al. Objects as Points , 2019, ArXiv.
[25] Shifeng Zhang,et al. Single-Shot Refinement Neural Network for Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[26] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[27] Qinghua Hu,et al. Vision Meets Drones: A Challenge , 2018, ArXiv.
[28] Shu Liu,et al. Path Aggregation Network for Instance Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[29] Xiangyu Zhang,et al. Light-Head R-CNN: In Defense of Two-Stage Object Detector , 2017, ArXiv.
[30] Deren Li,et al. An Accurate Matching Method for Projecting Vector Data into Surveillance Video to Monitor and Protect Cultivated Land , 2020, ISPRS Int. J. Geo Inf..
[31] Devis Tuia,et al. Detecting Mammals in UAV Images: Best Practices to address a substantially Imbalanced Dataset with Deep Learning , 2018, Remote Sensing of Environment.
[32] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[33] Stephen Lin,et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[34] Hei Law,et al. CornerNet: Detecting Objects as Paired Keypoints , 2018, ECCV.
[35] Bertrand Le Saux,et al. Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.
[36] Bin Li,et al. Deformable DETR: Deformable Transformers for End-to-End Object Detection , 2020, ICLR.
[37] Vladlen Koltun,et al. Probabilistic two-stage detection , 2021, ArXiv.
[38] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.
[39] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[40] Georg Heigold,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2021, ICLR.
[41] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[42] Jennifer N. Hird,et al. Use of Unmanned Aerial Vehicles for Monitoring Recovery of Forest Vegetation on Petroleum Well Sites , 2017, Remote. Sens..
[43] Michele Volpi,et al. Fast animal detection in UAV images using convolutional neural networks , 2017, 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).
[44] Hongyi Zhang,et al. mixup: Beyond Empirical Risk Minimization , 2017, ICLR.
[45] Stephen Lin,et al. RepPoints: Point Set Representation for Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[46] Baijian Yang,et al. PENet: Object Detection using Points Estimation in Aerial Images , 2020, ArXiv.
[47] Ying Wang,et al. VarifocalNet: An IoU-aware Dense Object Detector , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[48] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Ying Chen,et al. M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network , 2018, AAAI.
[50] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[51] Hao Chen,et al. FCOS: Fully Convolutional One-Stage Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[52] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.
[53] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.
[54] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.
[55] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[56] Surya Nepal,et al. Daedalus: Breaking Non-Maximum Suppression in Object Detection via Adversarial Examples , 2019, ArXiv.
[57] Ross B. Girshick,et al. Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.