Object detection is a challenging computer vision task with numerous real-world applications. In recent years, the concept of the object relationship model has become helpful for object detection and has been verified and realized in deep learning. Nonetheless, most approaches to modeling object relations are limited to using the anchor-based algorithms; they cannot be directly migrated to the anchor-free frameworks. The reason is that the anchor-free algorithms are used to eliminate the complex design of anchors and predict heatmaps to represent the locations of keypoints of different object categories, without considering the relationship between keypoints. Therefore, to better fuse the information between the heatmap channels, it is important to model the visual relationship between keypoints. In this paper, we present a knowledge-driven network (KDNet)—a new architecture that can aggregate and model keypoint relations to augment object features for detection. Specifically, it processes a set of keypoints simultaneously through interactions between their local and geometric features, thereby allowing the modeling of their relationship. Finally, the updated heatmaps were used to obtain the corners of the objects and determine their positions. The experimental results conducted on the RIDER dataset confirm the effectiveness of the proposed KDNet, which significantly outperformed other state-of-the-art object detection methods.
[1]
Andrew Zisserman,et al.
Very Deep Convolutional Networks for Large-Scale Image Recognition
,
2014,
ICLR.
[2]
Risheng Huang,et al.
RepGN: Object Detection with Relational Proposal Graph Network
,
2019,
ArXiv.
[3]
Yun Teng,et al.
CornerNet-Lite: Efficient Keypoint based Object Detection
,
2019,
BMVC.
[4]
Kaiming He,et al.
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
,
2015,
IEEE Transactions on Pattern Analysis and Machine Intelligence.
[5]
Junchi Yan,et al.
R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object
,
2019,
AAAI.
[6]
Xingyi Zhou,et al.
Objects as Points
,
2019,
ArXiv.
[7]
Forrest N. Iandola,et al.
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size
,
2016,
ArXiv.
[8]
Ali Farhadi,et al.
YOLOv3: An Incremental Improvement
,
2018,
ArXiv.