Paper information - Content-aware image resizing for faster object detection on aerial imagery

To combat poaching or perform game counts, nature conservationists need to inspect areas that are very large and hard to reach by car or on foot. Recently, they have been able to inspect these areas more easily by using UAVs equipped with cameras. Despite the ease with which these systems can be deployed, the recorded imagery still needs to be analyzed manually. Automatic object detection algorithms could greatly reduce the time spent looking for the object of interest, benefiting the conservation work. State-of-the-art object detection algorithms such as R-CNN [1] rely heavily on object proposals to speed up the object detection pipeline. Object proposal methods such as Selective Search [2] or Edge Boxes [3] have proven to work well on popular datasets such as PASCAL VOC [4] and ImageNet [5]. However, the resolution of the images in these datasets is much lower than the image resolution needed for nature conservation tasks. Object proposal methods are significantly slower when applied to high-resolution images, which affects the detection rate. To maintain a good object detection rate, we apply a content-aware image resizing method that resizes the image without compromising on the content. Figure 1 shows the original image and a version that was reduced to 25% of its original size. Even though the majority of the pixels have been removed, the objects (cows) are still clearly visible. Figure 2 compares our method to regular resizing in terms of recall at different image sizes. The figure shows that as the image is reduced further, the recall drops much faster with regular resizing than with our content-aware image resizing method. We show that content-aware image resizing can be used to speed up object detection on aerial imagery without significantly impacting detection performance.
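The abstract does not say which content-aware resizing algorithm is used. As an illustration only, the sketch below uses seam carving, a well-known content-aware technique that repeatedly removes the connected top-to-bottom path of pixels with the lowest gradient energy. The function names (`energy`, `find_vertical_seam`, `remove_vertical_seam`, `shrink_width`) and the toy grayscale image are hypothetical and are not taken from the paper.

```python
# Minimal seam-carving sketch in pure Python (an assumed technique, not
# necessarily the method used in the paper). Images are lists of rows of
# grayscale intensities.

def energy(img):
    """Per-pixel energy: absolute horizontal plus vertical intensity difference."""
    h, w = len(img), len(img[0])
    e = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            left = img[y][max(x - 1, 0)]
            right = img[y][min(x + 1, w - 1)]
            up = img[max(y - 1, 0)][x]
            down = img[min(y + 1, h - 1)][x]
            e[y][x] = abs(right - left) + abs(down - up)
    return e


def find_vertical_seam(e):
    """Return one column index per row for the lowest-energy connected path."""
    h, w = len(e), len(e[0])
    cost = [row[:] for row in e]
    # Dynamic programming: each pixel adds the cheapest of its three upper neighbours.
    for y in range(1, h):
        for x in range(w):
            lo, hi = max(x - 1, 0), min(x + 1, w - 1)
            cost[y][x] += min(cost[y - 1][lo:hi + 1])
    # Backtrack from the cheapest pixel in the bottom row.
    x = min(range(w), key=lambda i: cost[h - 1][i])
    seam = [x]
    for y in range(h - 1, 0, -1):
        lo, hi = max(x - 1, 0), min(x + 1, w - 1)
        x = min(range(lo, hi + 1), key=lambda i: cost[y - 1][i])
        seam.append(x)
    seam.reverse()
    return seam


def remove_vertical_seam(img, seam):
    """Delete one pixel per row, as given by the seam."""
    return [row[:x] + row[x + 1:] for row, x in zip(img, seam)]


def shrink_width(img, new_width):
    """Remove low-energy seams until the image has new_width columns."""
    while len(img[0]) > new_width:
        img = remove_vertical_seam(img, find_vertical_seam(energy(img)))
    return img


# Toy example: a flat background (0) with a bright two-pixel-wide "object" (9).
toy = [[0, 0, 0, 9, 9, 0, 0, 0] for _ in range(4)]
small = shrink_width(toy, 4)
```

In this toy case the image is halved in width while both bright columns are kept, whereas plain subsampling such as `row[::2]` would drop one of them. Recomputing the energy after every seam is simple but slow; a practical implementation for high-resolution aerial imagery would need a faster formulation.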

A. Visser | J. van Gemert

[1] C. Lawrence Zitnick, et al. Edge Boxes: Locating Object Proposals from Edges, 2014, ECCV.

[2] Michael S. Bernstein, et al. ImageNet Large Scale Visual Recognition Challenge, 2014, International Journal of Computer Vision.

[3] Ross B. Girshick, et al. Fast R-CNN, 2015, arXiv:1504.08083.

[4] Koen E. A. van de Sande, et al. Selective Search for Object Recognition, 2013, International Journal of Computer Vision.

[5] Jianguo Zhang, et al. The PASCAL Visual Object Classes Challenge, 2006.

[6] Luc Van Gool, et al. The 2005 PASCAL Visual Object Classes Challenge, 2005, MLCW.