A Comparative Analysis of Object Detection Metrics with a Companion Open-Source Toolkit

Recent outstanding results of supervised object detection in competitions and challenges are often associated with specific metrics and datasets. The evaluation of such methods applied in different contexts have increased the demand for annotated datasets. Annotation tools represent the location and size of objects in distinct formats, leading to a lack of consensus on the representation. Such a scenario often complicates the comparison of object detection methods. This work alleviates this problem along the following lines: (i) It provides an overview of the most relevant evaluation methods used in object detection competitions, highlighting their peculiarities, differences, and advantages; (ii) it examines the most used annotation formats, showing how different implementations may influence the assessment results; and (iii) it provides a novel open-source toolkit supporting different annotation formats and 15 performance metrics, making it easy for researchers to evaluate the performance of their detection algorithms in most known datasets. In addition, this work proposes a new metric, also included in the toolkit, for evaluating object detection in videos that is based on the spatio-temporal overlap between the ground-truth and detected bounding boxes.

[1]  Bernt Schiele,et al.  What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[3]  Wujie Zhou,et al.  Hybrid-Attention Network for RGB-D Salient Object Detection , 2020, Applied Sciences.

[4]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[5]  Arnold W. M. Smeulders,et al.  Interaction in the segmentation of medical images: A survey , 2001, Medical Image Anal..

[6]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[7]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[8]  Harald Ganster,et al.  Automated Melanoma Recognition , 2001, IEEE Trans. Medical Imaging.

[9]  Zehang Sun,et al.  On-road vehicle detection: a review , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Xiaogang Wang,et al.  Intelligent multi-camera video surveillance: A review , 2013, Pattern Recognit. Lett..

[11]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[12]  Guowu Yang,et al.  Breast Tumor Detection in Ultrasound Images Using Deep Learning , 2017, Patch-MI@MICCAI.

[13]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Anthony Hoogs,et al.  Industry and Object Recognition: Applications, Applied Research and Challenges , 2006, Toward Category-Level Object Recognition.

[15]  Dongil Han,et al.  A Deep Learning-Based Hybrid Framework for Object Detection and Recognition in Autonomous Driving , 2020, IEEE Access.

[16]  Dong-Chul Park,et al.  Object Detection Using Scale Invariant Feature Transform , 2013, ICGEC.

[17]  Dawid Połap,et al.  An adaptive genetic algorithm as a supporting mechanism for microscopy image analysis in a cascade of convolution neural networks , 2020, Appl. Soft Comput..

[18]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[19]  Sergio A. Velastin,et al.  A Review of Computer Vision Techniques for the Analysis of Urban Traffic , 2011, IEEE Transactions on Intelligent Transportation Systems.

[20]  Thomas O. Binford,et al.  The Recovery of Three-Dimensional Structure from Image Curves , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Xindong Wu,et al.  Object Detection With Deep Learning: A Review , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Shanq-Jang Ruan,et al.  Illumination-Sensitive Background Modeling Approach for Accurate Moving Object Detection , 2011, IEEE Transactions on Broadcasting.

[23]  J. Dudczyk,et al.  Identification of emitter sources in the aspect of their fractal features , 2013 .

[24]  Roberto E. Gonz'alez,et al.  Galaxy detection and identification using deep learning and data augmentation , 2018, Astron. Comput..

[25]  Lukasz Rybak,et al.  A Geometrical Divide of Data Particle in Gravitational Classification of Moons and Circles Data Sets , 2020, Entropy.

[26]  Jieping Ye,et al.  Object Detection in 20 Years: A Survey , 2019, Proceedings of the IEEE.

[27]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[28]  Huchuan Lu,et al.  Hyperfusion-Net: Hyper-densely reflective feature fusion for salient object detection , 2019, Pattern Recognit..

[29]  S. Iwin Thanakumar Joseph,et al.  Optimized vessel detection in marine environment using hybrid adaptive cuckoo search algorithm , 2019, Comput. Electr. Eng..

[30]  Zhigang Zeng,et al.  CLU-CNNs: Object detection for medical images , 2019, Neurocomputing.

[31]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[32]  Nour Eldeen M. Khalifa,et al.  A deep transfer learning model with classical data augmentation and CGAN to detect COVID-19 from chest CT radiography digital images , 2020, Neural computing & applications.

[33]  Huiyu Zhou,et al.  Object tracking using SIFT features and mean shift , 2009, Comput. Vis. Image Underst..

[34]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[35]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[36]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  F ATTNEAVE,et al.  The quantitative study of shape and pattern perception. , 1956, Psychological bulletin.