Optimization of a CNN-based Object Detector for Fisheye Cameras

Fisheye cameras are widely used in the automobile industry, due to their wide field of view for the environment. There are a lot of algorithms that researchers use for object detection, among them Convolutional Neural Networks (CNN) detectors have been widespread during the course of the last decade, however, they are mostly trained and applied on conventional pinhole camera images. In this paper, an attempt to optimize a CNN-based detector for fisheye cameras was made, taking into consideration the barrel distortion, which complicates the object detection. Several experiments were carried out to optimize one state-of-the-art detector, using synthetic fisheye effect on training dataset. The obtained result proves that fisheye augmentation can considerably advance a CNN-based detector’s performance on fisheye images in spite of the distortion. Along the way, a new object detection framework based on the Tensorflow Estomator API was constructed to perform the experiments as convenient as possible.

[1]  James J. Kumler,et al.  Fish-eye lens designs and their relative performance , 2000, SPIE Optics + Photonics.

[2]  Josef F. Krems,et al.  What would drivers like to know during automated driving? Information needs at different levels of automation. , 2015 .

[3]  Xiaorui Zhu,et al.  Object detection on panoramic images based on deep learning , 2017, 2017 3rd International Conference on Control, Automation and Robotics (ICCAR).

[4]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[5]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Alois Knoll,et al.  An Automotive Side-View System Based on Ethernet and IP , 2012, 2012 26th International Conference on Advanced Information Networking and Applications Workshops.

[7]  Arcangelo Distante,et al.  Camera Calibration and 3D Reconstruction , 2020 .

[8]  Ming Yang,et al.  CNN based semantic segmentation for urban traffic scenes using fisheye camera , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[9]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[10]  Forrest N. Iandola,et al.  SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).