Investigation on the Effect of the Feature Extraction Backbone for Small Object Segmentation using Fully Convolutional Neural Network in Traffic Signs Application

Small objects are widely found in different applications such as traffic signs and to segment those objects make it difficult to extract features due to the small number of pixels. Previous research has been done to show how error prone the semantic segmentation networks to small objects in variance of application such as medical images and remote sensing and how it leads to class imbalance. However, small object segmentation seems to be tricky and making the network struggle. Recently there are small amount of research has been done in the effect of the feature extraction backbone to the small object datasets. In this paper we investigate the effect of different backbone feature extraction such as AlexNet, VGGNet, GoogleNet on an imbalanced small objects dataset after grouping them by shape and colour in the Fully Convolutional Networks (FCN). We measure the performance on PASCAL VOC and Malaysian Traffic Sign Dataset (MTSD) showing the pixel accuracy, mean accuracy per class, mean IoU and frequency weighted IoU for each backbone and FCN. The results show that VGGNet as a backbone with Cross Entropy (CE) combined with Dice Loss (DL) achieves the highest score in mean IoU for imbalanced dataset but not for balanced dataset. However, in the imbalanced dataset major classes have a higher probability to confuse with minor classes due to the class imbalance. In conclusion we investigate different backbone networks with grouped labels dataset in shape and colour and we recommend using VGGNet FCN with CE combined with DL for imbalanced datasets.

[1]  Su-Lin Lee,et al.  Towards Automatic 3D Shape Instantiation for Deployed Stent Grafts: 2D Multiple-class and Class-imbalance Marker Segmentation with Equally-weighted Focal U-Net , 2017, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Wu Liu,et al.  LOCO: Local Context Based Faster R-CNN for Small Traffic Sign Detection , 2018, MMM.

[3]  Yu Zhang,et al.  Real-time small traffic sign detection with revised faster-RCNN , 2018, Multimedia Tools and Applications.

[4]  Takeshi Naemura,et al.  Detection of small birds in large images by combining a deep detector with semantic segmentation , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[5]  Arun Kumar Sangaiah,et al.  Deep detection network for real-life traffic sign in vehicular networks , 2018, Comput. Networks.

[6]  Aref Meddeb,et al.  Deep learning traffic sign detection, recognition and augmentation , 2017, SAC.

[7]  Wenyu Liu,et al.  Cascaded Segmentation-Detection Networks for Text-Based Traffic Sign Detection , 2018, IEEE Transactions on Intelligent Transportation Systems.

[8]  Jun Ma,et al.  Segmentation Loss Odyssey , 2020, ArXiv.

[9]  Meina Song,et al.  FAMN: Feature Aggregation Multipath Network for Small Traffic Sign Detection , 2019, IEEE Access.

[10]  Honghai Liu,et al.  Faster R-CNN for Small Traffic Sign Detection , 2017, CCCV.

[11]  Lijuan Liu,et al.  A novel traffic sign detection method via color segmentation and robust shape matching , 2015, Neurocomputing.

[12]  Peter Hall,et al.  Traffic signal detection and classification in street views using an attention model , 2018, Computational Visual Media.

[13]  Christopher Joseph Pal,et al.  The Importance of Skip Connections in Biomedical Image Segmentation , 2016, LABELS/DLMIA@MICCAI.

[14]  Kang Kim,et al.  Simultaneous Traffic Sign Detection and Boundary Estimation Using Convolutional Neural Network , 2018, IEEE Transactions on Intelligent Transportation Systems.

[15]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[16]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Camelia Lemnaru,et al.  Semantic segmentation-based traffic sign detection and recognition using deep learning techniques , 2018, 2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP).

[18]  Kristen Grauman,et al.  Image Recognition , 2009, Encyclopedia of Biometrics.

[19]  Md. Kamrul Hasan,et al.  Automatic Traffic Sign Detection and Recognition Using SegU-Net and a Modified Tversky Loss Function With L1-Constraint , 2020, IEEE Transactions on Intelligent Transportation Systems.

[20]  Yusheng Fu,et al.  Multi-Feature Fusion and Enhancement Single Shot Detector for Traffic Sign Recognition , 2020, IEEE Access.

[21]  Rubiyah Yusof,et al.  Malaysian traffic sign dataset for traffic sign detection and recognition systems , 2016 .

[22]  Michael Kampffmeyer,et al.  Semantic Segmentation of Small Objects and Modeling of Uncertainty in Urban Remote Sensing Images Using Deep Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[23]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[24]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[25]  Feng Tian,et al.  MR-CNN: A Multi-Scale Region-Based Convolutional Neural Network for Small Traffic Sign Recognition , 2019, IEEE Access.

[26]  Faming Shao,et al.  Improved Faster R-CNN Traffic Sign Detection Based on a Second Region of Interest and Highly Possible Regions Proposal Network , 2019, Sensors.

[27]  Jing Fan,et al.  Localized Traffic Sign Detection with Multi-scale Deconvolution Networks , 2018, 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC).

[28]  Baoli Li,et al.  Traffic-Sign Detection and Classification in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Van-Dung Hoang,et al.  Improving Traffic Signs Recognition Based Region Proposal and Deep Neural Networks , 2018, ACIIDS.