Improved Face Detection Method via Learning Small Faces on Hard Images Based on a Deep Learning Approach

Most facial recognition and face analysis systems start with facial detection. Early techniques, such as Haar cascades and histograms of directed gradients, mainly rely on features that had been manually developed from particular images. However, these techniques are unable to correctly synthesize images taken in untamed situations. However, deep learning’s quick development in computer vision has also sped up the development of a number of deep learning-based face detection frameworks, many of which have significantly improved accuracy in recent years. When detecting faces in face detection software, the difficulty of detecting small, scale, position, occlusion, blurring, and partially occluded faces in uncontrolled conditions is one of the problems of face identification that has been explored for many years but has not yet been entirely resolved. In this paper, we propose Retina net baseline, a single-stage face detector, to handle the challenging face detection problem. We made network improvements that boosted detection speed and accuracy. In Experiments, we used two popular datasets, such as WIDER FACE and FDDB. Specifically, on the WIDER FACE benchmark, our proposed method achieves AP of 41.0 at speed of 11.8 FPS with a single-scale inference strategy and AP of 44.2 with multi-scale inference strategy, which are results among one-stage detectors. Then, we trained our model during the implementation using the PyTorch framework, which provided an accuracy of 95.6% for the faces, which are successfully detected. Visible experimental results show that our proposed model outperforms seamless detection and recognition results achieved using performance evaluation matrices.

[1]  J. Chedjou,et al.  Improved Agricultural Field Segmentation in Satellite Imagery Using TL-ResUNet Architecture , 2022, Sensors.

[2]  M. Mukhiddinov,et al.  A Wildfire Smoke Detection System Using Unmanned Aerial Vehicle Images Based on the Optimized YOLOv5 , 2022, Sensors.

[3]  M. Mukhiddinov,et al.  Development of Real-Time Landmark-Based Emotion Recognition CNN for Masked Faces , 2022, Sensors.

[4]  I. Tarimer,et al.  Effect of Feature Selection on the Accuracy of Music Popularity Classification Using Machine Learning Algorithms , 2022, Electronics.

[5]  T. Whangbo,et al.  Modeling and Applying Implicit Dormant Features for Recommendation via Clustering and Deep Factorization , 2022, Sensors.

[6]  T. Whangbo,et al.  Improved Feature Parameter Extraction from Speech Signals Using Machine Learning Algorithm , 2022, Sensors.

[7]  Fazal Haque Malik,et al.  The Impact of Agile Methodology on Project Success, with a Moderating Role of Person’s Job Fit in the IT Industry of Pakistan , 2022, Applied Sciences.

[8]  T. Whangbo,et al.  Improved Real-Time Fire Warning System Based on Advanced Technologies for Visually Impaired People , 2022, Sensors.

[9]  T. Whangbo,et al.  Attention 3D U-Net with Multiple Skip Connections for Segmentation of Brain Tumor Images , 2022, Sensors.

[10]  Jinsoo Cho,et al.  Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language , 2022, Sensors.

[11]  M. Mukhiddinov,et al.  Automatic Fire Detection and Notification System Based on Improved YOLOv4 for the Blind and Visually Impaired , 2022, Sensors.

[12]  Mukhriddin Mukhiddinov,et al.  An improvement for the automatic classification method for ultrasound images used on CNN , 2021, Int. J. Wavelets Multiresolution Inf. Process..

[13]  Ugur Ayvaz,et al.  Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning , 2022, Computers, Materials & Continua.

[14]  A. Abdusalomov,et al.  LDA-Based Topic Modeling Sentiment Analysis Using Topic/Document/Sentence (TDS) Model , 2021, Applied Sciences.

[15]  Taeg Keun Whangbo,et al.  3D Volume Reconstruction from MRI Slices based on VTK , 2021, 2021 International Conference on Information and Communication Technology Convergence (ICTC).

[16]  Taeg Keun Whangbo,et al.  An Improvement of the Fire Detection and Classification Method Using YOLOv3 for Surveillance Systems , 2021, Sensors.

[17]  Stefanos Zafeiriou,et al.  Masked Face Recognition Challenge: The InsightFace Track Report , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[18]  Payman Zarkesh-Ha,et al.  Face Recognition on a Smart Image Sensor Using Local Gradients , 2021, Sensors.

[19]  Amjad J. Humaidi,et al.  Review of deep learning: concepts, CNN architectures, challenges, applications, future directions , 2021, Journal of Big Data.

[20]  Young Im Cho,et al.  Automatic Fire and Smoke Detection Method for Surveillance Systems Based on Dilated CNNs , 2020, Atmosphere.

[21]  Young Im Cho,et al.  Improvement of the end-to-end scene text recognition method for "text-to-speech" conversion , 2020, Int. J. Wavelets Multiresolution Inf. Process..

[22]  Zhen Lei,et al.  Towards Fast, Accurate and Stable 3D Dense Face Alignment , 2020, ECCV.

[23]  Young Im Cho,et al.  Automatic Moving Shadow Detection and Removal Method for Smart City Environments , 2020 .

[24]  Irene Kotsia,et al.  RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Taeg Keun Whangbo,et al.  Automatic Salient Object Extraction Based on Locally Adaptive Thresholding to Generate Tactile Graphics , 2020, Applied Sciences.

[26]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[28]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[29]  Taeg Keun Whangbo,et al.  Detection and Removal of Moving Object Shadows Using Geometry and Color Information for Indoor Video Streams , 2019 .

[30]  Taeg Keun Whangbo,et al.  Fully Automatic Stroke Symptom Detection Method Based on Facial Features and Moving Hand Differences , 2019, 2019 International Symposium on Multimedia and Communication Technology (ISMAC).

[31]  Jian Yang,et al.  DSFD: Dual Shot Face Detector , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Shifeng Zhang,et al.  Selective Refinement Network for High Performance Face Detection , 2018, AAAI.

[33]  Yan Yan,et al.  Multi-task Learning of Cascaded CNN for Facial Attribute Classification , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[34]  Xu Tang,et al.  PyramidBox: A Context-assisted Single Shot Face Detector , 2018, ECCV.

[35]  Ran Tao,et al.  Seeing Small Faces from Robust Anchor's Perspective , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[36]  Xiang Xu,et al.  Face Detection Using Improved Faster RCNN , 2018, ArXiv.

[37]  Hao Wang,et al.  Detecting Faces Using Region-based Fully Convolutional Networks , 2017 .

[38]  Shifeng Zhang,et al.  S^3FD: Single Shot Scale-Invariant Face Detector , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[39]  Shifeng Zhang,et al.  FaceBoxes: A CPU real-time face detector with high accuracy , 2017, 2017 IEEE International Joint Conference on Biometrics (IJCB).

[40]  Larry S. Davis,et al.  SSH: Single Stage Headless Face Detector , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[41]  Taeg Keun Whangbo,et al.  An improvement for the foreground recognition method using shadow removal technique for indoor environments , 2017, Int. J. Wavelets Multiresolution Inf. Process..

[42]  Peiyun Hu,et al.  Finding Tiny Faces , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Р Ю Чуйков,et al.  Обнаружение транспортных средств на изображениях загородных шоссе на основе метода Single shot multibox Detector , 2017 .

[45]  Shanu Sharma,et al.  Review and comparison of face detection algorithms , 2017, 2017 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence.

[46]  Taeg Keun Whangbo,et al.  A Review on various widely used shadow detection methods to identify a shadow from image , 2016 .

[47]  Marios Savvides,et al.  CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection , 2016, ArXiv.

[48]  Yi Li,et al.  R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[49]  Abhinav Gupta,et al.  Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Yu Qiao,et al.  Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[51]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[52]  Shuo Yang,et al.  WIDER FACE: A Face Detection Benchmark , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  A. Abdusalomov,et al.  Robust Shadow Removal Technique For Improving Image Enhancement Based On Segmentation Method , 2016 .

[55]  Gang Hua,et al.  A convolutional neural network cascade for face detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[58]  Xiaogang Wang,et al.  Deep Learning Face Representation from Predicting 10,000 Classes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Igor S. Pandzic,et al.  Fast Localization of Facial Landmark Points , 2014, ArXiv.

[60]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[61]  Ivanna K. Timotius,et al.  A Frontal Pose Face Detection and Classification System Based on Haar Wavelet Coefficients and Support Vector Machine , 2011 .

[62]  Erik Learned-Miller,et al.  FDDB: A benchmark for face detection in unconstrained settings , 2010 .

[63]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[64]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.