HLNet: A Unified Framework for Real-Time Segmentation and Facial Skin Tones Evaluation

Real-time semantic segmentation plays a crucial role in industrial applications such as autonomous driving and the beauty industry. Balancing speed against segmentation performance remains a challenging problem. To address it, this paper introduces HLNet, an efficient convolutional neural network (CNN) architecture for devices with limited resources. Built from carefully designed modules, HLNet better integrates high-dimensional and low-dimensional information while obtaining sufficient receptive fields, and it achieves strong results on three benchmark datasets. For a 224×224 input, HLNet achieves 78.39% mean IoU on the Figaro1k dataset at over 17 FPS on a CPU. To our knowledge, the accuracy of skin tone classification is usually unsatisfactory because of external environmental factors such as illumination and background impurities. Therefore, we use HLNet to obtain accurate face regions and then apply a color moment algorithm to extract their color features. Using these masked color moments for skin tone grade evaluation, we obtain approximately 80% classification accuracy, which demonstrates the feasibility of the proposed method.
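
To make the masked color moment step concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the standard definition of color moments (per-channel mean, standard deviation, and the cube root of the third central moment) computed only over pixels selected by a binary face mask. The function name `masked_color_moments`, the choice of three moments, and the color space of the input are assumptions; the abstract does not specify them.

```python
import numpy as np


def masked_color_moments(image, mask):
    """Return per-channel mean, std, and skewness over masked pixels.

    image: H x W x C array (any color space; the choice is an assumption).
    mask:  H x W array, nonzero where the pixel belongs to the face region.
    Output: 1-D array of length 3 * C, ordered [means, stds, skews].
    """
    # Boolean indexing keeps only the masked pixels, shape (N, C).
    pixels = image[mask.astype(bool)].astype(np.float64)
    if pixels.size == 0:
        raise ValueError("mask selects no pixels")

    mean = pixels.mean(axis=0)
    centered = pixels - mean
    # Second moment: population standard deviation per channel.
    std = np.sqrt((centered ** 2).mean(axis=0))
    # Third moment: signed cube root of the third central moment.
    skew = np.cbrt((centered ** 3).mean(axis=0))
    return np.concatenate([mean, std, skew])
```

In this sketch the mask would come from the segmentation output, so background pixels never enter the statistics; the resulting feature vector could then be passed to any classifier for skin tone grading.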
