Benchmarking Shadow Removal for Facial Landmark Detection and Beyond

Facial landmark detection is a very fundamental and significant vision task with many important applications. In practice, the facial landmark detection can be affected by a lot of natural degradations. One of the most common and important degradations is the shadow caused by light source blocking. While many advanced shadow removal methods have been proposed to recover the image quality in recent years, their effects to facial landmark detection are not well studied. For example, it remains unclear whether the shadow removal could enhance the robustness of facial landmark detection to diverse shadow patterns or not. In this work, for the first attempt, we construct a novel benchmark to link two independent but related tasks (i.e., shadow removal and facial landmark detection). In particular, the proposed benchmark covers diverse face shadows with different intensities, sizes, shapes, and locations. Moreover, to mine hard shadow patterns against facial landmark detection, we propose a novel method (i.e., adversarial shadow attack), which allows us to construct a challenging subset of the benchmark for a comprehensive analysis. With the constructed benchmark, we conduct extensive analysis on three state-of-the-art shadow removal methods and three landmark detectors. The observation of this work motivates us to design a novel detection-aware shadow removal framework, which empowers shadow removal to achieve higher restoration quality and enhance the shadow robustness of deployed facial landmark detectors.

[1]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Felix Juefei-Xu,et al.  SPARK: Spatial-Aware Online Incremental Attack Against Visual Tracking , 2019, ECCV.

[3]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[4]  Dimitris Samaras,et al.  Shadow Removal via Shadow Image Decomposition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[5]  Junwei Han,et al.  Learning Selective Self-Mutual Attention for RGB-D Saliency Detection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Qiang Ji,et al.  Facial Landmark Detection: A Literature Survey , 2018, International Journal of Computer Vision.

[7]  Jianzhu Guo,et al.  Towards Fast, Accurate and Stable 3D Dense Face Alignment , 2020, ECCV.

[8]  Marios Savvides,et al.  An image statistics approach towards efficient and robust refinement for landmarks on facial boundary , 2013, 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[9]  Yang Liu,et al.  Auto-Exposure Fusion for Single-Image Shadow Removal , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Chi-Wing Fu,et al.  Mask-ShadowGAN: Learning to Remove Shadows From Unpaired Data , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Dani Lischinski,et al.  The Shadow Meets the Mask: Pyramid‐Based Shadow Removal , 2008, Comput. Graph. Forum.

[12]  Pat Hanrahan,et al.  Reflection from layered surfaces due to subsurface scattering , 1993, SIGGRAPH.

[13]  Yi Yang,et al.  Style Aggregated Network for Facial Landmark Detection , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Hongkai Yu,et al.  Adversarial Relighting against Face Recognition , 2021, ArXiv.

[15]  Qing Guo,et al.  AVA: Adversarial Vignetting Attack against Visual Recognition , 2021, IJCAI.

[16]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[17]  Yu Liu,et al.  Exploring Disentangled Feature Representation Beyond Face Identification , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]  Felix Juefei-Xu,et al.  Fooling LiDAR Perception via Adversarial Trajectory Perturbation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[19]  Le Hui,et al.  Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  Feng Liu,et al.  Joint Face Alignment and 3D Face Reconstruction , 2016, ECCV.

[21]  Yuanjun Xiong,et al.  Learning Self-Consistency for Deepfake Detection , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  José Miguel Buenaposada,et al.  A Deeply-Initialized Coarse-to-fine Ensemble of Regression Trees for Face Alignment , 2018, ECCV.

[23]  Weisi Lin,et al.  Adversarial Exposure Attack on Diabetic Retinopathy Imagery , 2020, ArXiv.

[24]  Toshihiko Yamasaki,et al.  Learning From Synthetic Shadows for Shadow Detection and Removal , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Qi Zou,et al.  Effects of Image Degradation and Degradation Removal to CNN-Based Image Classification , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Song Wang,et al.  Making Images Undiscoverable from Co-Saliency Detection , 2020, ArXiv.

[27]  Yici Cai,et al.  Look at Boundary: A Boundary-Aware Face Alignment Algorithm , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Hari Sundaram,et al.  Estimating Complexity of 2D Shapes , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[29]  Xiangyu Zhu,et al.  High-fidelity Pose and Expression Normalization for face recognition in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Stefanos Zafeiriou,et al.  300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[31]  Yang Zhao,et al.  Deep High-Resolution Representation Learning for Visual Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Qing Guo,et al.  AdvHaze: Adversarial Haze Attack , 2021, ArXiv.

[33]  Hayder Radha,et al.  Object Detection Under Rainy Conditions for Autonomous Vehicles: A Review of State-of-the-Art and Emerging Techniques , 2021, IEEE Signal Processing Magazine.

[34]  Liang Liu,et al.  FReeNet: Multi-Identity Face Reenactment , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Lei Ma,et al.  Learning to Adversarially Blur Visual Object Tracking , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[36]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[37]  Weisi Lin,et al.  Pasadena: Perceptually Aware and Stealthy Adversarial Denoise Attack , 2020, IEEE Transactions on Multimedia.

[38]  Felix Juefei-Xu,et al.  Watch out! Motion is Blurring the Vision of Your Deep Neural Networks , 2020, NeurIPS.

[39]  Rynson W. H. Lau,et al.  DeshadowNet: A Multi-context Embedding Deep Network for Shadow Removal , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Chi-Wing Fu,et al.  Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection , 2018, ECCV.

[41]  David E. Jacobs,et al.  Portrait shadow manipulation , 2020, ACM Trans. Graph..

[42]  Abhinav Gupta,et al.  Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43]  Jiahuan Zhou,et al.  Learning Robust Facial Landmark Detection via Hierarchical Structured Ensemble , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[44]  Qing Guo,et al.  Let There Be Light: Improved Traffic Surveillance via Detail Preserving Night-to-Day Transfer , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[45]  Lei Ma,et al.  It's Raining Cats or Dogs? Adversarial Rain Attack on DNN Perception , 2020, ArXiv.

[46]  Run Wang,et al.  Amora: Black-box Adversarial Morphing Attack , 2019, ACM Multimedia.

[47]  Geguang Pu,et al.  AdvBokeh: Learning to Adversarially Defocus Blur , 2021, ArXiv.

[48]  Andrew Zisserman,et al.  Recurrent Human Pose Estimation , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[49]  Qing Guo,et al.  Bias Field Poses a Threat to DNN-based X-Ray Recognition , 2020, ICME.

[50]  Ye Wang,et al.  LUVLi Face Alignment: Estimating Landmarks’ Location, Uncertainty, and Visibility Likelihood , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Takayuki Okatani,et al.  Feature Quantization for Defending Against Distortion of Images , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.