Kernel Density Network for Quantifying Regression Uncertainty in Face Alignment

For deep neural networks, it is important to quantify the uncertainty in its predictions. So a probabilistic neural network with a Gaussian assumption was widely used. However, in real data especially image data, the Gaussian assumption typically cannot hold. We are interested in modeling a more general distribution, e.g. multi-modal or asymmetric distribution. Therefore, a kernel density neural network is proposed. We adopt state-of-the-art neural network architecture and propose a new loss function based on maximizing the conditional log likelihood. And we show its application in face alignment. The proposed loss function achieves comparable or better performance than state-of-the-art end-to-end trainable deep learning based methods in terms of both the predicted labels and uncertainty in predictions. Moreover, it can be generally extended to many other regression problems such as Action Unit intensity estimation and face age estimation.

[1]  A. Weigend,et al.  Estimating the mean and variance of the target probability distribution , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[2]  Alexander J. Smola,et al.  Heteroscedastic Gaussian process regression , 2005, ICML.

[3]  Jonathan Tompson,et al.  Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[4]  Trevor Darrell,et al.  Fully convolutional networks for semantic segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Jia Deng,et al.  Stacked Hourglass Networks for Human Pose Estimation , 2016, ECCV.

[6]  Georgios Tzimiropoulos,et al.  How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Alex Kendall,et al.  What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.

[8]  Shiguang Shan,et al.  Mean-Variance Loss for Deep Age Estimation from a Face , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.