Cumulative attribute space regression for head pose estimation and color constancy

Abstract Two-stage Cumulative Attribute (CA) regression has been found effective in regression problems of computer vision such as facial age and crowd density estimation. The first stage regression maps input features to cumulative attributes that encode correlations between target values. The previous works have dealt with single output regression. In this work, we propose cumulative attribute spaces for 2- and 3-output (multivariate) regression. We show how the original CA space can be generalized to multiple output by the Cartesian product (CartCA). However, for target spaces with more than two outputs the CartCA becomes computationally infeasible and therefore we propose an approximate solution - multi-view CA (MvCA) - where CartCA is applied to output pairs. We experimentally verify improved performance of the CartCA and MvCA spaces in 2D and 3D face pose estimation and three-output (RGB) illuminant estimation for color constancy.

[1]  Chi Fang,et al.  Head Pose Estimation Based on Random Forests for Multiclass Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[2]  Joost van de Weijer,et al.  Improving Color Constancy by Photometric Edge Weighting , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[4]  Brian V. Funt,et al.  A data set for color research , 2002 .

[5]  Raimondo Schettini,et al.  Color constancy using CNNs , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[6]  Joost van de Weijer,et al.  Author Manuscript, Published in "ieee Transactions on Image Processing Edge-based Color Constancy , 2022 .

[7]  Xiantong Zhen,et al.  Descriptor Learning via Supervised Manifold Regularization for Multioutput Regression , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Bodo Rosenhahn,et al.  Class Generative Models Based on Feature Regression for Pose Estimation of Object Categories , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Jiri Matas,et al.  Deep structured-output regression learning for computational color constancy , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[10]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[11]  Qingshan Liu,et al.  Robust facial landmark tracking via cascade regression , 2017, Pattern Recognit..

[12]  Luc Van Gool,et al.  Real Time Head Pose Estimation from Consumer Depth Cameras , 2011, DAGM-Symposium.

[13]  Vijayan K. Asari,et al.  A Two-Layer Framework for Piecewise Linear Manifold-Based Head Pose Estimation , 2012, International Journal of Computer Vision.

[14]  Shaogang Gong,et al.  Attribute Learning for Understanding Unstructured Social Activity , 2012, ECCV.

[15]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[16]  Xin Geng,et al.  Head Pose Estimation Based on Multivariate Label Distribution , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Ke Chen,et al.  Spectral attribute learning for visual regression , 2017, Pattern Recognit..

[18]  Shaogang Gong,et al.  Cumulative Attribute Space for Age and Crowd Density Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Mark S. Drew,et al.  The Role of Bright Pixels in Illumination Estimation , 2012, Color Imaging Conference.

[20]  Massimiliano Pontil,et al.  Multi-Task Feature Learning , 2006, NIPS.

[21]  Silvio Savarese,et al.  Recognizing human actions by attributes , 2011, CVPR 2011.

[22]  Theo Gevers,et al.  Color Constancy Using Natural Image Statistics and Scene Semantics , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  NowozinSebastian,et al.  Structured Learning and Prediction in Computer Vision , 2011 .

[24]  Kai-Fu Yang,et al.  Efficient illuminant estimation for color constancy using grey pixels , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Shaogang Gong,et al.  Person Re-Identification by Support Vector Ranking , 2010, BMVC.

[26]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[27]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Weifeng Liu,et al.  Canonical correlation analysis networks for two-view image recognition , 2017, Inf. Sci..

[29]  Jun Yu,et al.  Multitask Autoencoder Model for Recovering Human Poses , 2018, IEEE Transactions on Industrial Electronics.

[30]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[31]  Horst Bischof,et al.  Alternating Decision Forests , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  K AsariVijayan,et al.  A Two-Layer Framework for Piecewise Linear Manifold-Based Head Pose Estimation , 2013 .

[33]  Larry S. Davis,et al.  On partial least squares in head pose estimation: How to simultaneously deal with misalignment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Kobus Barnard,et al.  Improvements to Gamut Mapping Colour Constancy Algorithms , 2000, ECCV.

[35]  Rama Chellappa,et al.  Growing Regression Forests by Classification: Applications to Object Pose Estimation , 2013, ECCV.

[36]  Jian-Jiun Ding,et al.  Facial age estimation based on label-sensitive learning and age-oriented regression , 2013, Pattern Recognit..

[37]  Luc Van Gool,et al.  Random Forests for Real Time 3D Face Analysis , 2012, International Journal of Computer Vision.

[38]  Songcan Chen,et al.  Cumulative attribute relation regularization learning for human age estimation , 2015, Neurocomputing.

[39]  Sebastian Nowozin,et al.  Structured Learning and Prediction in Computer Vision , 2011, Found. Trends Comput. Graph. Vis..

[40]  Rama Chellappa,et al.  Growing Regression Tree Forests by Classification for Continuous Object Pose Estimation , 2017, International Journal of Computer Vision.

[41]  Keigo Hirakawa,et al.  Color Constancy with Spatio-Spectral Statistics , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Kristen Grauman,et al.  Relative attributes , 2011, 2011 International Conference on Computer Vision.

[43]  J. Crowley,et al.  Estimating Face orientation from Robust Detection of Salient Facial Structures , 2004 .

[44]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[45]  Mark S. Drew,et al.  Exemplar-Based Color Constancy and Multiple Illumination , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Horst Bischof,et al.  Alternating Regression Forests for Object Detection and Pose Estimation , 2013, 2013 IEEE International Conference on Computer Vision.

[47]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[48]  Luc Van Gool,et al.  Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[49]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[50]  Zhi-Hua Zhou,et al.  Automatic Age Estimation Based on Facial Aging Patterns , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Andrew Blake,et al.  Bayesian color constancy revisited , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Jun Yu,et al.  Local Deep-Feature Alignment for Unsupervised Dimension Reduction , 2018, IEEE Transactions on Image Processing.