Recognizing surface qualities from natural images based on learning to rank

This paper proposes a method for estimating the quantitative values of some attributes associated with surface qualities of an object, such as glossiness and transparency, from its image. Our approach is to learn functions that compute such attribute values from the input image by using training data given in the form of relative information. To be specific, each sample of the training data represents that, for a pair of images, which is greater in terms of the target attribute. The functions are learned based on leaning to rank. This approach enables us to deal with natural images, which cannot be dealt with in previous works, which are based on CG synthesized images for both training and testing. We created data sets using the Flickr Material Database for four attributes of glossiness, transparency, smoothness, and coldness, and learn the functions representing the values of these attributes. We present experimental results that the learned functions show very promising performances in the estimation of the attribute values.

[1]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[2]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[3]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4]  Edward H. Adelson,et al.  Exploring features in a Bayesian framework for material recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Kristen Grauman,et al.  Relative attributes , 2011, 2011 International Conference on Computer Vision.

[6]  E. Adelson,et al.  Recognition of Surface Reflectance Properties from a Single Image under Unknown Real-World Illumination , 2001 .

[7]  Naokazu Goda,et al.  Transformation from image-based to perceptual representation of materials along the human ventral visual pathway , 2011, NeuroImage.

[8]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[9]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10]  I. Motoyoshi Highlight-shading relationship as a cue for the perception of translucent and transparent materials. , 2010, Journal of vision.

[11]  E. Adelson,et al.  Image statistics and the perception of surface qualities , 2007, Nature.