Learning visual biases from human imagination

Although the human visual system can recognize many concepts under challenging conditions, it still has some biases. In this paper, we investigate whether we can extract these biases and transfer them into a machine recognition system. We introduce a novel method that, inspired by well-known tools in human psychophysics, estimates the biases that the human visual system might use for recognition, but in computer vision feature spaces. Our experiments are surprising, and suggest that classifiers from the human visual system can be transferred into a machine with some success. Since these classifiers seem to capture favorable biases in the human visual system, we further present an SVM formulation that constrains the orientation of the SVM hyperplane to agree with the bias from human visual system. Our results suggest that transferring this human bias into machines may help object recognition systems generalize across datasets and perform better when very little training data is available.

[1]  P. Bennett,et al.  Inversion Leads to Quantitative, Not Qualitative, Changes in Face Processing , 2004, Current Biology.

[2]  P. Neri Estimation of nonlinear psychophysical kernels. , 2004, Journal of vision.

[3]  Miguel P Eckstein,et al.  Classification images: a tool to analyze visual strategies. , 2002, Journal of vision.

[4]  Julie E. Boland,et al.  Cultural variation in eye movements during scene perception. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Trevor Darrell,et al.  What you saw is not what you get: Domain adaptation using asymmetric kernel transforms , 2011, CVPR 2011.

[6]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Rong Yan,et al.  Adapting SVM Classifiers to Data with Shifted Distributions , 2007 .

[8]  Andrew Zisserman,et al.  Tabula rasa: Model transfer for object category detection , 2011, 2011 International Conference on Computer Vision.

[9]  R. B. Macleod,et al.  A Source Book Of Gestalt Psychology , 1939 .

[10]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[11]  Michelle R. Greene,et al.  Visual Noise from Natural Scene Statistics Reveals Human Scene Category Representations , 2014, ArXiv.

[12]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[13]  Alexei A. Efros,et al.  Unbiased look at dataset bias , 2011, CVPR 2011.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  A. Ahumada,et al.  Stimulus Features in Signal Detection , 1971 .

[16]  Kristen Grauman,et al.  Large-scale live active learning: Training object detectors with crawled data and crowds , 2011, CVPR.

[17]  P. Schyns,et al.  Superstitious Perceptions Reveal Properties of Internal Representations , 2003, Psychological science.

[18]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[19]  Patrick Pérez,et al.  Reconstructing an image from its local descriptors , 2011, CVPR 2011.

[20]  Pierre Vandergheynst,et al.  Beyond bits: Reconstructing images from Local Binary Descriptors , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[21]  S. Li Concise Formulas for the Area and Volume of a Hyperspherical Cap , 2011 .

[22]  Pietro Perona,et al.  The Multidimensional Wisdom of Crowds , 2010, NIPS.

[23]  Yair Weiss,et al.  Learning about Canonical Views from Internet Image Collections , 2012, NIPS.

[24]  Michael C. Mangini,et al.  Making the ineffable explicit: estimating the information employed for face classifications , 2004, Cogn. Sci..

[25]  Richard F Murray,et al.  Classification images: A review. , 2011, Journal of vision.

[26]  Marin Ferecatu,et al.  A Statistical Framework for Image Category Search from a Mental Picture , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Antonio Torralba,et al.  HOGgles: Visualizing Object Detection Features , 2013, 2013 IEEE International Conference on Computer Vision.

[28]  Devi Parikh Human-Debugging of Machines , 2011 .

[29]  Trevor Darrell,et al.  One-Shot Adaptation of Supervised Deep Convolutional Models , 2013, ICLR.

[30]  Alexei A. Efros,et al.  Undoing the Damage of Dataset Bias , 2012, ECCV.

[31]  Tatsuya Harada,et al.  Image Reconstruction from Bag-of-Visual-Words , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Christoph H. Lampert,et al.  Beyond Dataset Bias: Multi-task Unaligned Shared Knowledge Transfer , 2012, ACCV.

[33]  Gerald DeJong,et al.  Rotational Prior Knowledge for SVMs , 2005, ECML.

[34]  Joshua B. Tenenbaum,et al.  Learning to share visual appearance for multiclass object detection , 2011, CVPR 2011.

[35]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[36]  Cordelia Schmid,et al.  Dataset Issues in Object Recognition , 2006, Toward Category-Level Object Recognition.

[37]  Pietro Perona,et al.  Visual Recognition with Humans in the Loop , 2010, ECCV.

[38]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[39]  A. Ahumada Perceptual Classification Images from Vernier Acuity Masked by Noise , 1996 .

[40]  Albert J. Ahumada,et al.  Technique to extract relevant image features for visual tasks , 1998, Electronic Imaging.

[41]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  David A. Forsyth,et al.  Utility data annotation with Amazon Mechanical Turk , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[43]  Manuel Blum,et al.  Peekaboom: a game for locating objects in images , 2006, CHI.

[44]  ZissermanAndrew,et al.  The Pascal Visual Object Classes Challenge , 2015 .

[45]  Rachael E. Jack,et al.  Culture Shapes How We Look at Faces , 2008, PloS one.

[46]  Andrea Vedaldi,et al.  Understanding deep image representations by inverting them , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[48]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.