Visual Analogies: A Framework for Defining Aspect Categorization

Analogies are familiar as simple word problems (calf is to cow as x is to sheep), and we pose the same kind of problem between images. Let \(\mathcal {I}[\mathcal {A},\theta ]\) be an image of object \(\mathcal {A}\) at view \(\theta \). We show how to learn to choose an image \(\mathcal {I}\) such that \(\mathcal {I}[\mathcal {A},\phi ]\) is to \(\mathcal {I}[\mathcal {A},\theta ]\) as \(\mathcal {I}\) is to \(\mathcal {I}[\mathcal {B},\theta ]\). We introduce a framework that identifies an image of a familiar object at an unfamiliar angle, and we extend the method to handle unfamiliar objects. In doing so, we identify pairs of objects that are good at finding new views of one another. This yields an operational notion of aspectual equivalence: two objects are equivalent if each predicts the other’s appearance well.
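
As an illustration of the selection problem only, the following minimal sketch picks, from a pool of candidate images, the one that best completes the analogy. It assumes every image has already been mapped to a fixed-length feature vector and that the view change from \(\theta \) to \(\phi \) acts roughly as an additive offset in feature space. That offset rule, the cosine score, and the names `cosine` and `choose_analogous_image` are hypothetical choices made for this example; they are not the learned method described above.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def choose_analogous_image(f_a_phi, f_a_theta, f_b_theta, candidates):
    """Return the index of the candidate that best completes the analogy
    I[A, phi] : I[A, theta] :: I : I[B, theta].

    Assumption (not from the paper): the change of view is modelled as the
    feature offset f(A, phi) - f(A, theta), which is added to f(B, theta).
    """
    target = [b + (p - t) for p, t, b in zip(f_a_phi, f_a_theta, f_b_theta)]
    scores = [cosine(target, c) for c in candidates]
    return max(range(len(candidates)), key=scores.__getitem__)


# Toy features: first coordinate stands for object identity, second for view.
f_a_theta = [1.0, 0.0]   # object A at view theta
f_a_phi = [1.0, 1.0]     # object A at view phi
f_b_theta = [2.0, 0.0]   # object B at view theta
pool = [
    [2.0, 0.0],          # B at theta (the query itself)
    [2.0, 1.0],          # B at phi (the desired answer)
    [1.0, 1.0],          # A at phi (a distractor)
]
best = choose_analogous_image(f_a_phi, f_a_theta, f_b_theta, pool)
```

In this toy pool the second candidate matches the offset target exactly, so it is selected. A learned scoring function would take the place of the fixed offset-and-cosine rule, and how well one object's views retrieve the correct views of another is the kind of measurement the notion of aspectual equivalence rests on.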
