论文信息 - Visual Classification by a Hierarchy of Extended Fragments

Visual Classification by a Hierarchy of Extended Fragments

The chapter describes visual classification by a hierarchy of semantic fragments. In fragment-based classification, objects within a class are represented by common sub-structures selected during training. The chapter describes two extensions to the basic fragment-based scheme. The first extension is the extraction and use of feature hierarchies. We describe a method that automatically constructs complete feature hierarchies from image examples, and show that features constructed hierarchically are significantly more informative and better for classification compared with similar non-hierarchical features. The second extension is the use of so-called semantic fragments to represent object parts. The goal of a semantic fragment is to represent the different possible appearances of a given object part. The visual appearance of such object parts can differ substantially, and therefore traditional image similarity-based methods are inappropriate for the task. We show how the method can automatically learn the part structure of a new domain, identify the main parts, and how their appearance changes across objects in the class. We discuss the implications of these extensions to object classification and recognition.

Shimon Ullman | Boris Epshtein

[1] Shimon Ullman,et al. Feature hierarchies for object classification , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[2] D. M. Green,et al. Signal detection theory and psychophysics , 1966 .

[3] Shimon Ullman,et al. Class-Based Matching of Object Parts , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[4] Peter Földiák,et al. Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[5] Cordelia Schmid,et al. A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Cordelia Schmid,et al. Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[7] Edmund T. Rolls,et al. Invariant Object Recognition in the Visual System with Novel Views of 3D Objects , 2002, Neural Computation.

[8] T. Poggio,et al. Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[9] Norbert Krüger,et al. Face Recognition by Elastic Bunch Graph Matching , 1997, CAIP.

[10] I. Biederman. Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[11] Thomas Serre,et al. Categorization by Learning and Combining Object Parts , 2001, NIPS.

[12] Norbert Krüger,et al. Face Recognition by Elastic Bunch Graph Matching , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[13] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[14] Shimon Ullman,et al. Identifying semantically equivalent object fragments , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15] Dan Roth,et al. Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16] Michel Vidal-Naquet,et al. Visual features of intermediate complexity and their use in classification , 2002, Nature Neuroscience.

[17] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18] Shimon Ullman,et al. Computation of pattern invariance in brain-like structures , 1999, Neural Networks.

[19] D. Marr,et al. Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[20] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[21] Shimon Ullman,et al. Object recognition with informative features and linear classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[22] Shimon Ullman,et al. Recognition invariance obtained by extended and invariant features , 2004, Neural Networks.

[23] C Tomasi,et al. Shape and motion from image streams: a factorization method. , 1992, Proceedings of the National Academy of Sciences of the United States of America.