Body plans

This paper describes a representation for people and animals, called a body plan, which is adapted to segmentation and to recognition in complex environments. The representation is an organized collection of grouping hints obtained from a combination of constraints on color and texture and constraints on geometric properties such as the structure of individual parts and the relationships between parts. Body plans can be learned from image data, using established statistical learning techniques. The approach is illustrated with two examples of programs that successfully use body plans for recognition: one example involves determining whether a picture contains a scantily clad human, using a body plan built by hand; the other involves determining whether a picture contains a horse, using a body plan learned from image data. In both cases, the system demonstrates excellent performance on large, uncontrolled test sets and very large and diverse control sets.
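The idea of a body plan as an organized collection of grouping hints can be illustrated with a minimal sketch. It is not the paper's implementation: the `Segment` type, the function names, and all thresholds are hypothetical, and the color/texture test is assumed to have been computed elsewhere. It only shows the general shape of the approach, in which a local color/texture and single-part geometry test filters candidate parts, and a relational test then decides which parts may be grouped.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Tuple


@dataclass(frozen=True)
class Segment:
    """A candidate image region approximated as an elongated part (hypothetical type)."""
    x: float          # centre x coordinate
    y: float          # centre y coordinate
    length: float     # extent along the major axis
    width: float      # extent along the minor axis
    skin_like: bool   # outcome of a colour/texture test, assumed precomputed


def is_part_candidate(s: Segment) -> bool:
    """Colour/texture constraint plus a constraint on the structure of one part."""
    # Illustrative threshold: a part must be at least twice as long as it is wide.
    return s.skin_like and s.length >= 2.0 * s.width


def can_group(a: Segment, b: Segment) -> bool:
    """Constraint on the relationship between two parts."""
    # Illustrative thresholds: comparable widths and centres that lie close together.
    similar_width = max(a.width, b.width) <= 1.5 * min(a.width, b.width)
    nearby = hypot(a.x - b.x, a.y - b.y) <= 0.5 * (a.length + b.length)
    return similar_width and nearby


def find_groups(segments: List[Segment]) -> List[Tuple[Segment, Segment]]:
    """Filter segments into candidate parts, then pair the parts that may be grouped."""
    parts = [s for s in segments if is_part_candidate(s)]
    return [
        (a, b)
        for i, a in enumerate(parts)
        for b in parts[i + 1:]
        if can_group(a, b)
    ]
```

In a learned body plan, the hand-set thresholds in `is_part_candidate` and `can_group` would be replaced by classifiers trained on image data, and the resulting pairs would feed further grouping stages that build larger assemblies.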
