Automatic Detection of Human Nudes

This paper demonstrates an automatic system for determining whether an image contains human nudes. The system first marks skin-like pixels using combined color and texture properties. The resulting skin regions are passed to a specialized grouper, which attempts to assemble a human figure using geometric constraints on human structure. If the grouper finds a sufficiently complex structure, the system decides that a human is present. The approach is shown to be effective for a wide range of skin shades and colors and of human configurations. It also offers an alternative view of object recognition, in which an object model is an organized collection of grouping hints derived from constraints on color and texture together with constraints on geometric properties, such as the structure of individual parts and the relationships between parts. The system demonstrates excellent performance on a test set of 565 uncontrolled images of human nudes, mostly obtained from the internet, and 4289 assorted control images drawn from a wide variety of sources.
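
The first stage described above, marking skin-like pixels from combined color and texture cues, might be sketched roughly as follows. This is only an illustrative sketch, not the system's actual filter: the hue and saturation bounds, the 3x3 texture measure, and all function names (`skin_mask`, `skin_fraction`, and so on) are assumptions introduced here, and the geometric grouper is not attempted.

```python
import colorsys

# Illustrative thresholds (assumptions, not values taken from the paper).
HUE_MAX = 50 / 360.0            # keep reddish-to-yellowish hues
SAT_MIN, SAT_MAX = 0.15, 0.75   # reject grey and fully saturated pixels
TEXTURE_MAX = 0.05              # max local intensity deviation for "smooth" pixels


def intensity(pixel):
    """Mean of the R, G, B channels, scaled to [0, 1]."""
    r, g, b = pixel
    return (r + g + b) / (3 * 255.0)


def local_texture(image, y, x):
    """Absolute difference between a pixel's intensity and its 3x3 neighbourhood mean."""
    h, w = len(image), len(image[0])
    neighbours = [
        intensity(image[j][i])
        for j in range(max(0, y - 1), min(h, y + 2))
        for i in range(max(0, x - 1), min(w, x + 2))
    ]
    return abs(intensity(image[y][x]) - sum(neighbours) / len(neighbours))


def is_skin_like(image, y, x):
    """A pixel is marked only if both the color test and the texture test pass."""
    r, g, b = image[y][x]
    hue, sat, _ = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    color_ok = hue <= HUE_MAX and SAT_MIN <= sat <= SAT_MAX
    return color_ok and local_texture(image, y, x) <= TEXTURE_MAX


def skin_mask(image):
    """Boolean mask over an image given as rows of (r, g, b) tuples in 0..255."""
    return [
        [is_skin_like(image, y, x) for x in range(len(image[0]))]
        for y in range(len(image))
    ]


def skin_fraction(mask):
    """Fraction of pixels marked as skin-like."""
    total = sum(len(row) for row in mask)
    return sum(sum(row) for row in mask) / total
```

In a pipeline of the kind the abstract outlines, a mask like this would only propose candidate regions; the decision that a human is present is left to the grouper's geometric tests rather than to the skin fraction alone.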
