Accuracy versus speed in context-based object detection

The visual detection and recognition of objects is facilitated by context. This paper studies two types of learning methods for realizing context-based object detection in paintings. The first method is called the gradient method; it learns to transform the spatial context into a gradient towards the object. The second method, the context-detection method, learns to detect image regions that are likely to contain objects. The accuracy and speed of both methods are evaluated on a face-detection task involving natural and painted faces in a wide variety of contexts. The experimental results show that the gradient method enhances accuracy at the cost of computational speed, whereas the context-detection method optimises speed at the cost of accuracy. The different results of both methods are argued to arise from the different ways in which the methods trade-off accuracy and speed. We conclude that both the gradient method and the context-detection method can be applied to reliable and fast object detection in paintings and that the choice for either method depends on the application and user constraints.

[1]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[2]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[3]  W. R. Howard The Nature of Mathematical Modeling , 2006 .

[4]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[5]  Eric O. Postma,et al.  Discovering the Visual Signature of Painters , 2000 .

[6]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[7]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Eric O. Postma,et al.  A Context-Based Model of Attention , 2004, ECAI.

[9]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[10]  David G. Stork,et al.  Pattern Classification , 1973 .

[11]  S. Coren,et al.  In Sensation and perception , 1979 .

[12]  Leslie S. Smith,et al.  The principal components of natural images , 1992 .

[13]  Antonio Torralba,et al.  Contextual Modulation of Target Saliency , 2001, NIPS.

[14]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.