论文信息 - Accuracy versus speed in context-based object detection

Accuracy versus speed in context-based object detection

The visual detection and recognition of objects is facilitated by context. This paper studies two types of learning methods for realizing context-based object detection in paintings. The first method is called the gradient method; it learns to transform the spatial context into a gradient towards the object. The second method, the context-detection method, learns to detect image regions that are likely to contain objects. The accuracy and speed of both methods are evaluated on a face-detection task involving natural and painted faces in a wide variety of contexts. The experimental results show that the gradient method enhances accuracy at the cost of computational speed, whereas the context-detection method optimises speed at the cost of accuracy. The different results of both methods are argued to arise from the different ways in which the methods trade-off accuracy and speed. We conclude that both the gradient method and the context-detection method can be applied to reliable and fast object detection in paintings and that the choice for either method depends on the application and user constraints.

Eric O. Postma | H. Jaap van den Herik | Niek Bergboer

[1] Paul A. Viola,et al. Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[2] Takeo Kanade,et al. A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[3] W. R. Howard. The Nature of Mathematical Modeling , 2006 .

[4] Rainer Lienhart,et al. An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[5] Eric O. Postma,et al. Discovering the Visual Signature of Painters , 2000 .

[6] Tomaso A. Poggio,et al. A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[7] Tomaso A. Poggio,et al. Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8] Eric O. Postma,et al. A Context-Based Model of Attention , 2004, ECAI.

[9] Robert V. Brill,et al. Applied Statistics and Probability for Engineers , 2004, Technometrics.

[10] David G. Stork,et al. Pattern Classification , 1973 .

[11] S. Coren,et al. In Sensation and perception , 1979 .

[12] Leslie S. Smith,et al. The principal components of natural images , 1992 .

[13] Antonio Torralba,et al. Contextual Modulation of Target Saliency , 2001, NIPS.

[14] Yoav Freund,et al. Experiments with a New Boosting Algorithm , 1996, ICML.