Object Detection and Localization by Dynamic Template Warping

A simple method is presented for detecting, localizing and recognizing classes of objects, that accommodates wide variation in an object's pose. The method utilizes a small two-dimensional template that is warped into an image, and converts localization to a one-dimensional sub-problem, with the search for a match between image and template executed by dynamic programming. The method recovers three of the six degrees of freedom of motion (2 translation, 1 rotation), and accommodates two more DOF in the search process (1 rotation, 1 translation), and is extensible to the final DOF. Experiments demonstrate that the method provides an efficient search strategy that outperforms normalized correlation. This is demonstrated in the example domain of face detection and localization, and is extended to more general detection tasks. An additional technique recovers a rough object pose from the match results, and is used in a two stage recognition experiment using maximization of mutual information.

[1]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[2]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[3]  Takeo Kanade,et al.  Stereo by Intra- and Inter-Scanline Search Using Dynamic Programming , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Shimon Ullman,et al.  Structural Saliency: The Detection Of Globally Salient Structures using A Locally Connected Network , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[6]  William Rucklidge,et al.  Locating objects using the Hausdorff distance , 1995, Proceedings of IEEE International Conference on Computer Vision.

[7]  Paul A. Viola,et al.  Alignment by Maximization of Mutual Information , 1995, Proceedings of IEEE International Conference on Computer Vision.

[8]  PoggioTomaso,et al.  Example-Based Learning for View-Based Human Face Detection , 1998 .