Accurate pose estimation for forensic identification

In forensic authentication, one aims to identify the perpetrator among a series of suspects or distractors. A fundamental problem in any recognition system that aims for identification of subjects in a natural scene is the lack of constrains on viewing and imaging conditions. In forensic applications, identification proves even more challenging, since most surveillance footage is of abysmal quality. In this context, robust methods for pose estimation are paramount. In this paper we will therefore present a new pose estimation strategy for very low quality footage. Our approach uses 3D-2D registration of a textured 3D face model with the surveillance image to obtain accurate far field pose alignment. Starting from an inaccurate initial estimate, the technique uses novel similarity measures based on the monogenic signal to guide a pose optimization process. We will illustrate the descriptive strength of the introduced similarity measures by using them directly as a recognition metric. Through validation, using both real and synthetic surveillance footage, our pose estimation method is shown to be accurate, and robust to lighting changes and image degradation.

[1]  Hassan Foroosh,et al.  Extension of phase correlation to subpixel registration , 2002, IEEE Trans. Image Process..

[2]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[3]  David J. Fleet,et al.  Computation of component image velocity from local phase information , 1990, International Journal of Computer Vision.

[4]  Michael Felsberg,et al.  Structure Multivector for Local Analysis of Images , 2000, Theoretical Foundations of Computer Vision.

[5]  M. J. D. Powell,et al.  Direct search algorithms for optimization calculations , 1998, Acta Numerica.

[6]  Laurence C. Breaker,et al.  A Proposed Definition for Vector Correlation in Geophysics: Theory and Application , 1993 .

[7]  Nicholas Ayache,et al.  The Correlation Ratio as a New Similarity Measure for Multimodal Image Registration , 1998, MICCAI.

[8]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Ivo B Alberink,et al.  2D/3D image (facial) comparison using camera matching. , 2006, Forensic science international.

[10]  Michael Brady,et al.  On the Choice of Band-Pass Quadrature Filters , 2004, Journal of Mathematical Imaging and Vision.

[11]  Colin Studholme,et al.  An overlap invariant entropy measure of 3D medical image alignment , 1999, Pattern Recognit..

[12]  Heiko Neumann,et al.  Detection of Head Pose and Gaze Direction for Human-Computer Interaction , 2006, PIT.

[13]  Michael Brady,et al.  The Use of Multi-scale Monogenic Signal on Structure Orientation Identification and Segmentation , 2006, Digital Mammography / IWDM.

[14]  Michael Felsberg,et al.  Optical Flow Estimation from Monogenic Phase , 2004, IWCM.

[15]  Hans Knutsson,et al.  Phase-based multidimensional volume registration , 2000, IEEE Transactions on Medical Imaging.

[16]  Michael Brady,et al.  Phase mutual information as a similarity measure for registration , 2005, Medical Image Anal..

[17]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  M. Concetta Morrone,et al.  An adaptive approach to scale selection for line and edge detection , 1995, Pattern Recognit. Lett..

[19]  Larry S. Davis,et al.  Model-based object pose in 25 lines of code , 1992, International Journal of Computer Vision.

[20]  Marco La Cascia,et al.  Fast, reliable head tracking under varying illumination , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[21]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[22]  C. Morandi,et al.  Registration of Translated and Rotated Images Using Finite Fourier Transforms , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Hans-Peter Seidel,et al.  Robust Pose Estimation with 3D Textured Models , 2006, PSIVT.

[24]  A.V. Oppenheim,et al.  The importance of phase in signals , 1980, Proceedings of the IEEE.

[25]  Zeno Geradts,et al.  Forensic audio and visual evidence 2004-2007 : A Review , 2007 .

[26]  K. Mardia,et al.  A general correlation coefficient for directional data and related regression problems , 1980 .

[27]  Barbara Zitová,et al.  How to measure the pose robustness of object views , 2002, Image Vis. Comput..

[28]  Michael Felsberg,et al.  A New Extension of Linear Signal Processing for Estimating Local Properties and Detecting Features , 2000, DAGM-Symposium.