3D Target Scale Estimation for Size Preserving in PTZ Video Tracking

In size preserving video tracking, the camera's focal length (zoom) is adjusted automatically to compensate for the changes in the target's image size caused by the relative motion between the camera and the target. The accurate estimation of these changes is paramount to the system performance. Structure from motion (SFM) based on the weak perspective projection model has been applied to real time target scale estimation (B. Tordoff, et al., 2004). In this paper we design target scale estimation algorithms with linear solutions based on more advanced projection models: paraperspective and perspective. The performances of the algorithms using three projection models are examined and compared. Experimental results show that the proposed algorithm based on the paraperspective projection model produces the best performance with respect to accuracy, complexity, and robustness.

[1]  Mongi A. Abidi,et al.  Automatic face region tracking for highly accurate face recognition in unconstrained environments , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..

[2]  Robert T. Collins,et al.  Mean-shift blob tracking through scale space , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[3]  David W. Murray,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Reactive Control of Zoom While Fixating Using Perspective and Affine Cameras , 2022 .

[4]  Harry Shum,et al.  Constrained planar motion analysis by decomposition , 2004, Image Vis. Comput..

[5]  Rama Chellappa,et al.  A Factorization Method for Structure from Planar Motion , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[6]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, ECCV.

[7]  Jie Wei,et al.  On Active Camera Control and Camera Motion Recovery with Foveate Wavelet Transform , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  José Santos-Victor,et al.  A closed-form solution for paraperspective reconstruction , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[9]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[10]  Ehud Rivlin,et al.  Zoom tracking and its applications , 2001, Machine Vision and Applications.