Beyond the Static Camera: Issues and Trends in Active Vision

Maximizing both the area coverage and the resolution per target is highly desirable in many applications of computer vision. However, with a limited number of cameras viewing a scene, the two objectives are contradictory. This chapter is dedicated to active vision systems, trying to achieve a trade-off between these two aims and examining the use of high-level reasoning in such scenarios. The chapter starts by introducing different approaches to active cameras configurations. Later, a single active camera system to track a moving object is developed, offering the reader first-hand understanding of the issues involved. Another section discusses practical considerations in building an active vision platform, taking as an example a multi-camera system developed for a European project. The last section of the chapter reflects upon the future trends of using semantic factors to drive smartly coordinated active systems.

[1]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[2]  Helder Araújo,et al.  A surveillance system combining peripheral and foveated motion tracking , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[3]  Sven Wachsmuth,et al.  Integration and Coordination in a Cognitive Vision System , 2006, Fourth IEEE International Conference on Computer Vision Systems (ICVS'06).

[4]  Eric Sommerlade,et al.  Information-theoretic active scene exploration , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Ian D. Reid,et al.  Driving saccade to pursuit using image motion , 1995, International Journal of Computer Vision.

[6]  Luc Van Gool,et al.  A distributed camera system for multi-resolution surveillance , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).

[7]  Simone Calderara,et al.  Bayesian-Competitive Consistent Labeling for People Surveillance , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Demetri Terzopoulos,et al.  Surveillance in Virtual Reality: System Design and Multi-Camera Control , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Stan Sclaroff,et al.  Look there! Predicting where to look for motion in an active camera network , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[10]  L. Van Gool,et al.  Event-Based Tracking Evaluation Metric , 2008, 2008 IEEE Workshop on Motion and video Computing.

[11]  Joachim Denzler,et al.  Information theoretic focal length selection for real-time active 3D object tracking , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Greg Welch,et al.  Welch & Bishop , An Introduction to the Kalman Filter 2 1 The Discrete Kalman Filter In 1960 , 1994 .

[13]  Alberto Del Bimbo,et al.  Improving evidential quality of surveillance imagery through active face tracking , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[14]  Sharath Pankanti,et al.  Face cataloger: multi-scale imaging for relating identity to location , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..

[15]  F. Xavier Roca,et al.  Robust and Efficient Multipose Face Detection Using Skin Color Segmentation , 2009, IbPRIA.

[16]  H.-H. Nagel,et al.  Representation of occurrences for road vehicle traffic , 2008, Artif. Intell..

[17]  T. Kanade,et al.  A master-slave system to acquire biometric imagery of humans at distance , 2003, IWVS '03.

[18]  F. Xavier Roca,et al.  Understanding dynamic scenes based on human sequence evaluation , 2009, Image Vis. Comput..

[19]  C. Diehl,et al.  Scheduling an active camera to observe people , 2004, VSSN '04.

[20]  David W. Murray,et al.  A method of reactive zoom control from uncertainty in tracking , 2007, Comput. Vis. Image Underst..

[21]  J.-J. Wang,et al.  Face Image Resolution versus Face Recognition Performance Based on Two Global Methods , 2004 .

[22]  Alberto Del Bimbo,et al.  Exploiting distinctive visual landmark maps in pan-tilt-zoom camera networks , 2010, Comput. Vis. Image Underst..

[23]  Mubarak Shah,et al.  Integrating multiple levels of zoom to enable activity analysis , 2006, Comput. Vis. Image Underst..

[24]  Qiang Ji,et al.  Facial expression understanding in image sequences using dynamic and active visual information fusion , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[25]  Fatih Murat Porikli,et al.  Collaborative tracking of objects in EPTZ cameras , 2007, Electronic Imaging.

[26]  Juan C. Cockburn,et al.  Dual Camera Zoom Control: A Study of Zoom Tracking Stability , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[27]  Greg Welch,et al.  A Stochastic Quality Metric for Optimal Control of Active Camera Network Configurations for 3 D Computer Vision Tasks , 2008 .

[28]  F. Xavier Roca,et al.  Reactive Object Tracking with a Single PTZ Camera , 2010, 2010 20th International Conference on Pattern Recognition.

[29]  A. Çapar,et al.  License Plate Recognition From Still Images and Video Sequences: A Survey , 2008, IEEE Transactions on Intelligent Transportation Systems.

[30]  Nicu Sebe,et al.  Facial expression recognition from video sequences: temporal and static modeling , 2003, Comput. Vis. Image Underst..

[31]  Demetri Terzopoulos,et al.  Multi-camera Control through Constraint Satisfaction for Persistent Surveillance , 2008, 2008 IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance.