Event prediction in a hybrid camera network

Given a hybrid camera layout (one containing, for example, both static and active cameras) and people moving through it along established traffic patterns, our goal is to predict the subset of cameras, their respective parameter settings, and the future time windows that are most likely to lead to success in a vision task, such as face recognition, when a camera observes an event of interest. We propose an adaptive probabilistic model that accrues temporal correlations between cameras as the cameras report observed events. No extrinsic, intrinsic, or color calibration of the cameras is required. We efficiently obtain the camera parameter predictions using a modified Sequential Monte Carlo method. We demonstrate the performance of the model in an example face detection scenario, in both simulated and real-environment experiments, using several active cameras.
