Feature compression: A framework for multi-view multi-person tracking in visual sensor networks

Visual sensor networks (VSNs) consist of image sensors, embedded processors and wireless transceivers which are powered by batteries. Since the energy and bandwidth resources are limited, setting up a tracking system in VSNs is a challenging problem. In this paper, we present a framework for human tracking in VSNs. The traditional approach of sending compressed images to a central node has certain disadvantages such as decreasing the performance of further processing (i.e., tracking) because of low quality images. Instead, we propose a feature compression-based decentralized tracking framework that is better matched with the further inference goal of tracking. In our method, each camera performs feature extraction and obtains likelihood functions. By transforming to an appropriate domain and taking only the significant coefficients, these likelihood functions are compressed and this new representation is sent to the fusion node. As a result, this allows us to reduce the communication in the network without significantly affecting the tracking performance. An appropriate domain is selected by performing a comparison between well-known transforms. We have applied our method for indoor people tracking and demonstrated the superiority of our system over the traditional approach and a decentralized approach that uses Kalman filter.

[1]  Amit K. Roy-Chowdhury,et al.  Robust Tracking in A Camera Network: A Multi-Objective Optimization Framework , 2008, IEEE Journal of Selected Topics in Signal Processing.

[2]  Ian F. Akyildiz,et al.  Wireless multimedia sensor networks: A survey , 2007, IEEE Wireless Communications.

[3]  Allen R. Hanson,et al.  Automatic resource allocation in a distributed camera network , 2010, Machine Vision and Applications.

[4]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[5]  Chao Yu,et al.  Camera Scheduling and Energy Allocation for Lifetime Maximization in User-Centric Visual Sensor Networks , 2010, IEEE Transactions on Image Processing.

[6]  Bernhard Rinner,et al.  Resource-Aware Coverage and Task Assignment in Visual Sensor Networks , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  António de Sousa Smart Cameras in Embedded Systems , 2020 .

[9]  Larry S. Davis,et al.  A General Method for Sensor Planning in Multi-Sensor Systems: Extension to Random Occlusion , 2007, International Journal of Computer Vision.

[10]  Larry S. Davis,et al.  Constraint Integration for Efficient Multiview Pose Estimation with Self-Occlusions , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[12]  Wolfgang Straßer,et al.  Adaptive Probabilistic Tracking Embedded in Smart Cameras for Distributed Surveillance in a 3D Model , 2007, EURASIP J. Embed. Syst..

[13]  Henry Medeiros,et al.  Distributed Object Tracking Using a Cluster-Based Kalman Filter in Wireless Camera Networks , 2008, IEEE Journal of Selected Topics in Signal Processing.

[14]  Pascal Fua,et al.  Multicamera People Tracking with a Probabilistic Occupancy Map , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Bülent Tavli,et al.  A survey of visual sensor network platforms , 2012, Multimedia Tools and Applications.

[16]  Henry Medeiros,et al.  Cluster-Based Distributed Face Tracking in Camera Networks , 2010, IEEE Transactions on Image Processing.

[17]  Aggelos K. Kutsuggelos OPTIMAL SENSOR SELECTION FOR VIDEO-BASED TARGET TRACKING IN A WIRELESS SENSOR NETWORK , 2004 .

[18]  Andrea Cavallaro,et al.  Distributed and Decentralized Multicamera Tracking , 2011, IEEE Signal Processing Magazine.

[19]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[20]  Hamid K. Aghajan,et al.  Color-Based Multiple Agent Tracking for Wireless Image Sensor Networks , 2006, ACIVS.

[21]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[22]  S. Mallat A wavelet tour of signal processing , 1998 .

[23]  Dariu Gavrila,et al.  Multi-view 3D human pose estimation combining single-frame recovery, temporal integration and model adaptation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Hwann-Tzong Chen,et al.  Real-time tracking using trust-region methods , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Eduardo Monari,et al.  Task-Oriented Object Tracking in Large Distributed Camera Networks , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[26]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[27]  Sufen Fong,et al.  MeshEye: A Hybrid-Resolution Smart Camera Mote for Applications in Distributed Intelligent Surveillance , 2007, 2007 6th International Symposium on Information Processing in Sensor Networks.

[28]  Jian Yao,et al.  Multi-Camera Multi-Person 3D Space Tracking with MCMC in Surveillance Scenarios , 2008, ECCV 2008.

[29]  Aggelos K. Katsaggelos,et al.  Optimal sensor selection for video-based target tracking in a wireless sensor network , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[30]  Anastasios N. Venetsanopoulos,et al.  Biorthogonal modified coiflet filters for image compression , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).