Non-Zero Diffusion Particle Flow SMC-PHD Filter for Audio-Visual Multi-Speaker Tracking

The sequential Monte Carlo probability hypothesis density (SMC-PHD) filter has shown promise for audio-visual multi-speaker tracking. Recently, the zero diffusion particle flow (ZPF) has been used to mitigate the weight degeneracy problem in the SMC-PHD filter. However, the ZPF substantially increases the computational cost, because it migrates particles from the prior to the posterior distribution using a partial differential equation. This paper proposes an alternative method based on the non-zero diffusion particle flow (NPF), which adjusts the particle states by fitting the particle distribution to the posterior probability density using non-zero diffusion. This property allows the particle migration to be computed efficiently. Results on the AV16.3 dataset demonstrate that the proposed method significantly mitigates the weight degeneracy problem at a lower computational cost than the ZPF-based SMC-PHD filter.
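
To make the weight degeneracy problem mentioned above concrete, the following is a minimal sketch, not the method of this paper: it draws particles from a prior, weights them with a Gaussian likelihood, and reports the effective sample size (ESS), a common degeneracy diagnostic. The one-dimensional state, the Gaussian prior and likelihood, and all names and parameter values are assumptions chosen purely for illustration.

```python
import numpy as np


def normalized_weights(log_weights):
    """Normalize weights given in the log domain (subtracting the max avoids underflow)."""
    log_weights = np.asarray(log_weights, dtype=float)
    w = np.exp(log_weights - log_weights.max())
    return w / w.sum()


def effective_sample_size(weights):
    """ESS = 1 / sum(w_i^2) for normalized weights; equals N when weights are uniform."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return 1.0 / np.sum(w ** 2)


def gaussian_log_likelihood(particles, measurement, sigma):
    """Unnormalized Gaussian log-likelihood of a scalar measurement (illustrative model)."""
    return -0.5 * ((particles - measurement) / sigma) ** 2


# Hypothetical setup: 500 particles from a broad prior, one scalar measurement.
rng = np.random.default_rng(0)
particles = rng.normal(loc=0.0, scale=2.0, size=500)
measurement = 3.0

# A broad likelihood spreads weight over many particles.
ess_broad = effective_sample_size(
    normalized_weights(gaussian_log_likelihood(particles, measurement, sigma=2.0))
)
# A sharp likelihood concentrates weight on the few particles near the measurement.
ess_sharp = effective_sample_size(
    normalized_weights(gaussian_log_likelihood(particles, measurement, sigma=0.05))
)

print(f"ESS with broad likelihood: {ess_broad:.1f} of {particles.size}")
print(f"ESS with sharp likelihood: {ess_sharp:.1f} of {particles.size}")
```

When the likelihood is sharp relative to the prior, only a handful of particles carry meaningful weight and the ESS collapses. Particle flow methods such as the ZPF and NPF address this by moving the particles toward the posterior rather than relying on reweighting alone.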
