Non-Zero Diffusion Particle Flow SMC-PHD Filter for Audio-Visual Multi-Speaker Tracking

The sequential Monte Carlo probability hypothesis density (SMC-PHD) filter has shown promise for audio-visual multi-speaker tracking. Recently, the zero diffusion particle flow (ZPF) has been used to mitigate the weight degeneracy problem in the SMC-PHD filter. However, ZPF substantially increases the computational cost, because the particles are migrated from the prior to the posterior distribution by means of a partial differential equation. This paper proposes an alternative method based on the non-zero diffusion particle flow (NPF), which adjusts the particle states by using the non-zero diffusion to fit the particle distribution to the posterior probability density. This property allows the migration of particles to be computed efficiently. Results on the AV16.3 dataset show that the proposed method significantly mitigates the weight degeneracy problem at a lower computational cost than the ZPF-based SMC-PHD filter.
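
The weight degeneracy problem mentioned above is commonly quantified with the effective sample size (ESS) of the particle weights. The following is a minimal sketch of that standard diagnostic, not the evaluation procedure of this paper; the function name `effective_sample_size` and the example weight vectors are assumptions introduced for illustration. Because SMC-PHD weights sum to the expected number of targets rather than to one, the sketch normalises them before computing the ESS.

```python
import numpy as np


def effective_sample_size(weights):
    """Return the effective sample size 1 / sum(w_i^2) of normalised weights.

    Illustrative helper (hypothetical name, not taken from the paper).
    The weights are normalised to sum to one first, so unnormalised
    PHD-style weights can be passed in directly.
    """
    w = np.asarray(weights, dtype=float)
    total = w.sum()
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    w = w / total
    return 1.0 / np.sum(w ** 2)


# Evenly spread weights: every particle contributes equally.
uniform = np.full(100, 0.01)

# Degenerate weights: one particle carries almost all of the mass.
degenerate = np.array([0.99] + [0.01 / 99] * 99)

print(effective_sample_size(uniform))
print(effective_sample_size(degenerate))
```

An ESS close to the number of particles indicates well-spread weights, while an ESS close to one indicates severe degeneracy, which is the situation that particle flow methods such as ZPF and NPF aim to avoid.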
