An efficient two-pass MAP-MRF algorithm for motion estimation based on mean field theory

This paper presents a two-pass algorithm for estimating motion vectors from image sequences. In the proposed algorithm, the motion estimation is formulated as a problem of obtaining the maximum a posteriori in the Markov random field (MAP-MRF). An optimization method based on the mean field theory (MFT) is opted to conduct the MAP search. The estimation of motion vectors is modeled by only two MRFs, namely, the motion vector field and unpredictable field. Instead of utilizing the line field, a truncation function is introduced to handle the discontinuity between the motion vectors on neighboring sites. In this algorithm, a "double threshold" preprocessing pass is first employed to partition the sites into three regions, whereby the ensuing MPT-based pass for each MRF is conducted on one or two of the three regions. With this algorithm, no significant difference exists between the block-based and pixel-based MAP searches any more. Consequently, a good compromise between precision and efficiency can be struck with ease. To render our algorithm more resilient against noise, the mean absolute difference instead of mean square error is selected as the measure of difference, which is more reliable according to the knowledge of robust statistics. This is supported by our experimental results from both synthetic and real-world image sequences. The proposed two-pass algorithm is much faster than any other MAP-MRF motion estimation method reported in the literature so far.

[1]  Eric Dubois,et al.  Bayesian Estimation of Motion Vector Fields , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Michael J. Black,et al.  On the unification of line processes , 1996 .

[3]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[4]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[5]  R. Srinivasan,et al.  Predictive Coding Based on Efficient Motion Estimation , 1985, IEEE Trans. Commun..

[6]  Allen Gersho,et al.  Maximum a posteriori decision and evaluation of class probabilities by Boltzmann perceptron classifiers , 1990 .

[7]  Carsten Peterson,et al.  A Mean Field Theory Learning Algorithm for Neural Networks , 1987, Complex Syst..

[8]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[9]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[10]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Sohail Zafar,et al.  Motion-compensated wavelet transform coding for color video compression , 1992, IEEE Trans. Circuits Syst. Video Technol..

[12]  Andrew Blake,et al.  Visual Reconstruction , 1987, Deep Learning for EEG-Based Brain–Computer Interfaces.

[13]  Jun Zhang,et al.  The application of mean field theory to image motion estimation , 1995, IEEE Trans. Image Process..

[14]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[15]  D. Chandler,et al.  Introduction To Modern Statistical Mechanics , 1987 .

[16]  Antonio Ortega,et al.  Stereo image compression with disparity compensation using the MRF model , 1996, Other Conferences.

[17]  Rama Chellappa,et al.  Multiresolution Gauss-Markov random field models for texture segmentation , 1997, IEEE Trans. Image Process..

[18]  Refractor Vision , 2000, The Lancet.

[19]  T Koga,et al.  MOTION COMPENSATED INTER-FRAME CODING FOR VIDEO CONFERENCING , 1981 .

[20]  P. Pérez,et al.  Parallel visual motion analysis using multiscale Markov random fields , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[21]  P. B. Coaker,et al.  Applied Dynamic Programming , 1964 .

[22]  Basilis Gidas,et al.  A Renormalization Group Approach to Image Processing Problems , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[24]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[25]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[26]  H. B. Mitchell Markov Random Fields , 1982 .

[27]  오승준 [서평]「Digital Video Processing」 , 1996 .

[28]  Christoph Stiller,et al.  Object-based estimation of dense motion fields , 1997, IEEE Trans. Image Process..

[29]  D. Ruppert Robust Statistics: The Approach Based on Influence Functions , 1987 .

[30]  Jie Wei,et al.  Illumination-invariant color object recognition via compressed chromaticity histograms of color-channel-normalized images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[31]  J. M. Hammersley,et al.  Markov fields on finite graphs and lattices , 1971 .

[32]  Jie Wei,et al.  Motion compensation in color video with illumination variations , 1997, Proceedings of International Conference on Image Processing.

[33]  Jun Zhang The mean field theory in EM procedures for Markov random fields , 1992, IEEE Trans. Signal Process..