A neural network based scheme for unsupervised video object segmentation

We propose a neural network based scheme for performing unsupervised video object segmentation, especially for videophone or videoconferencing applications. The procedure includes (a) a training algorithm for adapting the network weights to the current condition, (b) a maximum a posteriori (MAP) estimation procedure for optimally selecting the most representative data of the current environment as retraining data and (c) a decision mechanism for determining when network retraining should be activated. The training algorithm takes into consideration both the former and the current network knowledge in order to achieve good generalization. The MAP estimation procedure models the network output as a Markov random field (MRF) and optimally selects the set of training inputs and corresponding desired outputs, using initial estimates of the human face and body. Finally, a verification mechanism is introduced which augments the training data, exploiting information of the previous and current environment.

[1]  Jenq-Neng Hwang,et al.  Neural networks for intelligent multimedia processing , 1998 .

[2]  Montse Pardàs,et al.  Hierarchical morphological segmentation for image sequence coding , 1994, IEEE Trans. Image Process..

[3]  Jenq-Neng Hwang,et al.  Neural networks for intelligent multimedia processing , 1997 .

[4]  Stuart German,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1988 .

[5]  Alex Pentland,et al.  Probabilistic Visual Learning for Object Representation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Stefanos Kollias,et al.  Retrainable neural networks for image analysis and classification , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[8]  Thomas Sikora,et al.  The MPEG-4 video standard verification model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[9]  Touradj Ebrahimi,et al.  Dynamic approach to visual data compression , 1997, IEEE Trans. Circuits Syst. Video Technol..

[10]  Stefanos D. Kollias,et al.  Low bit-rate coding of image sequences using adaptive regions of interest , 1998, IEEE Trans. Circuits Syst. Video Technol..