Improving the performance of MPEG compatible encoders using on line retrainable neural networks

On-line retraining of a neural network is introduced for extracting foreground and background objects in video sequences. The scheme is applied together with a modification of the rate control of the MPEG-1 algorithm. The proposed method is compatible with the MPEG-1/2 standards and can also serve as a pre-coding stage for the forthcoming MPEG-4 algorithm. Simulation studies show an average PSNR improvement of about 1.5 dB over the conventional MPEG-1 encoder.
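To put the reported PSNR gain in context, the following is a minimal sketch of how PSNR is conventionally computed for 8-bit samples, and of what a gain in dB implies for mean squared error. The function names `psnr` and `mse_ratio_for_gain` are illustrative and do not come from the paper; the abstract does not specify the exact evaluation procedure, so this assumes the standard definition with a peak value of 255.

```python
import math


def psnr(original, decoded, peak=255):
    """PSNR in dB between two equal-length sequences of pixel values.

    Assumes the standard definition: 10 * log10(peak^2 / MSE).
    """
    if len(original) != len(decoded) or len(original) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((a - b) ** 2 for a, b in zip(original, decoded)) / len(original)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10 * math.log10(peak ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain in dB."""
    return 10 ** (-gain_db / 10)


if __name__ == "__main__":
    # Hypothetical pixel values, for illustration only.
    reference = [52, 55, 61, 66, 70, 61, 64, 73]
    reconstructed = [54, 55, 60, 67, 69, 62, 64, 71]
    print(f"PSNR: {psnr(reference, reconstructed):.2f} dB")
    print(f"MSE ratio for a 1.5 dB gain: {mse_ratio_for_gain(1.5):.3f}")
```

Under this definition, a 1.5 dB gain corresponds to the MSE falling to roughly 0.71 of its original value, that is, a reduction of about 29%.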
