Object-Oriented Television

In search of more compression, researchers have recently sought to describe digital video of real scenes not as sequences of frames but rather as collections of objects that are rendered and combined according to scripting information. Depending upon the application and the scene analysis tools available, representations may range from two-dimensional layers to full three-dimensional computer-graphics-style data bases. The significance of these more meaningful representations goes beyond compression, however, enabling new forms of interactivity and personalization, as well as new degrees of freedom in post-production. This paper proposes a computational framework for a television receiver that can handle digital video in forms from traditional motion-compensated transform coders to sets of three-dimensional objects and discusses the requirements for a scripting language to control such a receiver. It is also noted that the concept of scalability can be expanded to include intelligently resizable video, where the originator of a video sequence can specify how the scene is to be composed and cut for displays of differing sizes and aspect ratios.

[1]  V. Michael Bove,et al.  Real-time decoding and display of structured video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[2]  Roger G. Kermode,et al.  Coding for content: enhanced resolution from coding , 1995, Proceedings., International Conference on Image Processing.

[3]  A B Watson,et al.  Perceptual-components architecture for digital video. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[4]  Yuko Yamanouchi,et al.  A Virtual Studio System for TV Program Production , 1993 .

[5]  Jörn Ostermann,et al.  Object-oriented analysis-synthesis coding of moving images , 1989, Signal Process. Image Commun..

[6]  D. Westerkamp,et al.  The Digital Hierarchy — A Blueprint for Television in the 21st Century , 1992 .

[7]  Kenneth A. Parulski,et al.  Source-Adaptive Encoding Options for HDTV and NTSC , 1992 .

[8]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[9]  V. Bove,et al.  Semiautomatic 3D-model extraction from uncalibrated 2D-camera views , 1995 .

[10]  W. Stackhouse Report of the Task Force on Digital Image Architecture , 1992 .

[11]  Ewan A. Macpherson,et al.  A Computer Model of Binaural Localization for Stereo Imaging Measurement , 1989 .

[12]  Jake K. Aggarwal,et al.  On the computation of motion from sequences of images-A review , 1988, Proc. IEEE.

[13]  Andrew Lippman,et al.  Scalable open-architecture television , 1992 .