Abstract : In this report we present an approach to low bitrate video teleconferencing by focusing attention on important information. We show that by selectively degrading the quality of less important regions, more important regions can be sent without loss of quality but with greatly reduced bandwidth requirements. Low bitrate transmission for real-time video delivery over a dynamic network is achieved by region blurring and cropping. A prototype system has been developed to demonstrate the concept. We assume that a human facial area is the most interesting area in the system. The system can automatically focus its attention on a given face and its adjustable surrounding area. The selected area is then fed to the coding system and sent to the receiver. The selection function of the system is fulfilled by a real-time face tracker. The face being tracked can be selected by a mouse or finger pointing if a touch screen is used. It can track a person's face while the person moves freely (e.g., walking, jumping, sitting and rising) in a room. Based on the information provided by the face tracker and the network traffic, a window surrounding the face can be determined. The window size can reflect the network traffic. The image outside of the window will be either cropped or blurred. The preprocessed image is then fed to a tele-conferencing software package-vic, a real-time multimedia application for video conferencing over the Internet. The experimental results show significant savings of required bandwidth for video subjected to the changes.
[1]
S Ullman,et al.
Shifts in selective visual attention: towards the underlying neural circuitry.
,
1985,
Human neurobiology.
[2]
T. Poggio,et al.
Spotlight on attention
,
1985,
Trends in Neurosciences.
[3]
Haibo Li,et al.
Image sequence coding at very low bit rates: a review
,
1994,
IEEE Trans. Image Process..
[4]
A. Treisman,et al.
A feature-integration theory of attention
,
1980,
Cognitive Psychology.
[5]
D. Broadbent.
Task combination and selective intake of information.
,
1982,
Acta psychologica.
[6]
G. Wyszecki,et al.
Color Science Concepts and Methods
,
1982
.
[7]
Allen Allport,et al.
Visual attention
,
1989
.
[8]
Alex Waibel,et al.
Tracking Human Faces in Real-Time,
,
1995
.
[9]
C. Mozer.
A connectionist m o d e l of selective attention in visual perception
,
2020
.
[10]
Steven McCanne,et al.
vic: a flexible framework for packet video
,
1995,
MULTIMEDIA '95.
[11]
C. Bundesen.
A theory of visual attention.
,
1990,
Psychological review.
[12]
Kiyoharu Aizawa,et al.
Model-based image coding advanced video coding techniques for very low bit-rate applications
,
1995,
Proc. IEEE.