Immersion into Visual Media: New Applications of Image Understanding

Using images to interface machines and humans offers an exciting new application domain for image understanding (IU). This article describes two such applications: digital libraries that bring visual data to the user, and virtualized reality that brings users to the visual data. The digital library calls on IU techniques such as scene segmentation and content analysis, and joins together natural language analysis and IU. Virtualized reality uses precise, dense, and video-rate stereo reconstruction techniques.

[1]  Michael L. Mauldin Conceptual Information Retrieval , 1991 .

[2]  John P. McDermott,et al.  Rule-Based Interpretation of Aerial Imagery , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Takeo Kanade,et al.  A multiple-baseline stereo , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Michael Loren Mauldin,et al.  Information retrieval by text skimming , 1989 .

[5]  Mei-Yuh Hwang,et al.  Predicting unseen triphones with senones , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Takeo Kanade,et al.  Human Face Detection in Visual Scenes , 1995, NIPS.

[7]  Takeo Kanade,et al.  Development of a video-rate stereo machine , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.