Salient video stills: content and context preserved

A new class of images called salient stills is demonstrated and a software development platform for their creation is discussed. These images do not represent one discrete moment of time, as do a photograph or single video frame. Rather, one image reflects the aggregate of the temporal changes that occur in a moving image sequence with the salient features preserved. By the application of an affine transformation and non-linear temporal processing, multiple frames of an image sequence, which may include variations in focal-length or field-of-view, are combined to create a single still image. The still image may have multi-resolution patches, a larger field-of-view, or higher overall resolution than any individual frame in the original image sequence. It may also contain selected salient objects from any one of the sequence of video frames. The still can be created automatically or with user intervention. A by-product of the salient still process is a structured representation of moving image data.

[1]  J. Limb,et al.  Estimating the Velocity of Moving Images in Television Signals , 1975 .

[2]  G. W. Furnas,et al.  Generalized fisheye views , 1986, CHI '86.

[3]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[4]  Richard Mander,et al.  A “pile” metaphor for supporting casual organization of information , 1992, CHI.

[5]  Andrew Lippman,et al.  Feature sets for interactive images , 1991, CACM.

[6]  Edward H. Adelson,et al.  PYRAMID METHODS IN IMAGE PROCESSING. , 1984 .

[7]  Henry Neil Holtzman,et al.  Three-dimensional representations of video using knowledge based estimation , 1991 .

[8]  Jock D. Mackinlay,et al.  Cone Trees: animated 3D visualizations of hierarchical information , 1991, CHI.

[9]  Steven Yelick,et al.  Anamorphic image processing , 1980 .

[10]  Walter Bender,et al.  Salient stills , 1992, CHI '92.

[11]  Shmuel Peleg,et al.  Computing two motions from three frames , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[12]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[13]  G. Elliot Multiple views of digital video , 1992 .

[14]  Patrick Campbell McLean,et al.  Structured video coding , 1991 .

[15]  Manojit Sarkar,et al.  Graphical fisheye views of graphs , 1992, CHI.

[16]  Jock D. Mackinlay,et al.  The perspective wall: detail and context smoothly integrated , 1991, CHI.

[17]  Walter Bender,et al.  Newspace: Mass Media and Personal Computing , 1991, USENIX Summer.

[18]  Robert Mohl Cognitive space in the interactive movie map : an investigation of spatial learning in virtual environments , 1981 .

[19]  Andrew Lippman,et al.  Movie-maps: An application of the optical videodisc to computer graphics , 1980, SIGGRAPH '80.

[20]  Michael Mills,et al.  Panoramic overviews for navigating real-world scenes , 1993, MULTIMEDIA '93.