Efficient automatic analysis of camera work and microsegmentation of video using spatiotemporal images

Abstract The shot has been regarded as a fundamental unit for the application of digital manipulation to a video. Various techniques have been developed to detect automatically shot changes. But a sequence shot can be so long and complex that it has to be further decomposed into smaller units for more flexible and detailed manipulation. A sequence shot can be segmented into shot segments, each of which keeps a homogeneous camera motion. Camera work has important significance that reflect the intention of video producers. Camera work analysis and segmentation of a sequence shot into shot segments can help in choosing a representative image for a shot. Following concepts introduced by Tonomura et al. (1993), we propose an efficient method for the automatic detection of camera work changes using spatiotemporal images called X-ray images. We introduce various steps in the spatiotemporal image analysis process which significantly improves its robustness and decreases its computational complexity.

[1]  Glorianna Davenport,et al.  The Stratification System - A Design Emvironment for Random Access , 1992, NOSSDAV.

[2]  Philippe Aigrain,et al.  The automatic real-time analysis of film editing and transition effects and its applications , 1994, Comput. Graph..

[3]  Thomas D. C. Little,et al.  Video scene decomposition with the motion picture parser , 1994, Electronic Imaging.

[4]  R. L. Baker,et al.  Global zoom/pan estimation and compensation for video compression , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Hideo Hashimoto,et al.  Video indexing using motion vectors , 1992, Other Conferences.

[6]  Masahiro Shibata,et al.  Temporal segmentation method for video sequence , 1992, Other Conferences.

[7]  Philippe Aigrain,et al.  Representation-based user interfaces for the audiovisual library of the year 2000 , 1995, Electronic Imaging.

[8]  Natalio Pincever,et al.  Parsing Movies in Context , 1991, USENIX Summer.

[9]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[10]  Yoshinobu Tonomura,et al.  VideoMAP and VideoSpaceIcon: tools for anatomizing video content , 1993, INTERCHI.

[11]  Yelena Yesha,et al.  Digital Libraries Current Issues , 1995, Lecture Notes in Computer Science.

[12]  Dominique Villain Le montage au cinéma , 1991 .

[13]  Anil K. Jain Fundamentals of Digital Image Processing , 2018, Control of Color Imaging Systems.