Detecting camera movements & production effects in digital videos

Numerous algorithms have been proposed in the last decade, for detecting low level features and camera motion in video sequences. Along with different camera movements such as pan, tilt, and zoom, there may be some jitter movements due to hand shaking. Differentiating these jitter movements from actual camera motions is a very challenging task and is the main cause of false detections in most of the processes. Comparison of the work of different research groups working in this area can be seen from TRECVID 2005 & 2006 results. Low Precision and Recall of different methods is a clear evident that no method is suitable for all types of video to detect different camera movements. Further many methods are either computation intensive or too much false detection reduces their efficiency. A method based on region matching using area correlation is proposed to differentiate the different camera movements. Different regions are classified based on their statistical similarity in each frame of video. For each region in two consecutive frames, area correlation function is calculated. This area correlation energy function is used to detect the similarity among the regions. Weighted sum of correlation function in all the mismatched regions and location of these are used to classify the camera movements in video into different types such as pan, tilt, zoom etc. This method is tested on different videos and reasonable performance is achieved. It can also be used as pre-classifier in prior to video transition detection methods to reduce their false detections.

[1]  Brian V. Funt,et al.  Color Constant Color Indexing , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Ullas Gargi,et al.  Performance characterization and comparison of video indexing algorithms , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[3]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[4]  Rainer Lienhart,et al.  Reliable Transition Detection in Videos: A Survey and Practitioner's Guide , 2001, Int. J. Image Graph..

[5]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying production effects , 1999, Multimedia Systems.

[6]  André Zaccarin,et al.  A system for reliable dissolve detection in videos , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[7]  Ramesh C. Jain,et al.  Digital video segmentation , 1994, MULTIMEDIA '94.

[8]  Rainer Lienhart,et al.  Comparison of automatic shot boundary detection algorithms , 1998, Electronic Imaging.

[9]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[10]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[11]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[12]  Ba Tu Truong,et al.  Improved fade and dissolve detection for reliable video segmentation , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[13]  Wolfgang Effelsberg,et al.  On the detection and recognition of television commercials , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.