Generating a Time Shrunk Lecture Video by Event Detection

Streaming a lecture video via the Internet is important for e-learning. We have developed a system that generates a lecture video using virtual camerawork based on shooting techniques of broadcast cameramen. However, viewing a full-length video takes time for students. In this paper, we propose a method for generating a time shrunk lecture video using event detection. We detect two kinds of events: a speech period and a chalkboard writing period. A speech period is detected by voice activity detection with LPC cepstrum and classified into speech or non-speech using Mahalanobis distance. To detect chalkboard writing periods, we use a graph cuts technique to segment a precise region of interests such as an instructor. By deleting content-free periods, i.e, period without the events of speech and writing, and fast-forwarding writing periods, our method can generate a time shrunk lecture video automatically. The resulting generated video is about 20%~30% shorter than the original video in time. This is almost the same as the results of manual editing by a human operator

[1]  John R. Kender,et al.  Analysis and enhancement of videos of electronic slide presentations , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[2]  Hironobu Fujiyoshi,et al.  Virtual camerawork for generating lecture video from high resolution images , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[3]  Mitsuho Yamada,et al.  Sensing of Feeling and "Kansei". Analysis of the Work and Eye Movement of Broadcasting-Studio Cameramen. , 1995 .

[4]  Koichi Miura,et al.  Motion Based Automatic Abstraction of Cooking Videos , 2003 .

[5]  Michael A. Smith,et al.  Video skimming and characterization through the combination of image and language understanding techniques , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[7]  Hironobu Fujiyoshi,et al.  Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[8]  Kentaro Ishizuka,et al.  Speech and Video Indexing on Automatic Lecture Recording System , 2000 .