论文信息 - Generating a Time Shrunk Lecture Video by Event Detection

Generating a Time Shrunk Lecture Video by Event Detection

Streaming a lecture video via the Internet is important for e-learning. We have developed a system that generates a lecture video using virtual camerawork based on shooting techniques of broadcast cameramen. However, viewing a full-length video takes time for students. In this paper, we propose a method for generating a time shrunk lecture video using event detection. We detect two kinds of events: a speech period and a chalkboard writing period. A speech period is detected by voice activity detection with LPC cepstrum and classified into speech or non-speech using Mahalanobis distance. To detect chalkboard writing periods, we use a graph cuts technique to segment a precise region of interests such as an instructor. By deleting content-free periods, i.e, period without the events of speech and writing, and fast-forwarding writing periods, our method can generate a time shrunk lecture video automatically. The resulting generated video is about 20%~30% shorter than the original video in time. This is almost the same as the results of manual editing by a human operator

Hironobu Fujiyoshi | Takao Yokoi

[1] John R. Kender,et al. Analysis and enhancement of videos of electronic slide presentations , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[2] Hironobu Fujiyoshi,et al. Virtual camerawork for generating lecture video from high resolution images , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[3] Mitsuho Yamada,et al. Sensing of Feeling and "Kansei". Analysis of the Work and Eye Movement of Broadcasting-Studio Cameramen. , 1995 .

[4] Koichi Miura,et al. Motion Based Automatic Abstraction of Cooking Videos , 2003 .

[5] Michael A. Smith,et al. Video skimming and characterization through the combination of image and language understanding techniques , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[7] Hironobu Fujiyoshi,et al. Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[8] Kentaro Ishizuka,et al. Speech and Video Indexing on Automatic Lecture Recording System , 2000 .