论文信息 - Integrating Text and Face Detection for Finding Informative Poster Frames

Integrating Text and Face Detection for Finding Informative Poster Frames

Digital video is rapidly becoming an important source of information and entertainment, and is used in a host of multimedia applications. With the size of digital video collections growing to many thousands of hours, technology is needed to allow rapid browsing of videos. One way to summarize a video is to select poster frames to represent segments of the video. Previous techniques for extracting poster frames were based on scene segmentation, using color histograms or optical flow. To provide more informative poster frames, this work combines algorithms for extracting image content, specifically faces and on-screen text, with existing scene segmentation technology.

Shumeet Baluja | Michael Smith | Henry A. Rowley

[1] V. Rich. Personal communication , 1989, Nature.

[2] P. Peronatg. Recognition of Planar Object Classes , 1996 .

[3] Tomaso A. Poggio,et al. Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[4] M. Smith,et al. Video Skimming for Quick Browsing based on Audio and Image Characterization , 1995 .

[5] Remi Depommier,et al. Content-based browsing of video sequences , 1994, MULTIMEDIA '94.

[6] Thomas S. Huang,et al. Human face detection in a complex background , 1994, Pattern Recognit..

[7] R. Vaillant,et al. Original approach for the localisation of objects in images , 1994 .

[8] Michael Mills,et al. A magnifier tool for video data , 1992, CHI.