Video classification based on HMM using text and faces

Video content classification and retrieval is a necessary tool in the current merging of entertainment and information media. With the advent of broadband networking, every consumer will have video programs available on-line as well as in the traditional distribution channels. Systems that help in content management have to discern between different categories of video in order to provide for fast retrieval. In this paper we present a novel method for video classification based on face and text trajectories. This is based on the observation that in different TV categories there are different face and text trajectory patterns. Face and text tracking is applied to arbitrary video clips to extract faces and text trajectories. We used Hidden Markov Models (HMM) to classify a given video clip into predefined categories, e.g., commercial, news, sitcom and soap. Our preliminary experimental results show classification accuracy of over 80% for HMM method on short video clips. This paper describes continuity-based face and text detection and tracking in video for the above HMM classification method.

[1]  Gang Wei,et al.  TV program classification based on face and text processing , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[2]  Ahmed M. Elgammal,et al.  Face detection in complex environments from color images , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[3]  Nathan Intrator,et al.  Face detection by direct convexity estimation , 1997, Pattern Recognit. Lett..

[4]  Alex Pentland,et al.  A subspace method for maximum likelihood target detection , 1995, Proceedings., International Conference on Image Processing.

[5]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Michael J. Witbrock,et al.  Story segmentation and detection of commercials in broadcast news video , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[7]  T. Little,et al.  Compressed Video Processing For Cut Detection , 1996 .

[8]  Thomas S. Huang,et al.  Human face detection in a complex background , 1994, Pattern Recognit..

[9]  Rama Chellappa,et al.  Human and machine recognition of faces: a survey , 1995, Proc. IEEE.

[10]  Nevenka Dimitrova,et al.  ADVISO demonstration: a video content management system , 1999, MULTIMEDIA '99.

[11]  Roberto Cipolla,et al.  Feature-based human face detection , 1997, Image Vis. Comput..

[12]  Eli Saber,et al.  Frontal-view face detection and facial feature extraction using color, shape and symmetry based cost functions , 1998, Pattern Recognit. Lett..

[13]  Gang Wei,et al.  Face detection for image annotation , 1999, Pattern Recognition Letters.

[14]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Ying Dai,et al.  Face-texture model based on SGLD and its application in face detection in a color scene , 1996, Pattern Recognit..

[16]  Michael C. Burl,et al.  Finding faces in cluttered scenes using random labeled graph matching , 1995, Proceedings of IEEE International Conference on Computer Vision.

[17]  Chin-Chuan Han,et al.  Facial feature detection using geometrical face model: An efficient approach , 1998, Pattern Recognit..