Named Faces: Putting Names to Faces

To provide automatic labeling of faces in video, the author has developed Named Faces, a fully functional automated system that builds a large database of name-face association pairs from broadcast news. This article describes how the system detects and recognizes superimposed text in the video, then verifies or repairs the text by comparing it with a large list of automatically generated names found in news stories. Faces found in the video where superimposed names were recognized are tracked, extracted, and associated with the superimposed text. With Named Faces, users can submit queries to find names for faces in video images.

[1]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[2]  Takeo Kanade,et al.  Name-It: Naming and Detecting Faces in Video by the Integration of Image and Natural Language Processing , 1997, IJCAI.

[3]  Lynette Hirschman,et al.  Mixed-Initiative Development of Language Processing Systems , 1997, ANLP.

[4]  Yihong Gong,et al.  Lessons Learned from Building a Terabyte Digital Video Library , 1999, Computer.

[5]  Seiichi Nakagawa,et al.  Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[6]  Wolfgang Effelsberg,et al.  Video abstracting , 1997, CACM.

[7]  Hiroaki Sakoe,et al.  A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .