论文信息 - Video face naming using global sequence alignment

Video face naming using global sequence alignment

This paper explores the problem of automatically naming faces in TV series or films. A novel method is proposed to build association between the faces in the video and the names in the script by a global sequence alignment algorithm. We firstly build two heterogenous sequences: a face sequence and a name sequence. The elements of the two sequences are cluster labels, computed from the clustering process, and speaking names, respectively. Then the alignment of the two sequences is considered as a problem of surjection between the cluster set and the name set. The optimal solution is obtained by minimizing the Levenshtein Distance between the two sequences which is constrained by the temporal order information. Experiments on public videos demonstrate the effectiveness of our method.

Yifan Zhang | Hanqing Lu | Zhiqiang Tang | Shuang Qiu

[1] Andrew Zisserman,et al. Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video , 2006, BMVC.

[2] Xiaogang Wang,et al. Deep Convolutional Network Cascade for Facial Point Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Changsheng Xu,et al. Robust Face-Name Graph Matching for Movie Character Identification , 2012, IEEE Transactions on Multimedia.

[4] Ulrike von Luxburg,et al. A tutorial on spectral clustering , 2007, Stat. Comput..

[5] Rainer Stiefelhagen,et al. “Knock! Knock! Who is it?” probabilistic person identification in TV-series , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Vladimir I. Levenshtein,et al. Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[7] Changsheng Xu,et al. Character Identification in Feature-Length Films Using Global Face-Name Matching , 2009, IEEE Transactions on Multimedia.

[8] Michael J. Fischer,et al. The String-to-String Correction Problem , 1974, JACM.

[9] Andrew Zisserman,et al. “Who are you?” - Learning person specific classifiers from video , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Changsheng Xu,et al. TVParser: An automatic TV video parsing method , 2011, CVPR 2011.