论文信息 - Video Face Clustering via Constrained Sparse Representation

Video Face Clustering via Constrained Sparse Representation

In this paper, we focus on the problem of clustering faces in videos. Different from traditional clustering on a collection of facial images, a video provides some inherent benefits: faces from a face track must belong to the same person and faces from a video frame can not be the same person. These benefits can be used to enhance the clustering performance. More precisely, we convert the above benefits into must-link and cannot-link constraints. These constraints are further effectively incorporated into our novel algorithm, Video Face Clustering via Constrained Sparse Representation (CS-VFC). The CS-VFC utilizes the constraints in two stages, including sparse representation and spectral clustering. Experiments on real-world videos show the improvements of our algorithm over the state-of-the-art methods.

[1] Claire Cardie,et al. Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[2] Andrew W. Fitzgibbon,et al. On Affine Invariant Clustering and Automatic Cast Listing in Movies , 2002, ECCV.

[3] Ian Davidson,et al. On constrained spectral clustering and its applications , 2012, Data Mining and Knowledge Discovery.

[4] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Andrew W. Fitzgibbon,et al. Joint manifold distance: a new approach to appearance based clustering , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[6] Cordelia Schmid,et al. Unsupervised metric learning for face identification in TV video , 2011, 2011 International Conference on Computer Vision.

[7] Ben Taskar,et al. Talking pictures: Temporal grouping and dialog-supervised person recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8] R. Vidal,et al. Sparse Subspace Clustering: Algorithm, Theory, and Applications. , 2013, IEEE transactions on pattern analysis and machine intelligence.

[9] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[10] Ioannis Pitas,et al. A mutual information based face clustering algorithm for movie content analysis , 2011, Image Vis. Comput..

[11] D. Donoho. For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution , 2006 .

[12] Changsheng Xu,et al. Character Identification in Feature-Length Films Using Global Face-Name Matching , 2009, IEEE Transactions on Multimedia.

[13] Miguel Á. Carreira-Perpiñán,et al. Constrained spectral clustering through affinity propagation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14] Erik G. Learned-Miller,et al. Unsupervised Joint Alignment of Complex Images , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[15] Guangliang Chen,et al. Spectral Curvature Clustering (SCC) , 2009, International Journal of Computer Vision.

[16] Stephen P. Boyd,et al. An Interior-Point Method for Large-Scale l1-Regularized Logistic Regression , 2007, J. Mach. Learn. Res..

[17] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[18] Lei Yu,et al. Study on Detection Technology of Load Waveform Based on Fuzzy Pattern Recognition , 2012 .

[19] Qiang Ji,et al. Constrained Clustering and Its Application to Face Clustering in Videos , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[20] Wen Gao,et al. Manifold-Manifold Distance with application to face recognition based on image set , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.