Improved Hierarchical Clustering for Face Images in Videos: Integrating positional and temporal information with HAC

Efficient techniques for face clustering, along with speech and text recognition, can provide the means for rapid browsing and accurate retrieval of videos from large video databases. Observing that video data contains information about not only the face features, but also the temporal ordering of the faces and positional coordinates of the face-regions within the video frame, an attempt is made to consolidate all of these details into the framework of the flexible and intuitive hierarchical clustering algorithm. This paper outlines a novel initialization mechanism for the hierarchical clustering to follow, based on the temporal and positional information of the face-samples extracted from a video. Experiments with news broadcast videos show that the novel algorithm is considerably more efficient than it's parent, and is promising for future exploration.

[1]  Noel E. O'Connor,et al.  Face detection and clustering for video indexing applications , 2003 .

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  Vitomir Struc,et al.  The Complete Gabor-Fisher Classifier for Robust Face Recognition , 2010, EURASIP J. Adv. Signal Process..

[4]  Subhradeep Kayal,et al.  Face clustering in videos: GMM-based hierarchical clustering using Spatio-Temporal data , 2013, 2013 13th UK Workshop on Computational Intelligence (UKCI).

[5]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[6]  R. Brunelli,et al.  A Survey on the Automatic Indexing of Video Data, , 1999, J. Vis. Commun. Image Represent..

[7]  Gerhard Rigoll,et al.  Content based indexing of images and video using face detection and recognition methods , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[8]  Julia Hirschberg,et al.  V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.

[9]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  V. Kshirsagar,et al.  Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[11]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[12]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[13]  Tao Zhang,et al.  Face Recognition Dimensionality Reduction Based on LLE and ISOMAP , 2013 .

[14]  Georgios Tziritas,et al.  Face Detection Using Quantized Skin Color Regions Merging and Wavelet Packet Analysis , 1999, IEEE Trans. Multim..