WLFDB : Weakly Labeled Face Databases

Auto face annotation aims to detect human faces from a facial image and tag the faces with the corresponding human names. It is a fundamental research problem and plays a critical role for many real-world applications in computer vision and pattern recognition. Instead of adopting traditional “model-based face annotation” techniques, the “search-based face annotation” recently has been gaining increasing attentions for mining large amounts of weakly labeled facial images freely available on the Internet. Although several web facial image databases have been constructed by researchers in literature (e.g., LFW, Yahoo!News, FAN-large, and PubFig), they are not suitable for addressing the search-based face annotation problem. To facilitate the research of search-based face annotation, we build WLFDB — a large-scale Weakly Labeled Face Database, with three major characteristics: (i) Large-scale Weakly Labeled Facial Images: WLFDB consists of over 6, 000 subjects and more than 700, 000 web facial images, where the images are weakly labeled with names from the query text; (ii) Rich Data Types: We make WLFDB a comprehensive testbed with three types of data: “raw web facial images”, “aligned facial images”, and “facial feature representations”. Researchers can easily use WLFDB with any type of data with minimal efforts; (iii) Benchmark Protocol: We provide a standard benchmark protocol to evaluate the performance of different search-based face annotation techniques based on the same ground truth test set. In addition, three baseline algorithms are evaluated based on the same test sets according to the hit rate metric. In summary, WLFDB is a large-scale weakly labeled facial images database that attempts to model real-world web facial images. We hope it will not only facilitate the research of search-based face annotation, but also benefit other kinds of face related research, such as face detection, alignment, verification, and recognition, etc. WLFDB is freely available to public for non-commercial research purposes at http://wlfdb.stevenhoi.com/.

[1]  Laurent Itti,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 Rapid Biologically-inspired Scene Classification Using Features Shared with Visual Attention , 2022 .

[2]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Barbara Caputo,et al.  A Large-Scale Database of Images and Captions for Automatic Face Naming , 2011, BMVC.

[4]  Matti Pietikäinen,et al.  Face Recognition with Local Binary Patterns , 2004, ECCV.

[5]  Cordelia Schmid,et al.  Multiple Instance Metric Learning from Automatically Labeled Bags of Faces , 2010, ECCV.

[6]  Zhe Wang,et al.  Modeling LSH for performance tuning , 2008, CIKM '08.

[7]  Klaus J. Kirchberg,et al.  Robust Face Detection Using the Hausdorff Distance , 2001, AVBPA.

[8]  Tao Mei,et al.  Retrieval-Based Face Annotation by Weak Label Regularized Local Coordinate Coding , 2014, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Terence Sim,et al.  The CMU Pose, Illumination, and Expression Database , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Erik Hjelmås,et al.  Face Detection: A Survey , 2001, Comput. Vis. Image Underst..

[11]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[12]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Luc Van Gool,et al.  Unsupervised face alignment by robust nonrigid mapping , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[14]  Shuicheng Yan,et al.  Near-duplicate keyframe retrieval by nonrigid image matching , 2008, ACM Multimedia.

[15]  Chunyan Miao,et al.  Learning to name faces: a multimodal learning scheme for search-based face annotation , 2013, SIGIR.

[16]  Chunyan Miao,et al.  FANS: face annotation by searching large-scale web facial images , 2013, WWW.

[17]  Patrick J. Flynn,et al.  Overview of the face recognition grand challenge , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18]  Zhengyou Zhang,et al.  A Survey of Recent Advances in Face Detection , 2010 .

[19]  Diane Gershon,et al.  In the picture , 1990, Nature.