Establishing Good Benchmarks and Baselines for Face Recognition

The Rowland Institute at Harvard, Harvard University, 100 Edwin Land Blvd.,Cambridge, MA 02142, USA,cox@rowland.harvard.eduAbstract. Progress in face recognition relies critically on the creationof test sets against which the performance of various approaches can beevaluated. A good set must capture the essential elements of what makesthe problem hard, while conforming to practical scale limitations. How-ever, these goals are often deceptively dicult to achieve. In the relatedarea of object recognition, Pinto et al. [2] demonstrated the potentialdangers of using a large, uncontrolled natural image set, showing thatan extremely rudimentary vision system (inspired by the early stages ofvisual processing in the brain) was able to perform on par with manystate-of-the-art vision systems on the popular Caltech101 object set [3].At the same time, this same rudimentary system was easily defeatedby an ostensibly \simpler" synthetic recognition test designed to betterspan the range of real world variation in object pose, position, scale, etc.These results suggested that image sets that look \natural" to human ob-servers may nonetheless fail to properly embody the problem of interest,and that care must be taken to establish baselines against which perfor-mance can be judged. Here, we repeat this approach for the \LabeledFaces in the Wild" (LFW) dataset [1], and for a collection of standardface recognition tests. The goal of the present work is not to competein the LFW challenge, per se, but to provide a baseline against whichthe performance of other systems can be judged. In particular, we foundthat our rudimentary \baseline" vision system was able to achieve  68%correct performance on the LFW challenge, substantially higher than apure \chance" baseline. We argue that this value might serve as a moreuseful baseline against which to evaluate absolute performance and arguethat the LFW set, while perhaps not perfect, represents an improvementover other standard face sets.

[1]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[2]  Berk Gökberk,et al.  Feature selection for pose invariant face recognition , 2002, Object recognition supported by user interaction for service robots.

[3]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[4]  Witold Pedrycz,et al.  Face recognition: A study in information fusion using fuzzy integral , 2005, Pattern Recognit. Lett..

[5]  George Bebis,et al.  Face recognition experiments with random projection , 2005, SPIE Defense + Commercial Sensing.

[6]  R. Gross Face Databases , 2005 .

[7]  Simon C. K. Shiu,et al.  Two Dimensional Laplacianfaces Method for Face Recognition , 2006, RSCTC.

[8]  Palaiahnakote Shivakumara,et al.  Diagonal Fisher linear discriminant analysis for efficient face recognition , 2006, Neurocomputing.

[9]  David J. Hand,et al.  Classifier Technology and the Illusion of Progress , 2006, math/0606441.

[10]  Cordelia Schmid,et al.  Dataset Issues in Object Recognition , 2006, Toward Category-Level Object Recognition.

[11]  Weiguo Gong,et al.  Uncorrelated linear discriminant analysis based on weighted pairwise Fisher criterion , 2007, Pattern Recognit..

[12]  Frédéric Jurie,et al.  Learning Visual Similarity Measures for Comparing Never Seen Objects , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  David Zhang,et al.  Block Independent Component Analysis for Face Recognition , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[14]  Hongzhou Zhang,et al.  Face Recognition using Gaborface-based 2DPCA and (2D)2PCA Classification with Ensemble and Multichannel Model , 2007, 2007 IEEE Symposium on Computational Intelligence in Security and Defense Applications.

[15]  Marios Savvides,et al.  Frequency Domain Face Recognition , 2007 .

[16]  Nicolas Pinto,et al.  Why is Real-World Visual Object Recognition Hard? , 2008, PLoS Comput. Biol..

[17]  Lior Shamir,et al.  Evaluation of Face Datasets as Tools for Assessing the Performance of Face Recognition Methods , 2008, International Journal of Computer Vision.

[18]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .