Solving Avatar Captchas Automatically

Captchas are challenge-response tests used in many online systems to prevent attacks by automated bots. Avatar Captchas are a recently-proposed variant in which users are asked to classify between human faces and computer-generated avatar faces, and have been shown to be secure if bots employ random guessing. We test a variety of modern object recognition and machine learning approaches on the problem of avatar versus human face classification. Our results show that using these techniques, a bot can successfully solve Avatar Captchas as often as humans can. These experiments suggest that this high performance is caused more by biases in the facial datasets used by Avatar Captchas and not by a fundamental flaw in the concept itself, but nevertheless our results highlight the difficulty in creating Captcha tasks that are immune to automatic solution.

[1]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[2]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[3]  Yaniv Taigman,et al.  Descriptor Based Methods in the Wild , 2008 .

[4]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[5]  Axel Pinz,et al.  Computer Vision – ECCV 2006 , 2006, Lecture Notes in Computer Science.

[6]  John Langford,et al.  Telling humans and computers apart automatically , 2004, CACM.

[7]  John Langford,et al.  CAPTCHA: Using Hard AI Problems for Security , 2003, EUROCRYPT.

[8]  A. M. Sapkal,et al.  A Framework to analyze the security of Text based CAPTCHA , 2010 .

[9]  Uwe Aickelin,et al.  Against Spyware Using CAPTCHA in Graphical Password Scheme , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[10]  Yasir Ahmad,et al.  Multi-Modal CAPTCHA: A User Verification Scheme , 2011, 2011 International Conference on Information Science and Applications.

[11]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[12]  Jon Howell,et al.  Asirra: a CAPTCHA that exploits interest-aligned manual image categorization , 2007, CCS '07.

[13]  David J. Crandall,et al.  Learning Visual Features for the Avatar Captcha Recognition Challenge , 2012, 2012 11th International Conference on Machine Learning and Applications.

[14]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[15]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[17]  Roman V. Yampolskiy,et al.  Avatar CAPTCHA: Telling computers and humans apart via face classification , 2012, 2012 IEEE International Conference on Electro/Information Technology.

[18]  Liming Wang,et al.  A Novel Image Based CAPTCHA Using Jigsaw Puzzle , 2010, 2010 13th IEEE International Conference on Computational Science and Engineering.

[19]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .