Complex Image Recognition and Web Security

Web services offered for human use are being abused by programs. Efforts to defend against these abuses have, over the last 5 years, stimulated the development of a new family of security protocols able to distinguish between human and machine users automatically over graphical user interfaces (GUIs) and networks. AltaVista pioneered this technology in 1997; by 2000, Yahoo! and PayPal were using similar methods. Researchers at Carnegie- Mellon University [2] and then a collaboration between the University of California at Berkeley and the Palo Alto Research Center [9] developed such tests. By January 2002 the subject was called human interactive proofs (HIPs), defined broadly as challenge/response protocols that allow a human to authenticate herself as a member of a given group: e.g., human (vs. machine), herself (vs. anyone else), etc. All commercial uses of HIPs exploit the gap in reading ability between humans and machines. Thus, many technical issues studied by the image recognition research community are relevant to HIPs. This chapter describes the evolution of HIP R&D, applications of HIPs now and on the horizon, relevant legal issues, highlights of the first two HIP workshops, and proposals for an image recognition research agenda to advance the state of the art of HIPs.

[1]  K. Rayner,et al.  The psychology of reading , 1989 .

[2]  Tapas Kanungo,et al.  Document degradation models and a methodology for degradation model validation , 1996 .

[3]  Daniel P. Lopresti,et al.  Locating and Recognizing Text in WWW Images , 2000, Information Retrieval.

[4]  Varol Akman,et al.  Turing Test: 50 Years Later , 2000, Minds and Machines.

[5]  Henry S. Baird,et al.  A Highly Legible CAPTCHA That Resists Segmentation Attacks , 2005, HIP.

[6]  Tapas Kanungo What Fraction of Images on the Web Contain Text ? , 2001 .

[7]  Gordon E Legge,et al.  Psychophysics of reading XX. Linking letter recognition to reading speed in central and peripheral vision , 2001, Vision Research.

[8]  Jitendra Malik,et al.  Recognizing objects in adversarial clutter: breaking a visual CAPTCHA , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[9]  Stephen V. Rice,et al.  The Fourth Annual Test of OCR Accuracy , 1995 .

[10]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[11]  Manuel Blum,et al.  Secure Human Identification Protocols , 2001, ASIACRYPT.

[12]  G. Legge,et al.  Psychophysics of reading—I. Normal vision , 1985, Vision Research.

[13]  Henry S. Baird,et al.  Document image defect models , 1995 .

[14]  Tin Kam Ho,et al.  Large-Scale Simulation Studies in Image Pattern Recognition , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[16]  Henry S. Baird,et al.  Pessimal print: a reverse Turing test , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[17]  G. Legge,et al.  Mr. Chips: An ideal-observer model of reading , 1997 .

[18]  Kris Popat,et al.  Human Interactive Proofs and Document Image Analysis , 2002, Document Analysis Systems.

[19]  Henry S. Baird,et al.  ScatterType: a reading CAPTCHA resistant to segmentation attack , 2005, IS&T/SPIE Electronic Imaging.

[20]  Mary Czerwinski,et al.  Building Segmentation Based Human-Friendly Human Interaction Proofs (HIPs) , 2005, HIP.

[21]  Pietro Perona,et al.  Probabilistic affine invariants for recognition , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[22]  Theodosios Pavlidis 36 years on the pattern recognition front: Lecture given at ICPR'2000 in Barcelona, Spain on the occasion of receiving the K.S. Fu prize , 2003, Pattern Recognit. Lett..

[23]  Dinggang Shen,et al.  Affine-invariant image retrieval by correspondence matching of shapes , 1999, Image Vis. Comput..

[24]  Kazuhiko Yamamoto,et al.  Structured Document Image Analysis , 1992, Springer Berlin Heidelberg.

[25]  D. Pelli,et al.  Feature detection and letter identification , 2006, Vision Research.

[26]  Paul A. Kolers,et al.  Processing of visible language , 1979 .

[27]  Henry S. Baird,et al.  BaffleText: a Human Interactive Proof , 2003, IS&T/SPIE Electronic Imaging.