OCR Correction via Human Computational Game

We present TypeAttack, a Facebook game that improves the efficiency of digitization, the process of converting analog information into digital form. In this game, players compete to type out text from images of scanned documents as quickly and as accurately as possible. These historical documents are thus manually digitized as a by-product of gameplay. By harnessing the perceptual abilities of humans as well as their desire to be entertained, TypeAttack enhances the digitization of old documents, especially those that Optical Character Recognition (OCR) programs cannot transcribe accurately.
