Observations on Typing from 136 Million Keystrokes

We report on typing behaviour and performance of 168,000 volunteers in an online study. The large dataset allows detailed statistical analyses of keystroking patterns, linking them to typing performance. Besides reporting distributions and confirming some earlier findings, we report two new findings. First, letter pairs that are typed by different hands or fingers are more predictive of typing speed than, for example, letter repetitions. Second, rollover-typing, wherein the next key is pressed before the previous one is released, is sur- prisingly prevalent. Notwithstanding considerable variation in typing patterns, unsupervised clustering using normalised inter-key intervals reveals that most users can be divided into eight groups of typists that differ in performance, accuracy, hand and finger usage, and rollover. The code and dataset are released for scientific use.

[1]  I. Scott MacKenzie,et al.  Phrase sets for evaluating text entry techniques , 2003, CHI Extended Abstracts.

[2]  Daniel B. Horn,et al.  Patterns of entry and correction in large vocabulary continuous speech recognition systems , 1999, CHI '99.

[3]  Niels Henze,et al.  Observational and experimental investigation of typing behaviour using virtual keyboards for mobile devices , 2012, CHI.

[4]  Kent Lyons,et al.  Revisiting and validating a model of two-thumb text entry , 2007, CHI.

[5]  Gordon D Logan,et al.  Different (key)strokes for different folks: How standard and nonstandard typists balance Fitts' law and Hick's law. , 2016, Journal of experimental psychology. Human perception and performance.

[6]  Martina Rieger,et al.  Typing Style and the Use of Different Sources of Information during Typing: An Investigation Using Self-Reports , 2016, Front. Psychol..

[7]  Hisao Yamada,et al.  An Analysis of the Standard English Keyboard , 1980, COLING.

[8]  Ulf-Dietrich Reips,et al.  A Brief History of Web Experimenting , 2000 .

[9]  Per Ola Kristensson,et al.  A versatile dataset for text entry evaluations based on genuine mobile emails , 2011, Mobile HCI.

[10]  Jonathan T. Grudin,et al.  Error Patterns in Novice and Skilled Transcription Typing , 1983 .

[11]  Yili Liu,et al.  Queuing Network Modeling of Transcription Typing , 2008, TCHI.

[12]  Shwetak N. Patel,et al.  ContextType: using hand posture information to improve mobile touch screen text entry , 2013, CHI.

[13]  D. Gentner Keystroke Timing in Transcription Typing , 1983 .

[14]  Katharina Reinecke,et al.  LabintheWild: Conducting Large-Scale Online Experiments With Uncompensated Samples , 2015, CSCW.

[15]  Shumin Zhai,et al.  Making touchscreen keyboards adaptive to keys, hand postures, and individuals: a hierarchical spatial backoff model approach , 2013, CHI.

[16]  J. Grabowski The internal structure of university student's keyboard skills , 2008 .

[17]  Nellie L. Merrick,et al.  Typewriting Behavior: Psychology Applied to Teaching and Learning Typewriting , 1936 .

[18]  Antti Oulasvirta,et al.  How We Type: Movement Strategies and Performance in Everyday Typing , 2016, CHI.

[19]  T. Salthouse Perceptual, cognitive, and motoric aspects of transcription typing. , 1986, Psychological bulletin.

[20]  Ulf-Dietrich Reips Chapter 4 – The Web Experiment Method: Advantages, Disadvantages, and Solutions , 2000 .

[21]  Charles E. Hughes,et al.  Establishing a baseline for text entry for a multi-touch virtual keyboard , 2012, Int. J. Hum. Comput. Stud..

[22]  Janet C. Read,et al.  ExpECT: an expanded error categorisation method for text input , 2007, BCS HCI.

[23]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[24]  Gopal K. Gupta,et al.  Identity authentication based on keystroke latencies , 1990, Commun. ACM.

[25]  Brad A. Myers,et al.  Analyzing the input stream for character- level errors in unconstrained text entry evaluations , 2006, TCHI.

[26]  L. Shaffer,et al.  Typing Performance as a Function of Text , 1968 .

[27]  Jacob O. Wobbrock,et al.  Measures of Text Entry Performance , 2007 .

[28]  Jeongmin Son,et al.  TapBoard: making a touch screen keyboard more touchable , 2013, CHI Extended Abstracts.

[29]  Damon L. Woodard,et al.  Biometric Authentication and Identification using Keystroke Dynamics: A Survey , 2012 .

[30]  M. Akila,et al.  Biometric personal authentication using keystroke dynamics: A review , 2011, Appl. Soft Comput..

[31]  Xiaojun Bi,et al.  Word Clarity as a Metric in Sampling Keyboard Test Sets , 2017, CHI.

[32]  K. Nakayama,et al.  Is the Web as good as the lab? Comparable performance from Web and lab in cognitive/perceptual experiments , 2012, Psychonomic Bulletin & Review.

[33]  K. A. Ericsson,et al.  A deliberate practice account of typing proficiency in everyday typists. , 2007, Journal of experimental psychology. Applied.

[34]  I. Scott MacKenzie,et al.  A Model of Two-Thumb Text Entry , 2002, Graphics Interface.

[35]  T. Salthouse Effects of age and skill in typing. , 1984, Journal of experimental psychology. General.

[36]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[37]  Frode Eika Sandnes Evaluating mobile text entry strategies with finite state automata , 2005, Mobile HCI.

[38]  Ngoc Tran Nguyen Distance-based classification of keystroke dynamics , 2016, International Workshop on Pattern Recognition.

[39]  Per Ola Kristensson,et al.  Performance comparisons of phrase sets and presentation styles for text entry evaluations , 2012, IUI '12.

[40]  P. Dowland,et al.  A long-term trial of alternative user authentication technologies , 2004, Inf. Manag. Comput. Secur..