DeepWriting: Making Digital Ink Editable via Deep Generative Modeling

Digital ink promises to combine the flexibility and aesthetics of handwriting with the ability to process, search and edit digital text. Character recognition converts handwritten text into a digital representation, albeit at the cost of losing personalized appearance due to the technical difficulties of separating the interwoven components of content and style. In this paper, we propose a novel generative neural network architecture that is capable of disentangling style from content and thus making digital ink editable. Our model can synthesize arbitrary text while giving users control over the visual appearance (style). This enables, for example, style transfer without changing the content, editing of digital ink at the word level, and other application scenarios such as spell-checking and correction of handwritten text. We furthermore contribute a new dataset of handwritten text with fine-grained annotations at the character level and report results from an initial user evaluation.
