A language modeling-like approach to sketching

Sketching is a universal communication tool that, despite its simplicity, is able to efficiently express a large variety of concepts and, in some limited contexts, it can be even more immediate and effective than natural language. In this paper we explore the feasibility of using neural networks to approach sketching in the same way they are commonly used in Language Modeling. We propose a novel approach to what we refer to as "Sketch Modeling", in which a neural network is exploited to learn a probabilistic model that estimates the probability of sketches. We focus on simple sketches and, in particular, on the case in which sketches are represented as sequences of segments. Segments and sequences can be either given - when the sketches are originally drawn in this format - or automatically generated from the input drawing by means of a procedure that we designed to create short sequences, loosely inspired by the human behavior. A Recurrent Neural Network is used to learn the sketch model and, afterward, the network is seeded with an incomplete sketch that it is asked to complete, generating one segment at each time step. We propose a set of measures to evaluate the outcome of a Beam Search-based generation procedure, showing how they can be used to identify the most promising generations. Our experimental analysis assesses the feasibility of this way of modeling sketches, also in the case in which several different categories of sketches are considered.

[1]  Tao Xiang,et al.  Learning to Sketch with Shortcut Cycle Consistency , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2]  Ricardo Matsumura de Araújo,et al.  On the Performance of GoogLeNet and AlexNet Applied to Sketches , 2016, AAAI.

[3]  Liqing Zhang,et al.  Free Hand-Drawn Sketch Segmentation , 2012, ECCV.

[4]  Marco Gori,et al.  The Role of Coherence in Facial Expression Recognition , 2018, AI*IA.

[5]  Hongan Wang,et al.  SketchGAN: Joint Sketch Completion and Recognition With Generative Adversarial Network , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Ravi Kiran Sarvadevabhatla,et al.  Pictionary-Style Word Guessing on Hand-Drawn Object Sketches: Dataset, Analysis and Deep Network Models , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  David H. Douglas,et al.  ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[8]  James Hays,et al.  The sketchy database , 2016, ACM Trans. Graph..

[9]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Gamini Dissanayake,et al.  C-LOG: A Chamfer distance based algorithm for localisation in occupancy grid-maps , 2016, CAAI Trans. Intell. Technol..

[11]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[12]  Marc Alexa,et al.  How do humans sketch objects? , 2012, ACM Trans. Graph..

[13]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[14]  Basil Tikoff,et al.  Sketch Worksheets in STEM Classrooms: Two Deployments , 2018, AAAI.

[15]  Keiichi Abe,et al.  Topological structural analysis of digitized binary images by border following , 1985, Comput. Vis. Graph. Image Process..

[16]  Lukás Burget,et al.  Extensions of recurrent neural network language model , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Xin Yan,et al.  AI-Sketcher : A Deep Generative Model for Producing High-Quality Sketches , 2019, AAAI.

[18]  Xiaodong Liu,et al.  Unified Language Model Pre-training for Natural Language Understanding and Generation , 2019, NeurIPS.

[19]  Jiri Matas,et al.  Robust Detection of Lines Using the Progressive Probabilistic Hough Transform , 2000, Comput. Vis. Image Underst..

[20]  Jaakko Lehtinen,et al.  Sketching Clothoid Splines Using Shortest Paths , 2010, Comput. Graph. Forum.

[21]  Nicu Sebe,et al.  Attribute-Guided Sketch Generation , 2019, 2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019).

[22]  Tetsuya Ogata,et al.  Adaptive Drawing Behavior by Visuomotor Learning Using Recurrent Neural Networks , 2019, IEEE Transactions on Cognitive and Developmental Systems.

[23]  Dejan Todorovic,et al.  Gestalt principles , 2008, Scholarpedia.

[24]  Marco Maggini,et al.  Learning in Text Streams: Discovery and Disambiguation of Entity and Relation Instances , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[26]  Jie Yang,et al.  Unpaired Image-to-Sketch Translation Network for Sketch Synthesis , 2019, 2019 IEEE Visual Communications and Image Processing (VCIP).

[27]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[28]  Anil A. Bharath,et al.  Adversarial Training for Sketch Retrieval , 2016, ECCV Workshops.