Paper Information - Data2Vis: Automatic Generation of Data Visualizations Using Sequence-to-Sequence Recurrent Neural Networks


Rapidly creating effective visualizations using expressive grammars is challenging for users who have limited time and limited skills in statistics and data visualization. Even high-level, dedicated visualization tools often require users to manually select among data attributes, decide which transformations to apply, and specify mappings between visual encoding variables and raw or transformed attributes. In this paper we introduce Data2Vis, an end-to-end trainable neural translation model for automatically generating visualizations from given datasets. We formulate visualization generation as a language translation problem, where data specifications are mapped to visualization specifications in a declarative language (Vega-Lite). To this end, we train a multilayered attention-based encoder–decoder network with long short-term memory (LSTM) units on a corpus of visualization specifications. Qualitative results show that our model learns the vocabulary and syntax for a valid visualization specification, appropriate transformations (count, bins, mean), and how to use common data selection patterns that occur within data visualizations. We introduce two metrics for evaluating the task of automated visualization generation (language syntax validity, visualization grammar syntax validity) and demonstrate the efficacy of bidirectional models with attention mechanisms for this task. Data2Vis generates visualizations that are comparable to manually created visualizations in a fraction of the time, with potential to learn more complex visualization strategies at scale.
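The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes that language syntax validity means the generated text parses as a JSON object (Vega-Lite specifications are written in JSON). Grammar syntax validity is approximated here by a deliberately simplified stand-in that only checks for a `mark` field and an `encoding` object; a real evaluation would validate against the Vega-Lite compiler or schema instead. The function names and sample outputs are hypothetical and are not taken from the paper.

```python
import json


def is_valid_json(text):
    """Language syntax validity (assumed): the text parses as a JSON object."""
    try:
        return isinstance(json.loads(text), dict)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return False


def looks_like_vega_lite(text):
    """Simplified stand-in for grammar syntax validity.

    Assumption: a spec counts as valid if it has a 'mark' field and an
    'encoding' object. This is NOT the real Vega-Lite validation.
    """
    if not is_valid_json(text):
        return False
    spec = json.loads(text)
    return "mark" in spec and isinstance(spec.get("encoding"), dict)


def validity_rates(outputs):
    """Return (language validity rate, grammar validity rate) for model outputs."""
    n = len(outputs)
    if n == 0:
        return 0.0, 0.0
    language_rate = sum(is_valid_json(o) for o in outputs) / n
    grammar_rate = sum(looks_like_vega_lite(o) for o in outputs) / n
    return language_rate, grammar_rate


# Hypothetical generated outputs: one complete spec, one JSON object
# without a mark, and one truncated string that is not valid JSON.
samples = [
    '{"mark": "bar", "encoding": {"x": {"field": "a", "type": "nominal"}}}',
    '{"encoding": {"x": {"field": "a"}}}',
    '{"mark": "bar", "encoding": ',
]

language_rate, grammar_rate = validity_rates(samples)
print(f"language validity: {language_rate:.2f}, grammar validity: {grammar_rate:.2f}")
```

Separating the two checks lets a truncated output count against both metrics, while a well-formed JSON object that is not a usable chart specification counts only against the grammar metric.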

Çağatay Demiralp | Victor C. Dibia
