论文信息 - Reverse-engineering bar charts using neural networks

Reverse-engineering bar charts using neural networks

Reverse-engineering bar charts extract textual and numeric information from the visual representations of bar charts to support application scenarios that require the underlying information. In this paper, we propose a neural network-based method for reverse-engineering bar charts. We adopt a neural network-based object detection model to simultaneously localize and classify textual information. This approach improves the efficiency of textual information extraction. We design an encoder-decoder framework that integrates convolutional and recurrent neural networks to extract numeric information. We further introduce an attention mechanism into the framework to achieve high accuracy and robustness. Synthetic and real-world datasets are used to evaluate the effectiveness of the method. To the best of our knowledge, this work takes the lead in constructing a complete neural network-based method of reverse-engineering bar charts.

[1] Ming Chen,et al. Adaptive BER-constraint-based power allocation for downlink MC-CDMA systems with linear MMSE receiver , 2010, 2010 IEEE 12th International Conference on Communication Technology.

[2] Hairong Wang,et al. Visual Analytics for Electromagnetic Situation Awareness in Radio Monitoring and Management , 2020, IEEE Transactions on Visualization and Computer Graphics.

[3] Wei Chen,et al. ScatterNet: A Deep Subjective Similarity Model for Visual Analysis of Scatterplots , 2020, IEEE Transactions on Visualization and Computer Graphics.

[4] Anthony K. H. Tung,et al. LDSScanner: Exploratory Analysis of Low-Dimensional Structures in High-Dimensional Datasets , 2018, IEEE Transactions on Visualization and Computer Graphics.

[5] Wei Chen,et al. DataV: Data Visualization on large high-resolution displays , 2020, Vis. Informatics.

[6] Chew Lim Tan,et al. Hough technique for bar charts detection and recognition in document images , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[7] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[8] Maneesh Agrawala,et al. Graphical Overlays: Using Layered Elements to Aid Chart Reading , 2012, IEEE Transactions on Visualization and Computer Graphics.

[9] Jun Yuan,et al. A survey of visual analytics techniques for machine learning , 2020, Computational Visual Media.

[10] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[11] Jeffrey Heer,et al. Reverse‐Engineering Visualizations: Recovering Visual Encodings from Chart Images , 2017, Comput. Graph. Forum.

[12] Zhi-Hua Zhou,et al. Abductive learning: towards bridging machine learning and logical reasoning , 2019, Science China Information Sciences.

[13] Jiazhi Xia,et al. RSATree: Distribution-Aware Data Representation of Large-Scale Tabular Datasets for Flexible Visual Query , 2019, IEEE Transactions on Visualization and Computer Graphics.

[14] Didier Sornette,et al. Cascading Logistic Regression Onto Gradient Boosted Decision Trees to Predict Stock Market Changes Using Technical Analysis , 2018 .

[15] C. Lee Giles,et al. Automatic Summary Generation for Scientific Data Charts , 2016, AAAI Workshop: Scholarly Big Data.

[16] Mingjie Tang,et al. A Natural-language-based Visual Query Approach of Uncertain Human Trajectories , 2019, IEEE Transactions on Visualization and Computer Graphics.

[17] Ray Smith. An Overview of the Tesseract OCR Engine , 2007 .

[18] Richard E. Ladner,et al. Automated tactile graphics translation: in the field , 2007, Assets '07.

[19] François Chollet,et al. Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21] Jeffrey Heer,et al. ReVision: automated classification, analysis and redesign of chart images , 2011, UIST.

[22] Chew Lim Tan,et al. A system for understanding imaged infographics and its applications , 2007, DocEng '07.

[23] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[24] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[25] David S. Rosenberg,et al. Scatteract: Automated Extraction of Data from Scatter Plots , 2017, ECML/PKDD.

[26] Chongke Bi,et al. Evacuation route recommendation using auto-encoder and Markov decision process , 2019, Appl. Soft Comput..

[27] C. Lee Giles,et al. Automatic Extraction of Data from Bar Charts , 2015, K-CAP.

[28] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[29] Zhe Chen,et al. DiagramFlyer: A Search Engine for Data-Driven Diagrams , 2015, WWW.

[30] Jie Li,et al. Supporting Story Synthesis: Bridging the Gap between Visual Analytics and Storytelling , 2020, IEEE Transactions on Visualization and Computer Graphics.

[31] Ansgar Scherp,et al. Multi-oriented Text Extraction from Information Graphics , 2015, DocEng.

[32] Miguel A. Nacenta,et al. iVoLVER: Interactive Visual Language for Visualization Extraction and Reconstruction , 2016, CHI.

[33] Tony Beltramelli,et al. pix2code: Generating Code from a Graphical User Interface Screenshot , 2017, EICS.

[34] Matti Pietikäinen,et al. Deep Learning for Generic Object Detection: A Survey , 2018, International Journal of Computer Vision.

[35] Shijie Li,et al. A Visual Analysis Approach for Understanding Durability Test Data of Automotive Products , 2019, ACM Trans. Intell. Syst. Technol..

[36] Bongshin Lee,et al. ChartSense: Interactive Data Extraction from Chart Images , 2017, CHI.

[37] Meng Wang,et al. Chart decoder: Generating textual and numeric information from chart images automatically , 2018, J. Vis. Lang. Comput..

[38] Ying Zhao,et al. A survey of visualization for smart manufacturing , 2018, Journal of Visualization.

[39] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[40] Diego Klabjan,et al. Data Extraction from Charts via Single Deep Neural Network , 2019, ArXiv.

[41] C. Lee Giles,et al. Scalable algorithms for scholarly figure mining and semantics , 2016, SBD '16.

[42] Wei Chen,et al. Evaluating Perceptual Bias During Geometric Scaling of Scatterplots , 2019, IEEE Transactions on Visualization and Computer Graphics.

[43] Ali Farhadi,et al. FigureSeer: Parsing Result-Figures in Research Papers , 2016, ECCV.

[44] Xiaoping Fan,et al. IDSRadar: a real-time visualization framework for IDS alerts , 2012, Science China Information Sciences.

[45] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46] Michael Stonebraker,et al. Beagle : Automated Extraction and Interpretation of Visualizations from the Web , 2017 .

[47] Chongke Bi,et al. A survey on visualization of tensor field , 2019, Journal of Visualization.