Localizing and Analyzing the Infographics in Document Using Deep Learning

In explaining complicated concepts, infographics are a far more effective medium of communication than regular prose. In recent years, deep learning has seen a lot of success in a range of applications requiring pattern identification and artificial intelligence. One of these applications is image recognition. There are many kinds of infographics that may be used in resumes and CVs to demonstrate the degree of competence. The objective was to identify those infographics that were already existing in the CV and to determine the word that corresponded to each infographic before attempting to measure each infographic using numeric characters. The YOLO algorithm was used to identify infographics, while OCR was utilized in order to identify associated words. The filled component of the infographic was distinguished from the unfilled area by using the image intensity histogram analysis, thresholding and contours.

[1]  R. Guo,et al.  YOLO-table: disclosure document table detection with involution , 2022, International Journal on Document Analysis and Recognition (IJDAR).

[2]  N I Widiastuti,et al.  Document Image Extraction System Design , 2020, IOP Conference Series: Materials Science and Engineering.

[3]  Andreas Dengel,et al.  Benchmarking Object Detection Networks for Image Based Reference Detection in Document Images , 2019, 2019 Digital Image Computing: Techniques and Applications (DICTA).

[4]  Bharat Bhushan,et al.  Real Time Indian License Plate Detection using Deep Neural Networks and Optical Character Recognition using LSTM Tesseract , 2019, 2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS).

[5]  Yongdong Zhang,et al.  Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching , 2019, ACM Multimedia.

[6]  C. V. Jawahar,et al.  Graphical Object Detection in Document Images , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[7]  Andrew S. Agbemenu,et al.  An Automatic Number Plate Recognition System using OpenCV and Tesseract OCR Engine , 2018 .

[8]  Concetto Spampinato,et al.  A Saliency-based Convolutional Neural Network for Table and Chart Detection in Digitized Documents , 2018, ICIAP.

[9]  Xiaogang Wang,et al.  Context Encoding for Semantic Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[10]  Johannes Michael,et al.  A two-stage method for text line detection in historical documents , 2018, International Journal on Document Analysis and Recognition (IJDAR).

[11]  Muhammad Imran Malik,et al.  Table Detection Using Deep Learning , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[12]  Lei Zhang,et al.  Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13]  Zhi Tang,et al.  A Table Detection Method for PDF Documents Based on Convolutional Neural Networks , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).