Associating text and graphics for scientific chart understanding

This paper presents our recent work on associating the recognition results of the textual and graphical information contained in scientific chart images. Text components are first located in the input image and then recognized using OCR. Graphical objects, in turn, are segmented and grouped into high-level symbols. Both logical and semantic correspondences between text and graphical symbols are then identified. Associating text with graphics allows us to capture the semantic meaning carried by scientific chart images more completely than either source of information allows on its own. The result of scientific chart image understanding is presented as XML documents.
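As a rough illustration of the association step and the XML output, the sketch below pairs each recognized text item with the nearest graphical symbol and serializes the result. It is not the method of the paper: the nearest-center rule, the class names `Box`, `TextItem`, and `Symbol`, the functions `associate` and `to_xml`, and the XML element names are all assumptions made for this example, and the sketch assumes at least one symbol is present.

```python
from dataclasses import dataclass
from math import hypot
import xml.etree.ElementTree as ET


@dataclass
class Box:
    """Axis-aligned bounding box in image pixel coordinates."""
    x: float
    y: float
    w: float
    h: float

    def center(self):
        return (self.x + self.w / 2, self.y + self.h / 2)


@dataclass
class TextItem:
    content: str  # string returned by OCR
    box: Box


@dataclass
class Symbol:
    kind: str  # hypothetical symbol label, e.g. "bar" or "axis"
    box: Box


def associate(texts, symbols):
    """Pair each text item with the index of its nearest symbol.

    Assumption: proximity of box centers stands in for the logical
    correspondence; a real system would use chart-specific rules.
    """
    pairs = []
    for t in texts:
        tx, ty = t.box.center()

        def distance(i):
            sx, sy = symbols[i].box.center()
            return hypot(sx - tx, sy - ty)

        pairs.append((t, min(range(len(symbols)), key=distance)))
    return pairs


def to_xml(texts, symbols):
    """Serialize symbols and their associated labels (hypothetical schema)."""
    root = ET.Element("chart")
    nodes = [ET.SubElement(root, "symbol", kind=s.kind) for s in symbols]
    for t, i in associate(texts, symbols):
        ET.SubElement(nodes[i], "label").text = t.content
    return ET.tostring(root, encoding="unicode")
```

Calling `to_xml` with two bars and two tick labels placed beneath them attaches each label to the bar directly above it. Center distance is only a stand-in here; semantic correspondence, such as reading a bar's value from the axis scale, would need additional logic.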
