Chart Image Classification Using Multiple-Instance Learning

An important step in chart image understanding is to identify the type of the input image so that corresponding interpretation can be performed. In this paper, we model the chart image classification as a multiple-instance learning problem. A chart image is treated as a bag containing a set of instances that are graphical symbols. For both training and recognition, shape detection is performed and general shape descriptors are used to form feature vectors. For the training images, the correlation factor (CF) of each shape is calculated for each chart type. The learnt CFs are then used to estimate the type of a new input image. Comparing with traditional multiple-instance learning algorithms, we allow negative examples to be less restrictive and hence easier to provide. Using our method, both the type and the data components of the chart image can be obtained in one-pass. The experimental results show that our approach works reasonably well

[1]  Joaquim A. Jorge,et al.  Polygon Detection from a Set of Lines , 2023, ArXiv.

[2]  Oded Maron,et al.  Multiple-Instance Learning for Natural Scene Classification , 1998, ICML.

[3]  Tomás Lozano-Pérez,et al.  A Framework for Multiple-Instance Learning , 1997, NIPS.

[4]  Bart Lamiroy,et al.  Text/Graphics Separation Revisited , 2002, Document Analysis Systems.

[5]  Chew Lim Tan,et al.  Model-Based Chart Image Recognition , 2003, GREC.

[6]  Nancy Green,et al.  Understanding Information Graphics: A Discourse-Level Problem , 2003, SIGDIAL Workshop.

[7]  Toyohide Watanabe,et al.  Layout-Based Approach for Extracting Constructive Elements of Bar-Charts , 1997, GREC.

[8]  Maciej M. Syslo,et al.  An Efficient Cycle Vector Space Algorithm for Listing all Cycles of a Planar Graph , 1981, SIAM J. Comput..

[9]  Tomás Lozano-Pérez,et al.  Image database retrieval with multiple-instance learning techniques , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[10]  Ingrid Zukerman,et al.  A Probabilistic Framework for Recognizing Intention in Information Graphics , 2005, IJCAI.

[11]  W. Eric L. Grimson,et al.  A framework for learning query concepts in image classification , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[12]  Bart Lamiroy,et al.  Graphics recognition - from re-engineering to retrieval , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[13]  Chew Lim Tan,et al.  Learning-based scientific chart recognition , 2001 .

[14]  Chew Lim Tan,et al.  Hough technique for bar charts detection and recognition in document images , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).