Detection of matrices and segmentation of matrix elements in scanned images of scientific documents

In an earlier paper (2002), we proposed a method for recognizing matrices that contain abbreviation symbols, together with a format for representing matrix structure, and reported experimental results. The method consisted of four processes: detection of matrices, segmentation of elements, construction of networks, and analysis of the matrix structure. That paper focused on the construction of networks and the analysis of the matrix structure. However, we concluded that improving the other two processes was essential for achieving high recognition accuracy. In this paper, we describe the improved processes for the detection of matrices and the segmentation of elements, and we report experimental results.
