Performance Evaluation of Document Structure Extraction Algorithms

This paper presents a performance metric for the document structure extraction algorithms by finding the correspondences between detected entities and ground truth. We describe a method for determining an algorithm's optimal tuning parameters. We evaluate a group of document layout analysis algorithms on 1600 images from the UW-III Document Image Database, and the quantitative performance measures in terms of the rates of correct, miss, false, merging, splitting, and spurious detections are reported.

[1]  Robert M. Haralick,et al.  Document layout structure extraction using bounding boxes of different entitles , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[2]  A. Iserles Numerical recipes in C—the art of scientific computing , by W. H. Press, B. P. Flannery, S. A. Teukolsky and W. T. Vetterling. Pp 735. £27·50. 1988. ISBN 0-521-35465-X (Cambridge University Press) , 1989, The Mathematical Gazette.

[3]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[4]  Michael D. Garris,et al.  Evaluating spatial correspondence of zones in document recognition systems , 1995, Proceedings., International Conference on Image Processing.

[5]  Robert M. Haralick,et al.  Performance evaluation of document layout analysis algorithms on the UW data set , 1997, Electronic Imaging.

[6]  Shahram Latifi How Can Permutations Be Used in The Evaluation of Zoning Algorithms? , 1996, Int. J. Pattern Recognit. Artif. Intell..

[7]  George Nagy,et al.  Automated Evaluation of OCR Zoning , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Robert M. Haralick,et al.  Document page decomposition using bounding boxes of connected components of black pixels , 1995, Electronic Imaging.

[9]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[10]  Luc M. Vincent,et al.  Benchmarking page segmentation algorithms , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Garret N. Vanderplaats,et al.  Numerical optimization techniques for engineering design , 1999 .

[12]  George Nagy,et al.  HIERARCHICAL REPRESENTATION OF OPTICALLY SCANNED DOCUMENTS , 1984 .

[13]  William H. Press,et al.  Numerical recipes in C , 2002 .

[14]  Emanuele Trucco,et al.  Computer and Robot Vision , 1995 .

[15]  Robert M. Haralick,et al.  CD-ROM document database standard , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[16]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[17]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[18]  Robert M. Haralick,et al.  Document image understanding: geometric and logical layout , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.