ICDAR 2003 page segmentation competition

There is a significant need to objectively evaluate layout analysis (page segmentation and region classification) methods. This paper describes the Page Segmentation Competition (modus operandi, dataset and evaluation criteria) held in the context of ICDAR2003 and presents the results of the evaluation of the candidate methods. The main objective of the competition was to evaluate such methods using scanned documents from commonly-occurring publications. The results indicate that although methods seem to be maturing, there is still a considerable need to develop robust methods that deal with everyday documents.

[1]  Luigi Cinque,et al.  DAN: An Automatic Segmentation and Classification Engine for Paper Documents , 2002, Document Analysis Systems.

[2]  Ihsin T. Phillips,et al.  The Second International Graphics Recognition Contest - Raster to Vector Conversion: A Report , 1997, GREC.

[3]  Ihsin T. Phillips,et al.  Empirical Performance Evaluation of Graphics Recognition Systems , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Basilios Gatos,et al.  First International Newspaper Segmentation contest , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[5]  George Nagy,et al.  Automated Evaluation of OCR Zoning , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Robert M. Haralick,et al.  CD-ROM document database standard , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[7]  Robert M. Haralick,et al.  A Performance Evaluation Protocol for Graphics Recognition Systems , 1997, GREC.

[8]  Apostolos Antonacopoulos,et al.  A Ground-Truthing Tool for Layout Analysis Performance Evaluation , 2002, Document Analysis Systems.