Software architecture of PSET: a page segmentation evaluation toolkit

Abstract. Empirical performance evaluation of page segmentation algorithms has become increasingly important due to the numerous algorithms that are being proposed each year. In order to choose between these algorithms for a specific domain it is important to empirically evaluate their performance. To accomplish this task the document image analysis community needs: i) standardized document image datasets with groundtruth; ii) evaluation metrics that are agreed upon by researchers; and iii) freely available software for evaluating new algorithms and replicating other researchers' results. In an earlier paper (IEEE Transactions on Pattern Analysis and Machine Intelligence 2001) we published evaluation results for various popular page segmentation algorithms using the University of Washington dataset. In this paper we describe the software architecture of the PSET evaluation package, which was used to evaluate the segmentation algorithms. The description of the architecture will allow researchers to understand the software better, replicate our results, evaluate new algorithms, experiment with new metrics and datasets, etc. The software is written using the C language on the SUN/UNIX platform and is being made available to researchers at no cost.

[1]  Song Mao,et al.  Empirical performance evaluation of page segmentation algorithms , 1999, Electronic Imaging.

[2]  Emanuele Trucco,et al.  Computer and Robot Vision , 1995 .

[3]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[4]  Song Mao,et al.  A Methodology for Empirical Performance Evaluation of Page Segmentation Algorithms , 1999 .

[5]  Song Mao,et al.  Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Ellen M. Voorhees,et al.  The seventh text REtrieval conference (TREC-7) , 1999 .

[7]  Motoi Iwata,et al.  Segmentation of Page Images Using the Area Voronoi Diagram , 1998, Comput. Vis. Image Underst..

[8]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..

[10]  Mahesh Viswanathan,et al.  A prototype document image analysis system for technical journals , 1992, Computer.

[11]  Stephen V. Rice,et al.  The Fourth Annual Test of OCR Accuracy , 1995 .

[12]  Richard A. Becker,et al.  The New S Language , 1989 .

[13]  Tapas Kanungo,et al.  TRUEVIZ: a groundtruth/metadata editing and visualizing toolkit for OCR , 2000, IS&T/SPIE Electronic Imaging.

[14]  Song Mao,et al.  Automatic training of page segmentation algorithms: an optimization approach , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[15]  Jeremy J. Foster Data analysis using SPSS for Windows : a beginner's guide , 1998 .