We propose a combined method to segment the images of scanned documents, which, in contrast to known methods, implies a preliminary separation of the graphics and photograph regions from the text regions and a background. In this case, an analysis of the connected components is performed, which are different for graphics, photographs, and text regions. In order to classify the selected regions into the photograph and graphics regions, a block method is employed. It was established that such a technique for splitting the regions into blocks less affects the quality of segmentation when compared to applying the block method directly to the original image. To extract the text regions that are more complex in their shape from the background, the neighborhood of each pixel was processed. To detect the boundaries of illustrations on the images of scanned documents, we applied the bloomberg method. In order to classify into photographs and graphics, it is proposed to split an illustration into blocks of pixels. Each block of pixels is identified with a vector of two features: the mean value of the local gradient magnitude, and the mean value of the function that localizes at the images of scanned documents the linear objects (graphics and text characters). The derived feature vectors were classified using a support vector machine. When extracting the text regions, we applied a low-frequency filtering and a thresholding. The combined method was implemented in practice to segment the test images of scanned newspaper articles from the document database mediateam at oulu university (finland). It was established that the combined method is characterized by an increase in performance speed during image segmentation at high quality processing.
[1]
Alesya Ishchenko,et al.
Document image segmentation using averaging filtering and mathematical morphology
,
2018,
2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET).
[2]
M. K. Kundu,et al.
A new approach for segmentation of image and text in natural and commercial color documents
,
2012,
2012 International Conference on Communications, Devices and Intelligent Systems (CODIS).
[3]
Charles A. Bouman,et al.
Text Segmentation for MRC Document Compression
,
2011,
IEEE Transactions on Image Processing.
[4]
Peter Bauer,et al.
Text, photo, and line extraction in scanned documents
,
2012,
J. Electronic Imaging.
[5]
Driss Mammass,et al.
A Document Image Segmentation System Using Analysis of Connected Components
,
2013,
2013 12th International Conference on Document Analysis and Recognition.
[6]
N. Otsu.
A threshold selection method from gray level histograms
,
1979
.
[7]
S. Keerthi,et al.
A general formulation for support vector machines
,
2002,
Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..
[8]
D. Sasirekha,et al.
Enhanced Techniques for PDF Image Segmentation and Text Extraction
,
2012,
ArXiv.
[9]
Syed Saqib Bukhari,et al.
Improved document image segmentation algorithm using multiresolution morphology
,
2011,
Electronic Imaging.
[10]
Alejandro F. Frangi,et al.
Muliscale Vessel Enhancement Filtering
,
1998,
MICCAI.