Document image analysis: A primer

Document image analysis refers to algorithms and techniques that are applied to images of documents to obtain a computer-readable description from pixel data. A well-known document image analysis product is the Optical Character Recognition (OCR) software that recognizes characters in a scanned document. OCR makes it possible for the user to edit or search the document’s contents. In this paper we briefly describe various components of a document analysis system. Many of these basic building blocks are found in most document analysis systems, irrespective of the particular domain or language to which they are applied. We hope that this paper will help the reader by providing the background necessary to understand the detailed descriptions of specific techniques presented in other papers in this issue.

[1]  Rangachar Kasturi,et al.  Generation Of A Line Description File For Graphics Recognition , 1988, Defense, Security, and Sensing.

[2]  Wen-Yen Wu,et al.  Detecting the Dominant Points by the Curvature-Based Polygonal Approximation , 1993, CVGIP Graph. Model. Image Process..

[3]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Akshar Bharati,et al.  Panel: Computational Linguistics in India: An Overview , 2000, ACL.

[5]  G. Medioni,et al.  Corner detection and curve representation using cubic B-splines , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[6]  Gabriella Sanniti di Baja,et al.  Euclidean skeleton via centre-of-maximal-disc extraction , 1993, Image Vis. Comput..

[7]  A FletcherLloyd,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988 .

[8]  Azriel Rosenfeld,et al.  A method of detecting the orientation of aligned components , 1986, Pattern Recognit. Lett..

[9]  Lawrence O'Gorman,et al.  Document Image Analysis , 1996 .

[10]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[11]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[12]  T. Pavlidis Algorithms for Graphics and Image Processing , 1981, Springer Berlin Heidelberg.

[13]  Ching Y. Suen,et al.  Thinning Methodologies - A Comprehensive Survey , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Norihiro Hagita,et al.  Text-Line Extraction and Character Recognition of Document Headlines With Graphical Designs Using Complementary Similarity Measure , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  P.K Sahoo,et al.  A survey of thresholding techniques , 1988, Comput. Vis. Graph. Image Process..

[17]  Rainer Hoch,et al.  From paper to office document standard representation , 1992, Computer.

[18]  Wen-Hsiang Tsai,et al.  Moment-preserving thresholding: a new approach , 1995 .

[19]  Gabriella Sanniti di Baja Well-Shaped, Stable, and Reversible Skeletons from the (3, 4)-Distance Transform , 1994, J. Vis. Commun. Image Represent..

[20]  H. R. Keshavan,et al.  An optimal multiple threshold scheme for image segmentation , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  Gérard G. Medioni,et al.  Corner detection and curve representation using cubic B-splines , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[22]  Henry S. Baird,et al.  The skew angle of printed documents , 1995 .

[23]  Larry D. Hostetler,et al.  k-nearest-neighbor Bayes-risk estimation , 1975, IEEE Trans. Inf. Theory.

[24]  Larry S. Davis,et al.  A Corner-Finding Algorithm for Chain-Coded Curves , 1977, IEEE Transactions on Computers.

[25]  Xinhua Zhuang,et al.  Image Analysis Using Mathematical Morphology , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  S. V. Rice A report on the accuracy of OCR devices , 1992 .

[27]  Wen-Hsiang Tsai,et al.  Moment-preserving thresolding: A new approach , 1985, Comput. Vis. Graph. Image Process..

[28]  L. O'Gorman Image and document processing techniques for the RightPages electronic library system , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[29]  Rama Chellappa,et al.  Design, Integration, and Evaluation of Form-Based Handprint and OCR Systems | NIST , 1996 .

[30]  Murray Hill,et al.  Curvilinear Feature Detection from Curvature Estimation , 1988 .

[31]  Michael D. Garris,et al.  Form Design for High Accuracy Optical Character Recognition , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[33]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[34]  Lawrence O'Gorman Binarization and Multithresholding of Document Images Using Connectivity , 1994, CVGIP Graph. Model. Image Process..

[35]  Gérard G. Medioni,et al.  Corner detection and curve representation using cubic B-splines , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[36]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..

[37]  Lawrence O'Gorman,et al.  K × K Thinning , 1990, Comput. Vis. Graph. Image Process..

[38]  Herbert Freeman,et al.  Computer Processing of Line-Drawing Images , 1974, CSUR.

[39]  A. Lawrence Spitz,et al.  Determination of the Script and Language Content of Document Images , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Gabriella Sanniti di Baja,et al.  A Width-Independent Fast Thinning Algorithm , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Urs Ramer,et al.  An iterative procedure for the polygonal approximation of plane curves , 1972, Comput. Graph. Image Process..

[42]  Ching Y. Suen,et al.  An Evaluation of Parallel Thinning Algorithms for Character Recognition , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Øivind Due Trier,et al.  Evaluation of Binarization Methods for Document Images , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.