Document segmentation using textural features summarization and feedforward neural network

Document segmentation is a process that aims to filter documents by identifying regions of interest. Generally, these regions are text, graphics (image-occupied regions), and the background. This paper presents a novel top-down approach to document segmentation that uses texture features extracted from the selected documents. A mask of suitable size is used to summarize textural features and statistical parameters over blocks of the document image. Four textural features are extracted from each mask using the gray level co-occurrence matrix (GLCM): entropy, contrast, energy, and homogeneity. In addition, two statistical parameters are extracted from the same mask: the mode and the median of the pixel values. Together, these attributes allow each mask or block to be classified as text, graphics, or background. A feedforward neural network is trained on the six extracted attributes using documents obtained from a public database; an error rate of 15.77% is achieved. The results show that the approach gives promising performance in segmenting documents, and it is expected to be efficient for content-based information retrieval systems. Detection of duplicate documents within large databases is another potential area of application.
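The six per-block attributes described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`iter_blocks`, `glcm`, `block_features`), the non-overlapping tiling, the single horizontal offset, the number of gray levels, and the exact feature formulas (energy as the sum of squared GLCM entries, homogeneity weighted by 1 + |i - j|) are all assumptions made for illustration, since the abstract does not specify them. Pixel values are assumed to be already quantized to integers in `[0, levels)`.

```python
import math
from collections import Counter
from statistics import median


def iter_blocks(image, size):
    """Yield (row, col, block) for non-overlapping size x size tiles (assumed tiling)."""
    for r in range(0, len(image) - size + 1, size):
        for c in range(0, len(image[0]) - size + 1, size):
            yield r, c, [row[c:c + size] for row in image[r:r + size]]


def glcm(block, levels, dx=1, dy=0):
    """Normalized gray level co-occurrence matrix for the pixel offset (dx, dy)."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(block), len(block[0])
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[block[r][c]][block[r2][c2]] += 1
                total += 1
    if total == 0:
        raise ValueError("block is too small for the chosen offset")
    return [[n / total for n in row] for row in counts]


def block_features(block, levels=8):
    """Return [entropy, contrast, energy, homogeneity, mode, median] for one block."""
    p = glcm(block, levels)
    entropy = contrast = energy = homogeneity = 0.0
    for i, row in enumerate(p):
        for j, pij in enumerate(row):
            if pij > 0:
                entropy -= pij * math.log2(pij)
            contrast += pij * (i - j) ** 2
            energy += pij ** 2                      # assumed: sum of squared entries
            homogeneity += pij / (1 + abs(i - j))   # assumed: 1 + |i - j| weighting
    pixels = [v for row in block for v in row]
    freq = Counter(pixels)
    # Most frequent pixel value; ties are broken toward the smaller value.
    mode = min(freq, key=lambda v: (-freq[v], v))
    return [entropy, contrast, energy, homogeneity, mode, median(pixels)]


# Usage: one 6-element feature vector per block of a small, already quantized image.
image = [[3] * 4 for _ in range(4)]
features = [block_features(b) for _, _, b in iter_blocks(image, 2)]
```

Each resulting 6-element vector would then be the input to a classifier such as the feedforward network mentioned in the abstract, with one output per class (text, graphics, background).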
