Segmentation of Handwritten Text Using Bacteria Foraging Optimization

Text collected for digitization in the form of handwritten paragraphs is difficult to segment into individual characters. Identification of whole words is more difficult, so the idea is to segment the word into individual characters. These characters will be joined to make artificial words. Actual words may be predicted from these artificial words by any word prediction algorithm. A novel bacteria foraging optimization-based pixel model has been used to optimally segment all valid characters from the word. The bacteria colonies arrange in a specific shape determining a character. The offspring bacterial colonies produce the candidate segmented characters. The unhealthy colonies from the candidate segmented characters are eliminated deducing the pruned offspring bacterial colonies identified as optimal segmented characters. Paragraphs written by more than 50 subjects have been segmented with higher character segmentation accuracy.

[1]  Noman Islam,et al.  A Survey on Optical Character Recognition System , 2017, ArXiv.

[2]  Salvador España Boquera,et al.  Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Robert Sablatnig,et al.  Writer Retrieval and Writer Identification Using Local Features , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[4]  Horst Bunke,et al.  Text line segmentation and word recognition in a system for general writer independent handwriting recognition , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[5]  Jorge J. Villalón An eMarking Tool for Paper Based Evaluations , 2012, 2012 IEEE 12th International Conference on Advanced Learning Technologies.

[6]  Horst Bunke,et al.  Handwritten sentence recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[7]  Madasu Hanmandlu,et al.  Fuzzy Model Based Recognition of Handwritten Hindi Numerals using Bacterial Foraging , 2007, 6th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2007).

[8]  Nei Kato,et al.  A Handwritten Character Recognition System Using Directional Element Feature and Asymmetric Mahalanobis Distance , 1999, IEEE Trans. Pattern Anal. Mach. Intell..