In this work, an approach for Arabic handwriting word segmentation is proposed. In this approach words are over-segmented and the segmentation points (SPs) are then validated. As the validation stage accuracy controls the whole system accuracy, an improved validation approach is proposed to alleviate other approaches' limitations and enhances the accuracy. In this validation approach, a set of zoning features are extracted and used to train an efficient Random Forests (RF) ensemble of classifiers. These features are considered here due to their strength in capturing local as well as global characteristics of handwritten characters. The proposed approach is tested using 500 words from the standard IFN/ENIT database. Additionally, its accuracy is compared against one of the recent and efficient approaches which utilizes the modified directional features (MDF) and neural network classifier. These results prove the accuracy of the proposed approach and its ability to alleviate the limitations found in the previous techniques.
[1]
Noura A. Semary,et al.
Isolated Printed Arabic Character Recognition Using KNN and Random Forest Tree Classifiers
,
2014,
AMLTA.
[2]
Ashraf Elnagar,et al.
A Multi-Agent Approach to Arabic Handwritten Text Segmentation
,
2012
.
[3]
Ashraf B. Elsisi,et al.
An Enhanced Technique for Offline Arabic Handwritten Words Segmentation
,
2015,
CICLing.
[4]
Dinesh Dileep Gaurav,et al.
A feature extraction technique based on character geometry for character recognition
,
2012,
ArXiv.
[5]
Raed Abu Zitar,et al.
Development of an efficient neural-based segmentation technique for Arabic handwriting recognition
,
2010,
Pattern Recognit..
[6]
Zubair A. Shaikh,et al.
Character Segmentation of Sindhi, an Arabic Style Scripting Language, using Height Profile Vector
,
2009
.