In conventional document retrieval (DIR) systems based on locally likely arrangement hashing (LLAH), the word detection approach is sensitive to the distance between the camera and the text document, e.g. a single word may be detected as several words when the camera is too close. Thus, the systems work well only when the distance in which the text document was registered is similar to the one of the retrieval. Moreover, they were implemented in a desktop setup where it might not suffer from the distance problem since the camera is rigidly attached to the computer. In this paper, a new word segmentation approach is proposed to increase the robustness of LLAH-based DIR systems so that they may be implemented on a mobile platform where the distance between the camera and text document may be easily changeable. The proposed method uses a deep neural network to classify spaces between connected components as between-words space or intra-word space. From experiments results, the proposed method successfully could detect the same words in different camera distances and orientation as the neural networks offered classification accuracy as high as 92.5%. Moreover, it showed higher robustness than the state-of-the-art methods when implemented on a mobile platform.
[1]
Hideo Saito,et al.
On-line document registering and retrieving system for AR annotation overlay
,
2010,
AH.
[2]
Ernest Valveny,et al.
A kernel-based approach to document retrieval
,
2010,
DAS '10.
[3]
Robert M. Haralick,et al.
Textural Features for Image Classification
,
1973,
IEEE Trans. Syst. Man Cybern..
[4]
Masakazu Iwamura,et al.
Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH
,
2011,
2011 International Conference on Document Analysis and Recognition.
[5]
M. S. Shirdhonkar,et al.
Handwritten Document Image Retrieval
,
2012
.
[6]
Alireza Alaei,et al.
A brief review of document image retrieval methods: Recent advances
,
2016,
2016 International Joint Conference on Neural Networks (IJCNN).
[7]
Ernest Valveny,et al.
Large-scale document image retrieval and classification with runlength histograms and binary embeddings
,
2013,
Pattern Recognit..
[8]
C. V. Jawahar,et al.
Image Retrieval Using Textual Cues
,
2013,
2013 IEEE International Conference on Computer Vision.