论文信息 - Learning-based word segmentation for reliable text document retrieval and augmentation

Learning-based word segmentation for reliable text document retrieval and augmentation

Imagine that one may have access to a part of a text document, say a page, and from that would want to identify the document to which it belongs. In such cases, there is a need to perform a content-based document retrieval in a large database.

Hanhoon Park | Jean-Pierre Lomaliza

[1] Masakazu Iwamura,et al. Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH , 2011, 2011 International Conference on Document Analysis and Recognition.

[2] Masakazu Iwamura,et al. Use of Affine Invariants in Locally Likely Arrangement Hashing for Camera-Based Document Image Retrieval , 2006, Document Analysis Systems.

[3] Hideo Saito,et al. Augmenting text document by on-line learning of local arrangement of keypoints , 2009, 2009 8th IEEE International Symposium on Mixed and Augmented Reality.

[4] Jong-Il Park,et al. stAR: visualizing constellations with star retrieval , 2011, SA '11.