Camera-based document image retrieval system using local features - comparing SRIF with LLAH, SIFT, SURF and ORB

In this paper, we present camera-based document retrieval systems using various local features as well as various indexing methods. We employ our recently developed features, named Scale and Rotation Invariant Features (SRIF), which are computed based on geometrical constraints between pairs of nearest points around a keypoint. We compare SRIF with state-of-the-art local features. The experimental results show that SRIF outperforms the state-of-the-art in terms of retrieval time with 90.8% retrieval accuracy.

[1]  Masakazu Iwamura,et al.  Real-Time Document Image Retrieval on a Smartphone , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[2]  T. Nakai,et al.  Hashing with Local Combinations of Feature Points and Its Application to Camera-Based Document Image Retrieval — Retrieval in 0 . 14 Second from 10 , 000 Pages — , 2005 .

[3]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[4]  Su Yang,et al.  Symbol Recognition via Statistical Integration of Pixel-Level Constraint Histograms: A New Descriptor , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  David S. Doermann,et al.  Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[6]  Masakazu Iwamura,et al.  Camera Based Document Image Retrieval with More Time and Memory Efficient LLAH , 2008 .

[7]  Haim J. Wolfson,et al.  Geometric hashing: an overview , 1997 .

[8]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[9]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[10]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[11]  H. Pedrini,et al.  Dimensionality reduction through PCA over SIFT and SURF descriptors , 2013, 2012 IEEE 11th International Conference on Cybernetic Intelligent Systems (CIS).

[12]  Josep Lladós,et al.  Spotting Graphical Symbols in Camera-Acquired Documents in Real Time , 2013, GREC.

[13]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[14]  Zhe Wang,et al.  Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search , 2007, VLDB.

[15]  Masakazu Iwamura,et al.  Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH , 2011, 2011 International Conference on Document Analysis and Recognition.

[16]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[17]  Masakazu Iwamura,et al.  Real-Time Retrieval for Images of Documents in Various Languages Using a Web Camera , 2009, 2009 10th International Conference on Document Analysis and Recognition.