Online Visual Vocabularies

The idea of an online visual vocabulary is proposed. In contrast to the accepted strategy of generating vocabularies offline, using the k-means clustering over all the features extracted form all the images in a dataset, an online vocabulary is dynamic and evolves iteratively over time as new observations are made. Hence, it is much more suitable for online robotic applications, such as exploration, landmark detection, and SLAM, where the future is unknown. We present two different strategies for building online vocabularies. The first strategy produces a vocabulary, which optimizes the k-centres objective of minimizing the maximum distance of a a feature from the closest vocabulary word. The second strategy produces a vocabulary by randomly sampling from the current vocabulary and the features in the current observation. We show that both the algorithms are able to produce distance matrices which have positive rank correlation with distance matrices computed using an offline k-means vocabulary. We discover that the online random vocabulary is consistently effective at approximating the behaviour of the offline k-means vocabulary, at least for the moderate sized datasets we examine.

[1]  C. F. Kossack,et al.  Rank Correlation Methods , 1949 .

[2]  Dmitriy Fradkin,et al.  Experiments with random projections for machine learning , 2003, KDD '03.

[3]  김재호,et al.  SURF(Speeded Up Robust Features)와 Kalman Filter를 이용한 컬러 객체 추적 방법의 제안 , 2012 .

[4]  George L. Nemhauser,et al.  Easy and hard bottleneck location problems , 1979, Discret. Appl. Math..

[5]  W. Hoeffding,et al.  Rank Correlation Methods , 1949 .

[6]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[8]  Vincent Lepetit,et al.  View-based Maps , 2010, Int. J. Robotics Res..

[9]  Andrew Zisserman,et al.  Video Google: Efficient Visual Search of Videos , 2006, Toward Category-Level Object Recognition.

[10]  Gregory Dudek,et al.  Online navigation summaries , 2010, 2010 IEEE International Conference on Robotics and Automation.

[11]  Frank Dellaert,et al.  Bayesian surprise and landmark detection , 2009, 2009 IEEE International Conference on Robotics and Automation.

[12]  Teofilo F. GONZALEZ,et al.  Clustering to Minimize the Maximum Intercluster Distance , 1985, Theor. Comput. Sci..

[13]  Gregory Dudek,et al.  ONSUM: A system for generating online navigation summaries , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .