An Innovative SIFT-Based Method for Rigid Video Object Recognition

This paper presents an innovative SIFT-based method for rigid video object recognition (hereafter called RVO-SIFT). Just like what happens in the vision system of human being, this method makes the object recognition and feature updating process organically unify together, using both trajectory and feature matching, and thereby it can learn new features not only in the training stage but also in the recognition stage, which can improve greatly the completeness of the video object’s features automatically and, in turn, increases the ratio of correct recognition drastically. The experimental results on real video sequences demonstrate its surprising robustness and efficiency.

[1]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[2]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Long Quan,et al.  Self-calibration of an affine camera from multiple views , 1996, International Journal of Computer Vision.

[4]  Aabha Jain,et al.  Video Object Tracking based on Automatic Background Segmentation and updating using RBF neural network , 2013 .

[5]  Jie Yu,et al.  Distinguishing moving objects from traffic video by the dynamic background skeleton based model , 2013, 2013 International Conference on Communications, Circuits and Systems (ICCCAS).

[6]  Gary R. Bradski,et al.  Peripheral-Foveal Vision for Real-time Object Recognition and Tracking in Video , 2007, IJCAI.

[7]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[8]  Gian Luca Foresti,et al.  Object recognition and tracking for remote video surveillance , 1999, IEEE Trans. Circuits Syst. Video Technol..

[9]  David G. Lowe,et al.  Local feature view clustering for 3D object recognition , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[12]  Anil K. Jain,et al.  Object localization using color, texture and shape , 1997, Pattern Recognit..

[13]  Xing Xie,et al.  Salient Region Detection Using Weighted Feature Maps Based on the Human Visual Attention Model , 2004, PCM.

[14]  Arnold W. M. Smeulders,et al.  Color Based Object Recognition , 1997, ICIAP.

[15]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[16]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[17]  Pat Hanrahan,et al.  A realistic camera model for computer graphics , 1995, SIGGRAPH.

[18]  Sebastian Thrun,et al.  Unsupervised learning of invariant features using video , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  David G. Lowe,et al.  Shape indexing using approximate nearest-neighbour search in high-dimensional spaces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Ameesh Makadia,et al.  Feature Tracking for Wide-Baseline Image Retrieval , 2010, ECCV.

[21]  H. H. Chen,et al.  Video Object Extraction via MRF-Based Contour Tracking , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Anil K. Jain,et al.  Object localization using color, texture and shape , 2000, Pattern Recognit..