Multiclass object recognition using smart phone and cloud computing for augmented reality and video surveillance applications

This paper presents multiclass object classification and recognition using smartphone and cloud computing (client server) technology. Smart phone camera is used as image acquisition device. Smartphone is working as a client and high speed computer act as a server. Our system is a feature based novel approach that requires huge computing power and stand-alone smart phone is not capable for performing the whole task. So we have used the smart phone as image acquisition and rendering device, it is also worked as a client and high speed computer server is used as a major computing unit like a cloud. We have adapted the bag of words approach for training features of multiclass objects with the usage of visual codebooks which are having significant applications in the natural language processing. Our work is mainly focused on classification and recognition of multiclass natural objects which can be utilized either for augmented reality and also video surveillance applications. We use Scale Invariant Feature Transforms (SIFT) for feature extraction. We form visual codebook from the high dimensional feature vectors using clustering algorithm and classify and recognize using naïve Bayes classifier.

[1]  Cordelia Schmid,et al.  An Affine Invariant Interest Point Detector , 2002, ECCV.

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[4]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Quan Wang,et al.  Real-Time Image Matching Based on Multiple View Kernel Projection , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[8]  Bernd Kleinjohann,et al.  Marker-less vision based tracking for mobile augmented reality , 2003, The Second IEEE and ACM International Symposium on Mixed and Augmented Reality, 2003. Proceedings..

[9]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[11]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[12]  Antonio Torralba,et al.  Learning hierarchical models of scenes, objects, and parts , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.