A Comparative Study of Object Tracking using CNN and SDAE

Object tracking which refers to automatic estimation of the trajectory is a challenging problem. To track the object robustly and efficiently, we explored an autonomous object tracking methodological framework that adopts the deep learning architectures, specifically the convolutional neural network (CNN) and the stacked denoising autoencoder (SDAE), as opposed to the most frequently used tracking algorithms that only learn the appearance of the tracked object. Moreover, we conduct a comparative study of both approaches in terms of tracking accuracy and efficiency. The results show that the features learned by both CNN and SDAE are very supportive in object tracking problem and the detailed comparisons are demonstrated in this work.

[1]  Gerald Penn,et al.  Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[2]  Dit-Yan Yeung,et al.  Learning a Deep Compact Image Representation for Visual Tracking , 2013, NIPS.

[3]  Anamitra Makur,et al.  Robust Object Tracking with Radial Basis Function Networks , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[4]  Shai Avidan,et al.  Ensemble Tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Anamitra Makur,et al.  Online adaptive radial basis function networks for robust object tracking , 2010, Comput. Vis. Image Underst..

[6]  Jürgen Schmidhuber,et al.  Multi-column deep neural network for traffic sign classification , 2012, Neural Networks.

[7]  Yasser Baleghi,et al.  Improved Object Tracking Using Radial Basis Function Neural Networks , 2011, 2011 7th Iranian Conference on Machine Vision and Image Processing.

[8]  Rina Dechter,et al.  Learning While Searching in Constraint-Satisfaction-Problems , 1986, AAAI.

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[11]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[12]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.