Spatial image polynomial decomposition with application to video classification

Abstract. This paper addresses the use of orthogonal polynomial basis transform in video classification due to its multiple advantages, especially for multiscale and multiresolution analysis similar to the wavelet transform. In our approach, we benefit from these advantages to reduce the resolution of the video by using a multiscale/multiresolution decomposition to define a new algorithm that decomposes a color image into geometry and texture component by projecting the image on a bivariate polynomial basis and considering the geometry component as the partial reconstruction and the texture component as the remaining part, and finally to model the features (like motion and texture) extracted from reduced image sequences by projecting them into a bivariate polynomial basis in order to construct a hybrid polynomial motion texture video descriptor. To evaluate our approach, we consider two visual recognition tasks, namely the classification of dynamic textures and recognition of human actions. The experimental section shows that the proposed approach achieves a perfect recognition rate in the Weizmann database and highest accuracy in the Dyntex++ database compared to existing methods.

[1]  Md Atiqur Rahman Ahad,et al.  Action recognition algorithm based on optical flow and RANSAC in frequency domain , 2011, SICE Annual Conference 2011.

[2]  Lihi Zelnik-Manor,et al.  Event-based analysis of video , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Christine Fernandez-Maloigne,et al.  Motion Estimation in Color Image Sequences , 2012 .

[4]  Narendra Ahuja,et al.  Maximum Margin Distance Learning for Dynamic Texture Recognition , 2010, ECCV.

[5]  R. El Moubtahij,et al.  A polynomial texture extraction with application in dynamic texture classification , 2015, International Conference on Quality Control by Artificial Vision.

[6]  Martin Druon Modélisation du mouvement par polynômes orthogonaux : application à l'étude d'écoulements fluides , 2009 .

[7]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[8]  Alberto Del Bimbo,et al.  Multi-scale and real-time non-parametric approach for anomaly detection and localization , 2012, Comput. Vis. Image Underst..

[9]  David Picard,et al.  Local polynomial space–time descriptors for action classification , 2016, Machine Vision and Applications.

[10]  Tony F. Chan,et al.  Structure-Texture Image Decomposition—Modeling, Algorithms, and Parameter Selection , 2006, International Journal of Computer Vision.

[11]  Li-min Xia,et al.  The Complex Action Recognition via the Correlated Topic Model , 2014, TheScientificWorldJournal.

[12]  Stefano Soatto,et al.  Dynamic Textures , 2003, International Journal of Computer Vision.

[13]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[14]  Saeed Sadri,et al.  A new efficient method to characterize dynamic textures based on a two-phase texture and dynamism analysis , 2014, Pattern Recognit. Lett..

[15]  Krystian Mikolajczyk,et al.  Action recognition with motion-appearance vocabulary forest , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Ling Shao,et al.  A Wavelet Based Local Descriptor for Human Action Recognition , 2010, BMVC.

[17]  Matti Pietikäinen,et al.  Automatic Dynamic Texture Segmentation Using Local Descriptors and Optical Flow , 2013, IEEE Transactions on Image Processing.

[18]  Yves Meyer,et al.  Oscillating Patterns in Image Processing and Nonlinear Evolution Equations: The Fifteenth Dean Jacqueline B. Lewis Memorial Lectures , 2001 .

[19]  Loris Nanni,et al.  High Performance Set of Features for Human Action Classification , 2009, IPCV.

[20]  Cordelia Schmid,et al.  A Spatio-Temporal Descriptor Based on 3D-Gradients , 2008, BMVC.

[21]  Ying-Ke Lei,et al.  Face recognition via Weighted Sparse Representation , 2013, J. Vis. Commun. Image Represent..

[22]  Mark J. Huiskes,et al.  DynTex: A comprehensive database of dynamic textures , 2010, Pattern Recognit. Lett..

[23]  Jean-Michel Morel,et al.  Cartoon+Texture Image Decomposition , 2011, Image Process. Line.

[24]  Brian C. Lovell,et al.  Directional Space-Time Oriented Gradients for 3D Visual Pattern Analysis , 2012, ECCV.

[25]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[26]  H. Hashimoto,et al.  Human action recognition using wavelet signal analysis as an input in 4W1H , 2010, 2010 8th IEEE International Conference on Industrial Informatics.

[27]  Christine Fernandez-Maloigne,et al.  Vectorial Computation of the Optical Flow in Color Image Sequences , 2005, Color Imaging Conference.

[28]  Xudong Jiang,et al.  Dynamic texture recognition using enhanced LBP features , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[29]  Matti Pietikäinen,et al.  Human Activity Recognition Using a Dynamic Texture Based Method , 2008, BMVC.

[30]  Mubarak Shah,et al.  A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.

[31]  Yong Xu,et al.  Dynamic texture classification using dynamic fractal analysis , 2011, 2011 International Conference on Computer Vision.

[32]  Brian C. Lovell,et al.  Discriminative Non-Linear Stationary Subspace Analysis for Video Classification , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[34]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Volume Local Binary Patterns , 2006, WDV.

[35]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Sloven Dubois,et al.  Indexation de Textures Dynamiques à l'aide de Décompositions Multi-échelles , 2012 .

[37]  S. Suzuki,et al.  Feature extraction of temporal texture based on spatiotemporal motion trajectory , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[38]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[39]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[40]  G. Aubert,et al.  Image decomposition: application to textured images and SAR images , 2003 .

[41]  ANTONIN CHAMBOLLE,et al.  An Algorithm for Total Variation Minimization and Applications , 2004, Journal of Mathematical Imaging and Vision.

[42]  Michel Ménard,et al.  Analyse de Textures Dynamiques par décompositions spatio-temporelles: application à l'estimation du mouvement global , 2010 .

[43]  Somayeh Danafar,et al.  Action Recognition for Surveillance Applications Using Optic Flow and SVM , 2007, ACCV.