Using Clustered Frames to Classify Videos

This paper presents a semi-supervised learning technique to classify video clips. Usually, many tasks are done by categorizing video clips using deep learning techniques. However, based on the number of online videos today, it is necessary to use high computing power to accomplish this task. The authors propose methods that use Self-Organizing Map (SOM) to create a feature space representing clusters of video frames. The authors then classified them using simple voting, calculating entropy, neural networks, and Long-Short Term Memory (LSTM). The researchers also show finding frame numbers that are used to cluster video frames according to accuracy and training time. The results of this approach are presented based on testing 18 specific classes of real-world datasets from TV-programs containing 912 videos. The authors evaluated the techniques using five-fold cross-validation that our method archived 71.98% of average accuracy. Their computing time was then assessed, which achieved approximately 40 minutes of average computing time. Moreover, the researchers also compared the present proposal to other baseline models, including C3D and CNN-LSTM, and also used scene and action-recognition datasets, namely Hollywood2 to evaluate the technique. The authors archived 93.72% of average accuracy.

[1]  Shaharyar Kamal,et al.  Dense RGB-D Map-Based Human Tracking and Activity Recognition using Skin Joints Features and Self-Organizing Map , 2015, KSII Trans. Internet Inf. Syst..

[2]  Milos Manic,et al.  Parallalizable deep self-organizing maps for image classification , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[3]  Yogi Ramadhani,et al.  Characteristics Identification of Myeloblast Cell Using K-Means Clustering for Uncontrolled Images , 2019, International Journal of Machine Learning and Computing.

[4]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[5]  Cordelia Schmid,et al.  Actions in context , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Mubarak Shah,et al.  UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.

[7]  Apostol Natsev,et al.  YouTube-8M: A Large-Scale Video Classification Benchmark , 2016, ArXiv.

[8]  Sokchoo Ng,et al.  Effect of Neighbourhood Size Selection in SOM-Based Image Feature Extraction , 2019, International Journal of Machine Learning and Computing.

[9]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Xi Wang,et al.  Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification , 2015, ACM Multimedia.

[11]  Silvio Savarese,et al.  Unsupervised Semantic Parsing of Video Collections , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[12]  Noel E. O'Connor,et al.  Simple vs complex temporal recurrences for video saliency prediction , 2019, BMVC.

[13]  Nitish Srivastava,et al.  Exploiting Image-trained CNN Architectures for Unconstrained Video Classification , 2015, BMVC.

[14]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[15]  Jiwen Lu,et al.  Nonlinear subspace clustering for image clustering , 2017, Pattern Recognit. Lett..

[16]  Jun Wang,et al.  Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification , 2014, ACM Multimedia.

[17]  Mznah Al-Rodhaan,et al.  Content-based image retrieval using PSO and k-means clustering algorithm , 2015, Arabian Journal of Geosciences.

[18]  Yambem Jina Chanu,et al.  Image Segmentation Using K -means Clustering Algorithm and Subtractive Clustering Algorithm , 2015 .

[19]  Xuelong Li,et al.  Unsupervised Video Action Clustering via Motion-Scene Interaction Constraint , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Bugao Xu,et al.  Color segmentation of multi-colored fabrics using self-organizing-map based clustering algorithm , 2017 .