论文信息 - An Enhanced ANN-HMM based classification of video recordings with the aid of audio-visual feature extraction

An Enhanced ANN-HMM based classification of video recordings with the aid of audio-visual feature extraction

INTRODUCTION: As an essential part of life, the use of the Internet has increased exponentially. This rising Internet bandwidth speed has made video data transmission a more popular and modern form of information exchange. For classification of video date files there is a requirement of human efforts.Also for reducing the rate of clutter in video data on Internet, a suitable automatic video classification method is required. OBJECTIVES: In this work, we tried to find a successful model for video classification. METHODS: To make a successful model we use different schemes of visual and audio data analysis. On the other hand we choose some music, traffic and sports videos for different analysis. The model is based on Hidden Markov model (HMM) and Artificial neural network (ANN) classifiers.In order to gather the final results, we developed an “enhanced ANN-HMM based” model. RESULTS: Our approach attained an average of 90% success rate among all three classification classes. CONCLUSION: In aim of this work is to categorize and caption the videos automatically.Here we proposed an enhanced HMMANN based classification of video recordings with the aid of audio visual feature extraction.

[1] Thomas Sikora,et al. Sound Classification and Similarity , 2006 .

[2] Omar Bouattane,et al. An Efficient Audio Classification Approach Based on Support Vector Machines , 2016 .

[3] Francesco Camastra,et al. Machine Learning for Audio, Image and Video Analysis , 2015, Advanced Information and Knowledge Processing.

[4] Kiyoharu Aizawa,et al. Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30 - December 3, 2004, Proceedings, Part I , 2005, Pacific Rim Conference on Multimedia.

[5] Mahesh S. Chavan,et al. Channel Robust MFCCs for Continuous Speech Speaker Recognition , 2014, SIRS.

[6] Chong-Wah Ngo,et al. Motion characterization by temporal slices analysis , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[7] Virgílio A. F. Almeida,et al. Characterizing Videos, Audience and Advertising in Youtube Channels for Kids , 2017, SocInfo.

[8] Andrea Cavallaro,et al. Trajectory Clustering for Scene Context Learning and Outlier Detection , 2010, Video Search and Mining.

[9] B. S. Manjunath,et al. Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[10] Jana Eggink,et al. Automatic classification of personal video recordings based on audiovisual features , 2015, Knowl. Based Syst..

[11] M. Ghanbari,et al. Scene content classification from MPEG coded bit streams , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).