Generation of description metadata for video files

Automatic Metadata Generation in the context of e-learning standards is usually referred to algorithms able to process and annotate semi structured documents in plain text. As most of the information available on the web nowadays is unstructured and in the form of multimedia files, the need for more general approaches arises. We propose an automatic metadata generation procedure that allows to label specific unstructured data (video lectures) with metadata compliant to the Learning Object Metadata standard. After preprocessing, three different summarization algorithms are tested and used to obtain a synthetic description of video content, both in terms of Description and Title. Results show that, in the provided context, the obtained Description has a good agreement with the lesson abstract written by its author.

[1]  Jun Miyazaki,et al.  An Automatic and Immediate Metadata Extraction Method by Heterogeneous Sensors for Meeting Video Streams , 2005, 2005 Symposium on Applications and the Internet Workshops (SAINT 2005 Workshops).

[2]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[3]  Jiawei Han,et al.  Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions , 2010, COLING.

[4]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[5]  Claudio Gennaro,et al.  Automatic metadata extraction and indexing for reusing e-learning multimedia objects , 2007, MS '07.

[6]  Inderjeet Mani,et al.  The Challenges of Automatic Summarization , 2000, Computer.

[7]  Dipanjan Das Andr,et al.  A Survey on Automatic Text Summarization , 2007 .

[8]  V. A. Yatsko,et al.  A method for evaluating modern systems of automatic text summarization , 2007, Automatic Documentation and Mathematical Linguistics.

[9]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[10]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[11]  Yoshihiko Hayashi,et al.  Speech-based and video-supported indexing of multimedia broadcast news , 2003, SIGIR '03.

[12]  Christoph Meinel,et al.  Towards to an automatic semantic annotation for multimedia learning objects , 2007, Emme '07.

[13]  Alfredo Petrosino,et al.  Integrating Navigational and Structural Information in SCORM Content Aggregation Modeling , 2012, 2012 IEEE 12th International Conference on Advanced Learning Technologies.

[14]  Matthias Jarke,et al.  ALOA: A Web Services Driven Framework for Automatic Learning Object Annotation , 2008, EC-TEL.

[15]  Alfredo Petrosino,et al.  Automatic generation of SCORM compliant metadata for portable document format files , 2012, CompSysTech '12.

[16]  Udo Winand,et al.  The sharable content object reference model (SCORM) - a critical review , 2002, International Conference on Computers in Education, 2002. Proceedings..

[17]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[18]  Rafeeq Al-Hashemi Text Summarization Extraction System (TSES) Using Extracted Keywords , 2010, Int. Arab. J. e Technol..