A First Summarization System of a Video in a Target Language

In this paper, we present the first results of the project AMIS (Access Multilingual Information opinionS), funded by Chist-Era. The main goal of this project is to help users understand the content of a video in a foreign language. In this work, we define understanding as the ability to capture the most important ideas contained in media expressed in a foreign language. In other words, understanding is approached through the global meaning of the content rather than through the meaning of each fragment of a video.
