A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content

Analysts and journalists face the problem of having to deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. Automated and simplified editorial and authoring process could significantly reduce time, labor, and costs. Therefore, there is a need for unified access to multilingual and multicultural news story material, beyond the level of a nation, ensuring context-aware, spatiotemporal, and semantic interpretation, correlating also and summarizing the interpreted material into a coherent gist. In this paper, we present a platform integrating multimodal analytics techniques, which are able to support journalists in handling large streams of real-time and diverse information. Specifically, the platform automatically crawls and indexes multilingual and multimedia information from heterogeneous resources. Textual information is automatically summarized and can be translated (on demand) into the language of the journalist. High-level information is extracted from both textual and multimedia content for fast inspection using concept clouds. The textual and multimedia content is semantically integrated and indexed using a common representation, to be accessible through a web-based search engine. The evaluation of the proposed platform was performed by several groups of journalists revealing satisfaction from the user side.

[1]  Mor Naaman,et al.  Diamonds in the rough: Social media visual analytics for journalistic inquiry , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[2]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3]  Leo Wanner,et al.  Towards Multilingual Natural Language Generation Within Abstractive Summarization , 2016, International Conference of the Catalan Association for Artificial Intelligence.

[4]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[5]  Meng Wang,et al.  Event analysis in social multimedia: a survey , 2016, Frontiers of Computer Science.

[6]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[7]  Sophia Ananiadou,et al.  The C-value/NC-value Method of Automatic Recognition for Multi-Word Terms , 1998, ECDL.

[8]  Yiannis Kompatsiaris,et al.  A Hybrid Framework for News Clustering Based on the DBSCAN-Martingale and LDA , 2016, MLDM.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Yun Zhu,et al.  Support vector machines and Word2vec for text classification with semantic features , 2015, 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC).

[11]  Chiara Francalanci,et al.  Influence-based Twitter browsing with NavigTweet , 2017, Inf. Syst..

[12]  Lijun Liu,et al.  An Efficient Method for Document Categorization Based on Word2vec and Latent Semantic Analysis , 2015, 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing.

[13]  Carol M. Barnum Usability Testing Essentials , 2011 .

[14]  Lee Gillam,et al.  University of Surrey Participation in TREC8: Weirdness Indexing for Logical Document Extrapolation and Retrieval (WILDER) , 1999, TREC.

[15]  MIGUEL BALLESTEROS,et al.  Data-driven deep-syntactic dependency parsing† , 2015, Natural Language Engineering.

[16]  Alon Lavie,et al.  Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems , 2011, WMT@EMNLP.

[17]  Daniel A. Keim,et al.  A Survey on Visual Analytics of Social Media Data , 2016, IEEE Transactions on Multimedia.

[18]  Fernando Batista,et al.  MISNIS: An intelligent platform for twitter topic mining , 2017, Expert Syst. Appl..

[19]  Georg Heigold,et al.  The RWTH aachen university open source speech recognition system , 2009, INTERSPEECH.

[20]  Neil Hurley,et al.  Hashtagger+: Efficient High-Coverage Social Tagging of Streaming News , 2018, IEEE Transactions on Knowledge and Data Engineering.

[21]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[22]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[23]  Hermann Ney,et al.  The RWTH large vocabulary continuous speech recognition system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[24]  Hiroaki Sato,et al.  The FrameNet Database and Software Tools , 2002, LREC.

[25]  Yiannis Kompatsiaris,et al.  Gradual transition detection using color coherence and other criteria in a video shot meta-segmentation framework , 2008, 2008 15th IEEE International Conference on Image Processing.

[26]  Marcello Federico,et al.  Domain Adaptation for Statistical Machine Translation with Monolingual Resources , 2009, WMT@EACL.

[27]  Pierre Nugues,et al.  A High-Performance Syntactic and Semantic Dependency Parser , 2010, COLING.

[28]  Ioannis Patras,et al.  Online multi-task learning for semantic concept detection in video , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[29]  Yiannis Kompatsiaris,et al.  News Articles Classification Using Random Forests and Weighted Multimodal Features , 2014, IRFC.

[30]  Gabriella Kazai,et al.  Towards a science of user engagement (Position Paper) , 2011 .