Unsupervised Graph-based Topic Modeling from Video Transcriptions

To unfold the tremendous amount of audiovisual data uploaded daily to social media platforms, effective topic modelling techniques are needed. Existing work tends to apply variants of topic models on text data sets. In this paper, we aim at developing a topic extractor on video transcriptions. The model improves coherence by exploiting neural word embeddings through a graph-based clustering method. Unlike typical topic models, this approach works without knowing the true number of topics. Experimental results on the real-life multimodal data set MuSeCaR demonstrates that our approach extracts coherent and meaningful topics, outperforming baseline methods. Furthermore, we successfully demonstrate the generalisability of our approach on a pure text review data set.

[1]  Fabrizio Ferraro,et al.  Structural Cohesion: Visualization and Heuristics for Fast Computation , 2015, J. Soc. Struct..

[2]  Mariana Mocanu,et al.  Clustering Documents using the Document to Vector Model for Dimensionality Reduction , 2020, 2020 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR).

[3]  Alice Baird,et al.  The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements , 2021, IEEE Transactions on Affective Computing.

[4]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[5]  Michael Röder,et al.  Exploring the Space of Topic Coherence Measures , 2015, WSDM.

[6]  Moula Husain,et al.  Multimodal Fusion of Speech and Text using Semi-supervised LDA for Indexing Lecture Videos , 2019, 2019 National Conference on Communications (NCC).

[7]  Jeffrey Xu Yu,et al.  Influential Community Search in Large Networks , 2015, Proc. VLDB Endow..

[8]  Mark Johnson,et al.  More Efficient Topic Modelling Through a Noun Only Approach , 2015, ALTA.

[9]  Yaxin Bi,et al.  Aggregated topic models for increasing social media topic coherence , 2019, Applied Intelligence.

[10]  Hwee Tou Ng,et al.  An Unsupervised Neural Attention Model for Aspect Extraction , 2017, ACL.

[11]  Timothy Baldwin,et al.  Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality , 2014, EACL.

[12]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[13]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[14]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[15]  Wiebke Wagner,et al.  Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit , 2010, Lang. Resour. Evaluation.

[16]  Leland McInnes,et al.  Accelerated Hierarchical Density Based Clustering , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[17]  Omer Levy,et al.  Improving Distributional Similarity with Lessons Learned from Word Embeddings , 2015, TACL.

[18]  Suzanna Sia,et al.  Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too! , 2020, EMNLP.

[19]  D. Matula k-Components, Clusters and Slicings in Graphs , 1972 .

[20]  Erik Cambria,et al.  Sentiment Analysis and Topic Recognition in Video Transcriptions , 2021, IEEE Intelligent Systems.

[21]  Magnus Sahlgren,et al.  Rethinking Topic Modelling: From Document-Space to Term-Space , 2020, FINDINGS.

[22]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[23]  Mauricio Barahona,et al.  Graph-based Topic Extraction from Vector Embeddings of Text Documents: Application to a Corpus of News Articles , 2020, COMPLEX NETWORKS.

[24]  Asit Kumar Das,et al.  A Graph Based Clustering Approach for Relation Extraction From Crime Data , 2019, IEEE Access.

[25]  Paul J. Kennedy,et al.  An evaluation of document clustering and topic modelling in two online social networks: Twitter and Reddit , 2020, Inf. Process. Manag..

[26]  Jiajun Zhang,et al.  Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video , 2019, IEEE Transactions on Knowledge and Data Engineering.

[27]  Yiannis Kompatsiaris,et al.  MuSe 2020 Challenge and Workshop: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media: Emotional Car Reviews in-the-wild , 2020, MuSe @ ACM Multimedia.

[28]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[29]  Guoying Zhao,et al.  The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress , 2021, MuSe @ ACM Multimedia.

[30]  M. Newman,et al.  Fast Approximation Algorithms for Finding Node-Independent Paths in Networks , 2001 .

[31]  Raymond Chiong,et al.  Multilingual sentiment analysis: from formal to informal and scarce resource languages , 2016, Artificial Intelligence Review.

[32]  Soroush Vosoughi,et al.  An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data , 2020, WNUT.