Semantic knowledge inference from online news media using an LDA-NLP approach

The amount of news delivered by the different media in the current environment can be overwhelming. Although the events being reported are factually the same, the ways with which the news is delivered vary with the media sources involved. In many cases, it is difficult to reliably uncover the latent information hidden within the news reports due to the great diversity of topics and the sheer volume of news. Analysis of the news media has always been of interest to news analysts, politicians and policy makers in order to aggregate and make sense of the information generated every day. News sources try to achieve relevance to their audiences by providing them with news that the audience wants or finds interesting, but often also have implicit motives such as shaping the perceptions of their audience. Although these agendas or target audiences are not explicitly identified, we consider ways in which this information can be inferred by applying the tools of natural language processing and semantic analysis to the news streams from these sources.

[1]  Diego Reforgiato Recupero,et al.  Sentiment Analysis: Adjectives and Adverbs are Better than Adjectives Alone , 2007, ICWSM.

[2]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[3]  Diego Reforgiato Recupero,et al.  The OASYS 2.0 Opinion Analysis System , 2007, ICWSM.

[4]  Thomas L. Griffiths,et al.  A probabilistic approach to semantic representation , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[5]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Henry Lieberman,et al.  A model of textual affect sensing using real-world knowledge , 2003, IUI '03.

[8]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[9]  Dongwoo Kim,et al.  Topic Chains for Understanding a News Corpus , 2011, CICLing.

[10]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[11]  Philip A. Schrodt,et al.  Conflict and Mediation Event Observations (CAMEO): A New Event Data Framework for the Analysis of Foreign Policy Interactions , 2002 .

[12]  C. McClelland World event/interaction survey , 1978 .

[13]  Alice H. Oh,et al.  User Evaluation of a System for Classifying and Displaying Political Viewpoints of Weblogs , 2009, ICWSM.

[14]  R. Lewontin ‘The Selfish Gene’ , 1977, Nature.

[15]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.