Classifying the Political Leaning of News Articles and Users from User Votes

Social news aggregator services collect readers’ subjective reactions to news and opinion articles. Can we use those reactions to classify articles as liberal or conservative, even without knowing the self-identified political leaning of most users? We applied three semi-supervised learning methods that propagate classifications of political news articles and users as conservative or liberal. They rest on the assumption that liberal users vote for liberal articles more often than for conservative ones, and likewise for conservative users and articles. Starting from a few labeled articles and users, the algorithms propagate political leaning labels to the entire user-article vote graph. In cross-validation, the best algorithm achieved 99.6% accuracy on held-out users and 96.3% accuracy on held-out articles. Adding social data, such as users’ friendship links, or text features, such as cosine similarity, did not improve accuracy. The propagation algorithms, which use only the subjective liking data from users, also outperformed an SVM-based text classifier, which achieved 92.0% accuracy on articles.
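The abstract does not spell out the three propagation methods, so the following is only a minimal sketch of the general idea, not the authors' algorithms. It assumes a binary user-by-article vote matrix and a handful of seed labels (+1 for liberal, -1 for conservative), and it repeatedly sets each article's score to the mean score of its voters and each user's score to the mean score of the articles they voted for, re-clamping the seeds after every step. The function names `propagate_labels` and `to_label`, the toy matrix, and the iteration count are all hypothetical choices made for illustration.

```python
import numpy as np

def propagate_labels(votes, user_seeds, article_seeds, iterations=50):
    """Illustrative label propagation on a bipartite user-article vote graph.

    votes: (n_users, n_articles) array, 1 where a user voted for an article.
    user_seeds / article_seeds: dict mapping index -> +1 (liberal) or -1 (conservative).
    Returns (user_scores, article_scores); the sign of a score is the predicted leaning.
    """
    votes = np.asarray(votes, dtype=float)
    n_users, n_articles = votes.shape

    # Vote counts per user and per article, floored at 1 to avoid division by zero.
    user_degree = np.maximum(votes.sum(axis=1), 1.0)
    article_degree = np.maximum(votes.sum(axis=0), 1.0)

    user_scores = np.zeros(n_users)
    article_scores = np.zeros(n_articles)
    for i, label in user_seeds.items():
        user_scores[i] = label
    for j, label in article_seeds.items():
        article_scores[j] = label

    for _ in range(iterations):
        # Each article takes the mean score of the users who voted for it.
        article_scores = votes.T @ user_scores / article_degree
        for j, label in article_seeds.items():
            article_scores[j] = label  # keep known article labels fixed

        # Each user takes the mean score of the articles they voted for.
        user_scores = votes @ article_scores / user_degree
        for i, label in user_seeds.items():
            user_scores[i] = label  # keep known user labels fixed

    return user_scores, article_scores

def to_label(score):
    """Map a propagated score to a leaning; a score of exactly 0 stays unlabeled."""
    if score > 0:
        return "liberal"
    if score < 0:
        return "conservative"
    return "unknown"

# Toy data (made up): users 0-1 vote for articles 0-1, users 2-3 for articles 2-3.
toy_votes = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
user_scores, article_scores = propagate_labels(
    toy_votes, user_seeds={0: 1, 3: -1}, article_seeds={}
)
user_labels = [to_label(s) for s in user_scores]
article_labels = [to_label(s) for s in article_scores]
```

In this toy graph the two seeded users are enough to label every other user and article, because each unlabeled node is connected through votes to exactly one seed. Real vote data is noisier, and nodes with no path to any seed keep a score of zero.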
