Active learning in annotating micro-blogs dealing with e-reputation

Elections unleash strong political views on Twitter, but what do people really think about politics? Opinion and trend mining on micro-blogs dealing with politics has recently attracted researchers in several fields, including Information Retrieval and Machine Learning (ML). Since the performance of ML and Natural Language Processing (NLP) approaches is limited by the amount and quality of available data, one promising alternative for some tasks is the automatic propagation of expert annotations. This paper develops a so-called active learning process for automatically annotating French-language tweets that deal with the image (i.e., representation, web reputation) of politicians. Our main focus is on the methodology followed to build an original annotated dataset expressing opinions about two French politicians over time. We therefore review state-of-the-art NLP-based ML algorithms that automatically annotate tweets, using a manual initiation step as bootstrap. The paper focuses on key issues in active learning while building a large annotated dataset, namely the noise introduced by human annotators, the abundance of data, and the label distribution across data and entities. In turn, we show that Twitter characteristics such as the author's name or hashtags can serve as a bearing point not only to improve automatic systems for Opinion Mining (OM) and Topic Classification but also to reduce noise in human annotations. However, a later thorough analysis shows that reducing noise might induce the loss of crucial information.
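The abstract does not specify which classifier or query strategy the authors use, so the following is only a minimal sketch of a generic pool-based active learning loop with a manual bootstrap step. It assumes a small multinomial Naive Bayes model and least-confidence sampling as stand-ins; the function names (`train_nb`, `predict_proba`, `active_learning`), the `oracle` callback that plays the role of the human annotator, and the toy English tokens are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict

def train_nb(labeled):
    """Fit a multinomial Naive Bayes model on (tokens, label) pairs."""
    class_counts = Counter(label for _, label in labeled)
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in labeled:
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab

def predict_proba(model, tokens):
    """Return {label: posterior probability}, using Laplace smoothing."""
    class_counts, word_counts, vocab = model
    total = sum(class_counts.values())
    log_scores = {}
    for label, n in class_counts.items():
        denom = sum(word_counts[label].values()) + len(vocab)
        score = math.log(n / total)
        for tok in tokens:
            if tok in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][tok] + 1) / denom)
        log_scores[label] = score
    peak = max(log_scores.values())  # subtract the max for numerical stability
    exp_scores = {l: math.exp(s - peak) for l, s in log_scores.items()}
    norm = sum(exp_scores.values())
    return {l: v / norm for l, v in exp_scores.items()}

def active_learning(seed, pool, oracle, rounds=3, queries_per_round=1):
    """Grow the labeled set by querying the oracle on the least confident items.

    seed:   manually annotated (tokens, label) pairs used as bootstrap
    pool:   unlabeled token lists
    oracle: callable standing in for the human annotator (an assumption)
    Returns (labeled, auto), where auto holds the model's propagated labels
    for whatever remains in the pool.
    """
    labeled, pool = list(seed), list(pool)
    for _ in range(rounds):
        if not pool:
            break
        model = train_nb(labeled)
        # Least-confidence sampling: lowest top-class probability first.
        pool.sort(key=lambda toks: max(predict_proba(model, toks).values()))
        queried, pool = pool[:queries_per_round], pool[queries_per_round:]
        labeled.extend((toks, oracle(toks)) for toks in queried)
    model = train_nb(labeled)
    auto = []
    for toks in pool:
        probs = predict_proba(model, toks)
        auto.append((toks, max(probs, key=probs.get)))
    return labeled, auto

# Toy illustration with invented data (not from the paper's corpus).
seed = [(["great", "speech"], "positive"), (["terrible", "scandal"], "negative")]
pool = [["great", "debate"], ["scandal", "again"], ["debate", "tonight"]]
oracle = lambda toks: "negative" if "scandal" in toks else "positive"
labeled, auto = active_learning(seed, pool, oracle, rounds=1)
```

In this sketch the annotator is asked only about the items the model is least sure of, and the remaining items receive propagated labels. A real setup would also have to handle the annotator noise and skewed label distribution that the abstract highlights, which this toy loop does not attempt.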
