Objective Assessment of Subjective Tasks in Crowdsourcing Applications

Labelling, or annotation, is the process by which we assign labels to an item with regards to a task. In some Artificial Intelligence problems, such as Computer Vision tasks, the goal is to obtain objective labels. However, in problems such as text and sentiment analysis, subjective labelling is often required. More so when the sentiment analysis deals with actual emotions instead of polarity (positive/negative) . Scientists employ human experts to create these labels, but it is costly and time consuming. Crowdsourcing enables researchers to utilise non-expert knowledge for scientific tasks. From image analysis to semantic annotation, interested researchers can gather a large sample of answers via crowdsourcing platforms in a timely manner. However, non-expert contributions often need to be thoroughly assessed, particularly so when a task is subjective. Researchers have traditionally used ‘Gold Standard’, ‘Thresholding’ and ‘Majority Voting’ as methods to filter non-expert contributions. We argue that these methods are unsuitable for subjective tasks, such as lexicon acquisition and sentiment analysis. We discuss subjectivity in human centered tasks and present a filtering method that defines quality contributors, based on a set of objectively infused terms in a lexicon acquisition task. We evaluate our method against an established lexicon, the diversity of emotions - i.e. subjectivity- and the exclusion of contributions. Our proposed objective evaluation method can be used to assess contributors in subjective tasks that will provide domain agnostic, quality results, with at least 7% improvement over traditional methods.

[1]  Stefano Tranquillini,et al.  Keep it simple: reward and task design in crowdsourcing , 2013, CHItaly '13.

[2]  P. Ekman Universal facial expressions of emotion. , 1970 .

[3]  Yongtae Park,et al.  Review-based measurement of customer satisfaction in mobile service: Sentiment analysis and VIKOR approach , 2014, Expert Syst. Appl..

[4]  Hillary Anger Elfenbein,et al.  On the universality and cultural specificity of emotion recognition: a meta-analysis. , 2002, Psychological bulletin.

[5]  Robert P. Schumaker,et al.  Predicting wins and spread in the Premier League using a sentiment analysis of twitter , 2016, Decis. Support Syst..

[6]  Ruifang Liu,et al.  End-to-End Deep Memory Network for Visual-Textual Sentiment Analysis , 2018, 2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC).

[7]  Lora Aroyo,et al.  Truth Is a Lie: Crowd Truth and the Seven Myths of Human Annotation , 2015, AI Mag..

[8]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[9]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[10]  M. Potenza,et al.  Functional and structural neural alterations in Internet gaming disorder: A systematic review and meta-analysis , 2017, Neuroscience & Biobehavioral Reviews.

[11]  Saif Mohammad,et al.  Capturing Reliable Fine-Grained Sentiment Associations by Crowdsourcing and Best–Worst Scaling , 2016, NAACL.

[12]  Long Chen,et al.  Weakly-Supervised Deep Embedding for Product Review Sentiment Analysis , 2018, IEEE Transactions on Knowledge and Data Engineering.

[13]  Elena Paslaru Bontas Simperl,et al.  Crowdsourcing for Beyond Polarity Sentiment Analysis A Pure Emotion Lexicon , 2017, ArXiv.

[14]  Rada Mihalcea,et al.  Learning Multilingual Subjective Language via Cross-Lingual Projections , 2007, ACL.

[15]  M L Phillips,et al.  Gender Differences in the Sensitivity to Negative Stimuli: Cross-Modal Affective Priming Study☆ , 2013, European Psychiatry.

[16]  G. Pourtois,et al.  The perception and categorisation of emotional stimuli: A review , 2010 .

[17]  Pushpak Bhattacharyya,et al.  Deep Ensemble Model with the Fusion of Character, Word and Lexicon Level Information for Emotion and Sentiment Prediction , 2018, ICONIP.

[18]  Choochart Haruechaiyasak,et al.  Discovering Consumer Insight from Twitter via Sentiment Analysis , 2012, J. Univers. Comput. Sci..

[19]  R. Nisbett,et al.  Culture and Aesthetic Preference: Comparing the Attention to Context of East Asians and Americans , 2008, Personality & social psychology bulletin.

[20]  Jonathan A. Smith Qualitative Psychology: A Practical Guide to Research Methods , 2006, QMiP Bulletin.

[21]  Aniket Kittur,et al.  Crowdsourcing user studies with Mechanical Turk , 2008, CHI.

[22]  Satoshi Nakamura,et al.  Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing , 2018, LREC.

[23]  Carsten Eickhoff,et al.  Cognitive Biases in Crowdsourcing , 2018, WSDM.

[24]  Jeffrey N. Rouder,et al.  The interplay between subjectivity, statistical practice, and psychological science , 2016 .

[25]  Francisco Herrera,et al.  Distinguishing between facts and opinions for sentiment analysis: Survey and challenges , 2018, Inf. Fusion.

[26]  Wendi L. Adair,et al.  Watch Your Tone … Relational Paralinguistic Messages in Negotiation , 2013 .

[27]  Tommaso Caselli,et al.  Temporal Information Annotation: Crowd vs. Experts , 2016, LREC.

[28]  Sajib Hasan,et al.  Emotion Detection from Speech Signals using Voting Mechanism on Classified Frames , 2019, 2019 International Conference on Robotics,Electrical and Signal Processing Techniques (ICREST).

[29]  Nicole Novielli,et al.  EmoTxt: A toolkit for emotion recognition from text , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW).

[30]  Dennis E Reidy,et al.  Gender Role Discrepancy Stress, High-Risk Sexual Behavior, and Sexually Transmitted Disease , 2016, Archives of sexual behavior.

[31]  R. Plutchik A GENERAL PSYCHOEVOLUTIONARY THEORY OF EMOTION , 1980 .

[32]  T. Chaplin,et al.  Gender and Emotion Expression: A Developmental Contextual Perspective , 2015, Emotion review : journal of the International Society for Research on Emotion.

[33]  K. Rayner,et al.  Eye movements during reading: some current controversies , 2001, Trends in Cognitive Sciences.

[34]  Dimosthenis Kontogiorgos,et al.  Crowdsourced Multimodal Corpora Collection Tool , 2018, LREC.

[35]  P. Ekman,et al.  Constants across cultures in the face and emotion. , 1971, Journal of personality and social psychology.

[36]  Kalina Bontcheva,et al.  Challenges of Evaluating Sentiment Analysis Tools on Social Media , 2016, LREC.

[37]  Rada Mihalcea,et al.  Sentiment Analysis , 2014, Encyclopedia of Social Network Analysis and Mining.

[38]  Serkan Ayvaz,et al.  Sentiment analysis on Twitter: A text mining approach to the Syrian refugee crisis , 2018, Telematics Informatics.

[39]  Rudy Prabowo,et al.  Sentiment analysis: A combined approach , 2009, J. Informetrics.

[40]  Judith A. Hall,et al.  The Social Psychology of Perceiving Others Accurately , 2016 .

[41]  Stefan Palan,et al.  Prolific.ac—A subject pool for online experiments , 2017 .

[42]  B. J. Casey,et al.  vlPFC–vmPFC–Amygdala Interactions Underlie Age-Related Differences in Cognitive Regulation of Emotion , 2016, Cerebral cortex.

[43]  Weitong Chen,et al.  A survey of sentiment analysis in social media , 2018, Knowledge and Information Systems.

[44]  Daniel E. O'Leary,et al.  On the relationship between number of votes and sentiment in crowdsourcing ideas and comments for innovation: A case study of Canada's digital compass , 2016, Decis. Support Syst..

[45]  W. P. Lehmann Subjectivity , 2015, Encyclopedia of Evolutionary Psychological Science.

[46]  M. Frank,et al.  Emotional Language and Political Aggression , 2013 .

[47]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[48]  T. Teo,et al.  Homo neoliberalus: From personality to forms of subjectivity , 2018, Theory & Psychology.

[49]  Erik Cambria,et al.  Modeling Inter-Aspect Dependencies for Aspect-Based Sentiment Analysis , 2018, NAACL.

[50]  L. Canetti,et al.  Food and emotion , 2002, Behavioural Processes.

[51]  Malvina Nissim,et al.  Sentiment Polarity Classification at EVALITA: Lessons Learned and Open Challenges , 2018, IEEE Transactions on Affective Computing.

[52]  Guido Caldarelli,et al.  S 1 Appendix , 2016 .

[53]  H. A. Elfenbein Emotional dialects in the language of emotion , 2017 .

[54]  M. Knapp,et al.  Nonverbal communication in human interaction , 1972 .

[55]  Flavius Frasincar,et al.  Supervised and Unsupervised Aspect Category Detection for Sentiment Analysis with Co-occurrence Data , 2018, IEEE Transactions on Cybernetics.

[56]  Raymond Chiong,et al.  Multi-PSO based Classifier Selection and Parameter Optimisation for Sentiment Polarity Prediction , 2018, 2018 IEEE Conference on Big Data and Analytics (ICBDA).

[57]  Raymond Y. K. Lau,et al.  Social analytics: Learning fuzzy product ontologies for aspect-oriented sentiment analysis , 2014, Decis. Support Syst..

[58]  A. Acquisti,et al.  Beyond the Turk: Alternative Platforms for Crowdsourcing Behavioral Research , 2016 .

[59]  L. Pessoa,et al.  Emotion processing and the amygdala: from a 'low road' to 'many roads' of evaluating biological significance , 2010, Nature Reviews Neuroscience.

[60]  Lars Hetmank,et al.  Components and Functions of Crowdsourcing Systems - A Systematic Literature Review , 2013, Wirtschaftsinformatik.

[61]  Kevin B Paterson,et al.  Eye movements during reading and topic scanning: effects of word frequency. , 2015, Journal of experimental psychology. Human perception and performance.

[62]  D. Cox,et al.  Selective processing of food words during insulin-induced hypoglycemia in healthy humans , 2004, Psychopharmacology.

[63]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[64]  Shampa Chakraverty,et al.  An Approach to Track Context Switches in Sentiment Analysis , 2018 .

[65]  Francisco Herrera,et al.  Consensus vote models for detecting and filtering neutrality in sentiment analysis , 2018, Inf. Fusion.

[66]  Paolo Rosso,et al.  SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter , 2015, *SEMEVAL.

[67]  Svetlana Alexeeva,et al.  An Opinion Word Lexicon and a Training Dataset for Russian Sentiment Analysis of Social Media , 2016 .

[68]  Francisco Javier González-Castaño,et al.  Creating emoji lexica from unsupervised sentiment analysis of their descriptions , 2018, Expert Syst. Appl..

[69]  Christian Wagner,et al.  Paid Crowdsourcing, Low Income Contributors, and Subjectivity , 2019, AIAI.

[70]  Jennifer McCoy,et al.  Mass Partisan Polarization: Measuring a Relational Concept , 2018 .

[71]  Udo Kruschwitz,et al.  Assessing Crowdsourcing Quality through Objective Tasks , 2012, LREC.

[72]  Christian Wagner,et al.  A Multivalued Emotion Lexicon Created and Evaluated by the Crowd , 2018, 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS).