Evaluating the Rationales of Amateur Investors

Social media’s rise in popularity has demonstrated the usefulness of the wisdom of the crowd. Most previous works take into account the law of large numbers and simply average the results extracted from tasks such as opinion mining and sentiment analysis. Few attempt to identify high-quality opinions from the mined results. In this paper, we propose an approach for capturing expert-like rationales from social media platforms without the requirement of the annotated data. By leveraging stylistic and semantic features, our approach achieves an F1-score of 90.81%. The comparison between the rationales of experts and those of the crowd is done from stylistic and semantic perspectives, revealing that stylistic and semantic information provides complementary cues for professional rationales. We further show the advantage of using these superlative analysis results in the financial market, and find that top-ranked opinions identified by our approach increase potential returns by up to 90.31% and reduce downside risk by up to 71.69%, compared with opinions ranked by feedback from social media users. Moreover, the performance of our method on downside risk control is comparable with that of professional analysts.

[1]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[2]  Hsin-Hsi Chen,et al.  Numeral Attachment with Auxiliary Tasks , 2019, SIGIR.

[3]  Rada Mihalcea,et al.  CASCADE: Contextual Sarcasm Detection in Online Discussion Forums , 2018, COLING.

[4]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[5]  Jia Wang,et al.  Recommendation in Internet Forums and Blogs , 2010, ACL.

[6]  Amit P. Sheth,et al.  Multi-Task Learning Framework for Mining Crowd Intelligence towards Clinical Treatment , 2018, NAACL.

[7]  Ramit Sawhney,et al.  #YouToo? Detection of Personal Recollections of Sexual Harassment on Social Media , 2019, ACL.

[8]  Chung-Chi Chen,et al.  NTUSD-Fin: A Market Sentiment Dictionary for Financial Social Media Data Applications , 2018 .

[9]  André Freitas,et al.  SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs and News , 2017, *SEMEVAL.

[10]  Genta Indra Winata,et al.  Learning to Learn Sales Prediction with Social Media Sentiment , 2019 .

[11]  Nazli Goharian,et al.  Depression and Self-Harm Risk Assessment in Online Forums , 2017, EMNLP.

[12]  M A Just,et al.  A theory of reading: from eye fixations to comprehension. , 1980, Psychological review.

[13]  Chris Reed,et al.  Argument Mining: A Survey , 2020, Computational Linguistics.

[14]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[15]  Wei Wang,et al.  Enhancing Air Quality Prediction with Social Media and Natural Language Processing , 2019, ACL.

[16]  Nikolaos Aletras,et al.  Automatically Identifying Complaints in Social Media , 2019, ACL.

[17]  Lyle H. Ungar,et al.  Identifying Locus of Control in Social Media Language , 2018, EMNLP.

[18]  Hsin-Hsi Chen,et al.  Numeral Understanding in Financial Tweets for Fine-Grained Crowd-Based Forecasting , 2018, 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI).

[20]  Yi Li,et al.  Is This Post Persuasive? Ranking Argumentative Comments in Online Forum , 2016, ACL.

[21]  Vincent Ng,et al.  Modeling and Prediction of Online Product Review Helpfulness: A Survey , 2018, ACL.

[22]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[23]  Kathleen M. Carley,et al.  Tree LSTMs with Convolution Units to Predict Stance and Rumor Veracity in Social Media Conversations , 2019, ACL.

[24]  Marie-Francine Moens,et al.  Argumentation mining , 2011, Artificial Intelligence and Law.

[25]  Sameena Shah,et al.  Learning Stock Market Sentiment Lexicon and Sentiment-Oriented Word Vector from StockTwits , 2017, CoNLL.

[26]  Berlin Chen,et al.  Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain-specific texts , 2019, Natural Language Engineering.

[27]  Claire Cardie,et al.  Exploring the Role of Argument Structure in Online Debate Persuasion , 2020, EMNLP.

[28]  Japinder Singh,et al.  Feature-based opinion mining and ranking , 2012, J. Comput. Syst. Sci..

[29]  Malvina Nissim,et al.  You Write like You Eat: Stylistic Variation as a Predictor of Social Stratification , 2019, ACL.

[30]  Serena Villata,et al.  Five Years of Argument Mining: a Data-driven Analysis , 2018, IJCAI.

[31]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[32]  Shay B. Cohen,et al.  Stock Movement Prediction from Tweets and Historical Prices , 2018, ACL.

[33]  Karin Baier,et al.  The Uses Of Argument , 2016 .

[34]  Hsin-Hsi Chen,et al.  NumClaim: Investor's Fine-grained Claim Detection , 2020, CIKM.

[35]  Manfred Stede,et al.  From Argument Diagrams to Argumentation Mining in Texts: A Survey , 2013, Int. J. Cogn. Informatics Nat. Intell..

[36]  Xin Li,et al.  Your Style Your Identity: Leveraging Writing and Photography Styles for Drug Trafficker Identification in Darknet Markets over Attributed Heterogeneous Information Network , 2019, WWW.

[37]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[38]  Thomas Blake,et al.  Communication and Bargaining Breakdown: An Empirical Analysis , 2020, EC.

[39]  Hsin-Hsi Chen,et al.  Crowd View: Converting Investors' Opinions into Indicators , 2019, IJCAI.

[40]  Benno Stein,et al.  Using Argument Mining to Assess the Argumentation Quality of Essays , 2016, COLING.

[41]  Sebastian Riedel,et al.  Numeracy for Language Models: Evaluating and Improving their Ability to Predict Numbers , 2018, ACL.